Download The human testis-specific proteome defined by

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Molecular Human Reproduction, Vol.20, No.6 pp. 476–488, 2014
Advanced Access publication on March 5, 2014 doi:10.1093/molehr/gau018
ORIGINAL RESEARCH
The human testis-specific proteome
defined by transcriptomics and
antibody-based profiling
D. Djureinovic1,2, L. Fagerberg 3, B. Hallström3, A. Danielsson1,2,
C. Lindskog1,2, M. Uhlén 3, and F. Pontén 1,2,*
1
Science for Life Laboratory, Rudbeck Laboratory, Uppsala University, Uppsala SE-751 85, Sweden 2Department of Immunology, Genetics
and Pathology, Rudbeck Laboratory, Uppsala University, Uppsala SE-751 85, Sweden 3Science for Life Laboratory, KTH – Royal Institute
of Technology, Stockholm SE-171 21, Sweden
*Correspondence address. Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University,
751 85 Uppsala, Sweden. Tel: +46-18-611-3846. E-mail: [email protected]
Submitted on November 25, 2013; resubmitted on February 20, 2014; accepted on February 21, 2014
abstract: The testis’ function is to produce haploid germ cells necessary for reproduction. Here we have combined a genome-wide transcriptomics analysis with immunohistochemistry-based protein profiling to characterize the molecular components of the testis. Deep sequencing (RNA-Seq) of normal human testicular tissue from seven individuals was performed and compared with 26 other normal human tissue types.
All 20 050 putative human genes were classified into categories based on expression patterns. The analysis shows that testis is the tissue with the
most tissue-specific genes by far. More than 1000 genes show a testis-enriched expression pattern in testis when compared with all other analyzed
tissues. Highly testis enriched genes were further characterized with respect to protein localization within the testis, such as spermatogonia, spermatocytes, spermatids, sperm, Sertoli cells and Leydig cells. Here we present an immunohistochemistry-based analysis, showing the localization
of corresponding proteins in different cell types and various stages of spermatogenesis, for 62 genes expressed at .50-fold higher levels in testis
when compared with other tissues. A large fraction of these genes were unexpectedly expressed in early stages of spermatogenesis. In conclusion,
we have applied a genome-wide analysis to identify the human testis-specific proteome using transcriptomics and antibody-based protein
profiling, providing lists of genes expressed in a tissue-enriched manner in the testis. The majority of these genes and proteins were previously
poorly characterised in terms of localization and function, and our list provides an important starting point to increase our molecular understanding of human reproductive biology and disease.
Key words: immunohistochemistry / RNA sequencing / spermatogenesis / testis / tissue specificity
Introduction
The main function of the testis is to produce haploid germ cells necessary for our reproduction. This is achieved through a process that
requires high fidelity throughout the life of fertile males. Spermatogenesis is a stepwise and highly coordinated series of events that start in
cells located in the basal region of the seminiferous duct, where selfrenewing spermatogonial stem cells mitotically divide into spermatogonia. After several rounds of mitosis, the spermatogonia develop
into primary and secondary spermatocytes that divide by meiosis to
form spermatids. The final stage of spermatogenesis occurs in the
luminal region of the seminiferous duct where spermatids differentiate
into sperm. The various phases of spermatogenesis are distinguishable
due to the morphologically different cell types present at each stage,
since each cell type being closely associated with a particular phase
of spermatogenesis (Russell et al., 1990). In addition to germ cells,
the Sertoli cell is intimately involved in all stages of spermatogenesis.
This somatic cell, which lies interspersed throughout the seminiferous
duct, regulates both the development and the number and quality of
spermatogenic cells (Griswold 1998). Approximately 90% of the cell
mass in the testis is occupied by seminiferous ducts. In addition to
the generation of sperm, the testis is crucial for synthesis of androgens
that are necessary for spermatogenesis but also play a key role in
the development of male sex characteristics. The main production
of androgens, primarily testosterone, is localized in the testicular
Leydig cell.
The highly specialized functions, specific events and cell structure of
the testis endow a uniqueness to this particular tissue type. Although
spermatogenesis has been studied in depth and the different stages of
sperm maturation have been well defined, a large fraction of the genes
involved in the different stages are yet unknown. A detailed knowledge
regarding the molecular repertoire of the different cell types in the
testis is essential to understand the complex interaction under normal
and pathological conditions.
& The Author 2014. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved.
For Permissions, please email: [email protected]
477
The human testis-specific proteome
The development of gene expression profiling techniques has allowed
for characterization of tissues with high resolution. Several previous
studies, based on expressed sequence tags (ESTs) and various microarray studies, have identified an unusual and diverse set of genes
expressed in testes. A large number of these genes were shown to be
unique to spermatogenic cells in both mouse (Kerr et al., 1994; Hoog,
1995) and humans (Pawlak et al., 1995; Penttilä et al., 1995). Studies
using Next Generation Sequencing technology (RNA-Seq) have also
been employed on testicular tissue. In a recent study, 10 different
types of human normal tissues showed that a higher fraction of genes
and more diverse mRNA populations (less dominance of a few highly
expressed genes) were expressed in the testis when compared with
other tissue types (Ramsköld et al., 2009). Although RNA-Seq and
other gene expression profiling techniques offer several advantages
(Mortazavi et al., 2008; Wang et al., 2009), the acquired data from the
transcriptomics analyses based on a heterogeneous tissue material
does not provide information regarding the expression pattern of corresponding proteins, e.g. what particular cell types express a specific
protein or in what subcellular compartment a specific protein is
expressed.
Here we analyzed genes expressed in normal human testis and these
data are compared with the transcriptome of 26 other human tissue and
organ types based on recently published RNA-seq data (Fagerberg et al.,
2014). Using this approach we not only detected all genes expressed in
the adult testis, but also identified genes enriched in the testis versus
other organs and tissue types. In order to translate the mRNA expression data to protein levels and to determine cell type-specific expression,
the proteins corresponding to the tissue-specific genes were studied
using the immunohistochemistry images at the Human Protein Atlas
(www.proteinatlas.org) (Uhlén et al., 2005, 2010; Pontén et al., 2011).
The testis-specific transcriptome and the identified cell type-specific proteins in normal human testis can serve as a starting point for further
disease-specific research.
Materials and Methods
Patient characteristics
The tissue samples used for transcript profiling of human testis consisted of
histologically normal tissue from surgical specimens, including testis from
four patients with cancer (two seminoma (Patient 1, age 34 years and
Patient 2, age 37 years), one embryonal cancer (Patient 3, age 26 years)
and one atypical plasmocytoma (Patient 4, age 56 years), two patients with
only scar tissue (Patient 5, age 62 years and Patient 6, age 50 years) and
one patient with focal inflammation (Patient 7, age 34 years). The corresponding histology of each case is shown in Supplementary data, Fig. S1.
Transcript profiling (RNA-Seq)
The use of human tissue samples was approved by the Uppsala Ethical Review
Board (Ref # 2011/473). Tissues samples, collected within the infrastructure
of an established biobank, were embedded in Optimal Cutting Temperature
(O.C.T.) compound and stored at 2808C. A hematoxylin– eosin-stained
frozen section (4 mm) was prepared from each sample using a cryostat and
the CryoJanew Tape-Transfer System (Instrumedics, St. Louis, MO, USA).
Each slide was examined by a pathologist to ensure accurate tissue morphology. Three sections (10 mm) were cut from each frozen tissue block and
homogenized prior to extraction of total RNA, using the RNeasy Mini Kit
(Qiagen, Hilden, Germany) according to the manufacturer’s instructions.
The extracted RNA samples were analyzed using either an Agilent 2100 Bioanalyzer system (Agilent Biotechnologies, Palo Alto, USA) with the RNA 6000
Nano Labchip Kit or an Experion automated electrophoresis system
(Bio-Rad Laboratories, Hercules, CA, USA) with the standard-sensitivity
RNA chip. Only samples of high-quality RNA (RNA Integrity Number
≥7.5) were used in the following mRNA sample preparation for sequencing.
mRNA sequencing was performed on Illumina HiSeq2000 and 2500
machines (Illumina, San Diego, CA, USA) using the standard Illumina
RNA-Seq protocol with a read length of 2 × 100 bases.
Analysis of data
The raw reads from 20 050 transcripts obtained from the sequencing system
were trimmed for low-quality ends with the software Sickle, using a phred
quality threshold of 20. All reads shorter than 54 bp after the trimming
were discarded. The processed reads were mapped to the GRCh37
version of the human genome with Tophat v2.0.3 (Trapnell et al., 2010).
Potential PCR duplicates were eliminated by applying the MarkDuplicates
module of Picard 1.77. To obtain quantification scores for all human genes,
FPKM (fragments per kilobase of exon model per million mapped reads)
values were calculated with Cufflinks v2.0.2 (Trapnell et al., 2010), which corrects for transcript length and the total number of mapped reads from the
library to compensate for different read depths for different samples. The
gene models from Ensembl build 69 (Flicek et al., 2012) were used in Cufflinks. In addition to Cufflinks, HTSeq v0.5.1 was run to calculate read
counts for each gene, which were used for analyses of differentially expressed
genes utilizing the DESeq package (Anders and Huber 2010; Flicek et al.,
2012). All data were analyzed with R Statistical Environment (RCoreTeam
2013) with the addition of the package ‘gplots’. A network analysis was performed using Cytoscape 3.0 (Shannon et al., 2003). For analyses performed in
this study where a log2-scale of the data was used, pseudo-counts of +1
were added to the data set.
Specificity classification
The average FPKM value of all individual samples for each tissue was used to
estimate the gene expression level. A cutoff value of 1 FPKM was used as the
detection limit. Each of the 20 050 gene transcripts were classified into one of
eight categories based on the FPKM levels: (i) ‘Not detected’—,1 FPKM
in testis; (ii) ‘highly testis enriched’—50-fold higher FPKM level in testis
compared with all other tissues; (iii) ‘moderately testis enriched’—5-fold
higher FPKM level in testis compared with all other tissues; (iv) ‘group
enriched’—5-fold higher average FPKM level in a group of 2 – 7 tissues
including testis compared with all other tissues; (v) ‘expressed in all low’—
detected in 27 tissues and at least one tissue ,10 FPKM; (vi) ‘expressed
in all high’—detected in 27 tissues and all tissues .10 FPKM; (vii) ‘testis
enhanced’—5-fold higher FPKM level in testis compared with the average
FPKM value of all 27 tissues; (viii) ‘mixed’—genes expressed in 1 – 26
tissues and in none of the above categories. The ‘testis-specific score’ was
defined as the Testis FPKM divided by the maximum FPKM value in any of
the other 26 tissues.
Hierarchical clustering
FPKM values for all 20 050 genes were used to calculate a correlation matrix
based on Spearman’s rank correlation coefficient for each pairwise combination of individual samples (n ¼ 7). The correlation matrix was used for
unsupervised hierarchical clustering analyses of individual samples using
the average linkage method to measure distances between clusters. The
results were visualized in a heatmap of all pairwise correlation coefficients.
478
Djureinovic et al.
Gene ontology analysis
Data availability
A gene ontology (GO) analysis (Ashburner et al., 2000) was performed using
the GOrilla tool (Eden et al., 2009) in order to determine overrepresented
GO categories in the gene set of tissue-enriched genes. For the cellular component analysis the GOSlim GOA associations were used to determine
whether genes were extracellular, intracellular or membrane bound. The
number of genes for each term was counted, allowing a gene to be associated
with more than one term. A list of all genes analyzed in this study was used as
the background list in GOrilla.
All the data (FPKM values for all the samples) will be available for downloads
without any restrictions (www.proteinatlas.org/about/download). The
primary data (reads) are available through the Array Express Archive (www.
ebi.ac.uk/arrayexpress/) under the accession number: E-MTAB-1733. The
transcript profiling data (FPKM values) for each gene in each cell and tissue
will also be available in the next version of the Human Protein Atlas (www.
proteinatlas.org).
Results
Tissue profiling
Tissue microarrays (TMAs) were generated as previously described (Kampf
et al., 2012), containing 1 mm cores of 46 different normal tissues in triplicates and 216 cancer tissues, representing the 20 most common cancers
(Pontén et al., 2008). The samples were received from the Department of
Pathology, Uppsala University Hospital, Sweden, approved by the Research
Ethics Committee at Uppsala University (Ref # Ups 02 – 577). The immunohistochemical (IHC) staining was performed as previously described (Kampf
et al., 2012). In brief, 4 mm sections of the TMA blocks were cut and mounted
on adhesive slides, baked at 608C for 45 min, followed by deparaffinization in
xylene and hydration in graded alcohols. Blocking for endogenous peroxidase
was performed in 0.3% hydrogen peroxide, and for antigen retrieval, a pressure boiler (Decloaking Chamber, Biocare Medical, Walnut Creek, CA,
USA) was used, boiling the slides for 4 min at 1258C in Citrate buffer, pH 6
(Lab Vision, Freemont, CA, USA). Automated immunohistochemistry was
performed using an Autostainer 480 instrumentw (Lab Vision). Primary antibodies diluted in UltraAb Diluentw (Lab Vision) and the secondary reagent
UltraVision LP HRP polymerw (Lab Vision) were incubated for 30 min
each at room temperature (RT). The slides were developed for 10 min at
RT, adding Diaminobenzidine (Lab Vision) as chromogen, followed by counterstaining with Mayers hematoxylin (Histolab) and mounting with Pertexw
(Histolab). The immunohistochemically stained and mounted slides were
scanned using an Aperio ScanScope XT Slide Scanner (Aperio Technologies,
Vista, CA, USA) for generation of high-resolution digital whole slide images,
followed by annotation by certified pathologists. In brief, the manual score of
IHC-based protein expression was determined as the fraction of positive cells
defined in different tissues: 0 ¼ 0 – 1%, 1 ¼ 2 – 25%, 2 ¼ 26 – 75%, 3.75%
and intensity of immunoreactivity: 0 ¼ negative, 1 ¼ weak, 2 ¼ moderate
and 3 ¼ strong staining. All of the tissues used as donor blocks were acquired
from the archives at the Department of Pathology of Uppsala University Hospital in agreement with approval from the Research Ethics Committee at
Uppsala University (Ref # Ups 02-577).
All annotation data together with validation of the antibodies is publically
available at the website www.proteinatlas.org (Uhlen et al., 2010; Pontén
et al., 2011), with the current version containing protein expression profiles
from 21 984 antibodies, corresponding to 16 621 genes.
Antibodies
Primary antibodies used for shown examples of IHC stainings include
HPA027850 (DMRT1), HPA011122 (PASD1), HPA003473 (PAGE1),
HPA036490 (DMRTB1), HPA059199 (C3ORF22), HPA019777 (DAZL),
HPA041915 (TEX101), HPA036124 (SHCBP1L), HPA049690 (TPTE),
HPA011284 (FMR1NB), HPA044387 (TNP1), HPA035105 (SMCP),
HPA021803 (ACTL7B), HPA038719 (ACRV1), HPA056530 (TMCO5A),
HPA056386 (PRM2), HPA042666 (GAPDHS), HPA05949 (AKAP4),
HPA049917 (C2ORF57), HPA048927 (C10ORF40), HPA054126 (SLCO6A1),
HPA034604 (FATE1), HPA043059 (DEFB119) and HPA028615 (INSL3)
have been developed within the Human Protein Atlas and are available
from AtlasAntibodies, Stockholm, Sweden.
The transcriptomic analysis of testicular tissue
The transcriptomes of 27 different human organ- and tissue-types were
analyzed using next generation sequencing (RNA-Seq) based on specimens from altogether 95 individuals (Fagerberg et al., 2014). The normalized mRNA levels were determined for each sample, calculated as FPKM
values where 1 FPKM roughly corresponds to one mRNA molecule per
average cell in the sample (Hebenstreit et al., 2011). Testicular tissues
from seven individuals (age 26– 62 years) were included in the analysis.
Microscopical evaluation of cryosections showed normal appearing testicular tissue with intact spermatogenesis, with only rare inflammation
and without contamination of tumor cells (Supplementary data, Fig. S1).
A total of 15 224 genes were detected in testis, with a mean expression value of .1 FPKM (with a variation from 15 397 genes in individual
2 to 15 039 genes in individual 3). Approximately 77% of all putative
protein coding genes (n ¼ 20 050) were detected in testis and of the
27 different tissues analyzed, the testis had the largest number of
detected genes.
The distribution of the genes detected in testis ranged from the
maximum of 4956 FPKM (the mitochondrially encoded cytochrome
c oxidase III gene) down to 1 FPKM to give a dynamic range of 5000.
The 30 genes with the highest levels of expression in the testis are
listed in Supplementary data, Table SI, which shows that a majority of
these genes are those with ‘house-keeping’ functions, such as genes
coding for mitochondrial and ribosomal proteins.
Analysis of the biological variation between the seven individual testis
samples was performed for the expression levels of all the protein-coding
genes in pairwise correlation plots and hierarchical clustering analysis.
The correlation overall was high with Spearman’s coefficients from
0.99 (Fig. 1A) to 0.91 (Fig. 1B). The average correlation coefficient for
all the seven testis samples was 0.96 (Supplementary data, Table S2,
Supplementary data, Fig. S2). These results demonstrate high technical
reproducibility between the testis samples, but also show low interindividual variation across the genome-wide expression pattern,
although variations due to isoform differences of the transcripts cannot
be ruled out.
As expected, similar comparisons between testis and other tissue
types show a higher degree of variation. The comparison of the testis
samples with data from the other 26 tissue types shows that the tissue
with the highest similarity to testis was the ovary with a Spearman coefficient of 0.75 (Fig. 1C), while the lowest correlation is found between
testis and liver with a Spearman coefficient of 0.65 (Fig. 1D).
Classification of the genes expressed in testis
The transcriptomics data obtained from the 27 tissues enabled us to
classify all the 20 050 protein-encoding genes into four major categories
The human testis-specific proteome
479
Figure 1 Correlation between samples and tissues. Scatter plot of average fragments per kilobase of exon model per million mapped reads (FPKM) values
for all detected genes comparing the two samples of normal testis with the highest correlation (A) and the two individuals with lowest correlation (B). Similar
analysis with a scatter plot showing the average FPKM values for the seven testis samples compared with the most similar other tissue type, ovary (C) and
most dissimilar other tissue type, liver (D). Color code: not detected (grey), highly testis enriched, moderately testis enriched, group enriched or testis
enhanced (blue), mixed (green), expressed in all, low (red), expressed in all, high (dark red). The correlation scores are given as pairwise Spearman’s correlation coefficients between the respective individuals or tissues.
based on their expression levels in testis (Fig. 2A). The major categories
included (i) genes that were not detected in testis (23%), (ii) genes that
showed a mixed expression pattern, being expressed in several, but
not in all tissue types (21%), (iii) genes that were expressed in all
tissues (46%) and (iv) genes that were categorized to have elevated
levels of expression in testis when compared with the average expression
levels in other tissue types (10%). The latter tissue-specific genes (n ¼
1934) were further divided into four subcategories depending on
degree of tissue specificity. Three hundred and sixty-four genes were
defined as highly enriched in testis (Supplementary data, Table SIII), characterized as the highest level of tissue-specificity with at least 50-fold
higher FPKM level in testis compared with any other tissue type. The
top 50 highly enriched genes are displayed in Table I, with the tissuespecific score calculated as the ratio between the average FPKM in the
seven testis tissues and the maximal FPKM in all the other tissues.
Seven hundred and sixty genes in testis were defined as moderately
enriched in testis (Supplementary data, Table SIV), with at least 5-fold
higher FPKM level in testis compared with all other tissues and 254
genes were defined as group enriched (Supplementary data, Table
SIV), with 5-fold higher average FPKM level in a group of 2 –7 tissues
including testis compared with all other tissues. In addition, 556 genes
were identified as testis enhanced (Supplementary data, Table SIV),
defined as having a 5-fold higher FPKM level in testis when compared
with the average FPKM value of all the 27 tissues.
From the transcriptomics analysis, it is possible to estimate the relative
mRNA pool for each of the categories, as shown in Fig. 2B. Almost
80% of the transcripts in the testis correspond to genes with potential
‘house-keeping’ functions expressed also in all other tissues analyzed.
This suggests that despite the highly specialized function of testis, most
of the transcription apparatus is still devoted to ‘house-hold’ functions
necessary for all types of cells (Fig. 2C, Supplementary data, Table SI).
An analysis of the testis-enriched genes shows that the mRNA pool constitutes 14% of the transcripts, with most of these corresponding to
genes highly (6%) or moderately (5%) enriched in testis. The expression
480
Djureinovic et al.
Figure 2 Classification of all human protein-coding genes and gene expression levels for highly abundant, highly enriched and group-enriched genes in
testis. Pie chart showing the distribution of gene expression comprising all 20 050 human putative protein-coding genes in testis (A). The pie chart is divided
into four major categories based on the transcript abundance in analyzed tissues, including not detected in testis (grey), expressed in all tissues (red color
shades), mixed expression with genes expressed in a large subset, but not all tissues (green) and genes with elevated expression in testis (blue color shades).
The genes with elevated expression in testis are further subdivided dependent on the degree of specificity as highly testis-enriched genes (light blue), moderately enriched in testis (blue), group enriched in testis (darker blue) and genes enhanced in testis (dark blue). Pie chart displaying the distribution of mRNA
molecules for the genes expressed in testis using the same color codes (B). Nearly 80% of all mRNA molecules are derived from housekeeping genes. The
FPKM values are shown across all 27 different tissue types for the 30 genes with most abundant expression in the testis. Each color corresponds to a gene
(C). Similar diagrams across all 27 different tissue types showing FPKM values for the top 30 highly testis-enriched genes. Each color corresponds to a gene
(D) and the top 30 testis group-enriched genes. Each color corresponds to a gene (E).
levels across the 27 analyzed tissues for the 30 most abundant highly
enriched genes are shown in Fig. 2D and in Fig. 2E the top 30 groupenriched genes are shown. The shared expression of these groupenriched genes is distributed among the other tissues, but with a slight
overrepresentation of shared genes with the brain.
The relation of the group-enriched genes is presented in a network
plot in Fig. 3A, which shows the amount of genes shared with a particular
tissue type (up to three different tissues). The tissue type with most
group-enriched genes in common with testis is the brain (n ¼ 48), followed by lung (n ¼ 33), whereas the remaining 24 other tissue types
only show a few group-enriched genes. A GO-based analysis of the
group of enriched genes shared between testis and lung shows an
enrichment for genes related to cilia function and movement. For the
group-enriched genes shared between testis and brain there was no
clear pattern of enriched GO-terms.
A GO-based analysis of all the 1124 genes identified as highly enriched
(n ¼ 364) and moderately enriched (n ¼ 760) in the testis indicated
a clear overrepresentation of genes associated with reproductive
process (154 genes) and spermatogenesis (105 genes). The cellular
localization of the corresponding proteins encoded by the highly and
moderately testis enriched genes were distributed as intra-cellular
(38%), membrane bound (5%) and extracellular (6%), whereas for half
of the proteins (50%) the localization was unknown (Supplementary
data, Fig. S3A).
481
The human testis-specific proteome
Table I The top 50 highly enriched genes in the testis.
Gene name
Gene description
Testis FPKM
Testis-specific score
.............................................................................................................................................................................................
PRM2
Protamine 2
2447
2520
PRM1
Protamine 1
2174
1489
TNP1
Transition protein 1 (during histone to protamine replacement)
1377
1353
PHF7
PHD finger protein 7
889
79
ANKRD7
Ankyrin repeat domain 7
567
505
TSACC
TSSK6-activating co-chaperone
488
604
INSL3
Insulin-like 3 (Leydig cell)
406
114
BOD1L2
Biorientation of chromosomes in cell division 1-like 2
405
4049
SMCP
Sperm mitochondria-associated cysteine-rich protein
344
470
LELP1
Late cornified envelope-like proline-rich 1
335
3346
TCP11
t-complex 11 homolog (mouse)
317
468
HMGB4
High mobility group box 4
307
2400
TUBA3C
Tubulin, alpha 3c
301
1244
C11orf 20
Chromosome 11 open reading frame 20
279
287
LDHC
Lactate dehydrogenase C
269
167
SPATA8
Spermatogenesis associated 8
250
1288
DEFB119
Defensin, beta 119
248
447
GSG1
Germ cell associated 1
244
110
FATE1
Fetal and adult testis expressed 1
179
300
ODF1
Outer dense fiber of sperm tails 1
177
1685
SPATA22
Spermatogenesis associated 22
172
172
AC007557.1
HCG1816075; uncharacterized protein
156
1146
PGK2
Phosphoglycerate kinase 2
149
1485
GAPDHS
Glyceraldehyde-3-phosphate dehydrogenase, spermatogenic
147
68
ATPAF1-AS1
ATPAF1 antisense RNA 1
145
815
AKAP4
A kinase (PRKA) anchor protein 4
145
1446
ACSBG2
acyl-CoA synthetase bubblegum family member 2
141
109
SHCBP1L
SHC SH2-domain binding protein 1-like
132
1171
ZPBP
Zona pellucida-binding protein
132
445
DDX4
DEAD (Asp-Glu-Ala-Asp) box polypeptide 4
127
691
BRDT
Bromodomain, testis specific
125
111
C10orf62
Chromosome 10 open reading frame 62
122
367
TPD52L3
Tumor protein D52-like 3
121
1210
CAPZA3
Capping protein (actin filament) muscle Z-line, alpha 3
115
68
GTSF1L
Gametocyte-specific factor 1-like
114
83
SPATA4
Spermatogenesis associated 4
114
68
UBQLN3
Ubiquilin 3
112
1124
DKKL1
Dickkopf-like 1
112
348
CETN1
Centrin, EF-hand protein, 1
110
1097
TPTE
Transmembrane phosphatase with tensin homology
105
170
TMCO5A
Transmembrane and coiled-coil domains 5A
105
193
ZPBP2
Zona pellucida binding protein 2
104
207
ACTL7A
Actin-like 7A
104
1036
EPPIN
Epididymal peptidase inhibitor
103
192
ADAD1
Adenosine deaminase domain containing 1 (testis specific)
103
1025
TEX101
Testis expressed 101
96
95
CCDC62
Coiled-coil domain containing 62
94
138
Continued
482
Djureinovic et al.
Table I Continued
Gene name
Gene description
Testis FPKM
Testis-specific score
.............................................................................................................................................................................................
DYDC1
DPY30 domain containing 1
91
71
C17orf64
Chromosome 17 open reading frame 64
85
266
SYCP3
Synaptonemal complex protein 3
84
105
Figure 3 Network plot of enriched genes, characterization of the highly enriched genes and cell type-specific expression for the 62 highly enriched genes
previously not well characterized. Network plot of the testis-enriched genes showing the relation of group-enriched genes (A). The light blue circle shows
the highly and moderately testis-enriched genes (n ¼ 1124). Dark blue nodes show the number of genes that are group enriched in two or three different
tissue types (grey circles). The size of each blue node is related to the square root of the number of genes enriched in a particular combination of tissues. Pie
chart based on the 364 highly testis-enriched genes, showing the distribution of preexisting knowledge with regard to function and cell types these genes are
expressed in (B). Sixty-two percent of the highly testis-enriched genes are previously poorly characterized. Pie chart showing the cell type localization based
on protein profiling using immunohistochemistry for 62 of the previously uncharacterized highly testis-enriched genes (C).
Cell type distribution of the testis specific
genes
With the large amount of unknown genes based on the GO analysis, we
further investigated how well characterized the 364 highly enriched genes
in testis were by using gene summary information from Uniprot and
NCBI data regarding function and the cell type specificity. Similar to
the GO analysis, a large fraction (62%) of the highly enriched genes
was not well characterized and only for 16% of the genes (n ¼ 60) the
specific cell type expressing the gene was known (Fig. 3B). The cell typespecific expression in relation to the different stages of the spermatogenesis of these previously known genes is shown in Supplementary data,
483
The human testis-specific proteome
Fig. S3B and Table SV. According to the analysis of previous literature,
more than one-third of these genes (40%) were expressed in more
than one cell type and of the cell type specifically expressed genes over
half were expressed in the later stages of spermatogenesis, including
spermatids (21%) and sperm (30%). Relatively few testis-specific genes
have previously been identified as expressed in early stages of the spermatogenesis, in Sertoli or Leydig cells.
The PASD1 gene encodes a putative protein with nuclear expression in
spermatogonia. PASD1 is a proposed transcription factor and has also
been classified as a cancer testis antigen (Liggins et al., 2004). The
DMRTB1 gene contains a DNA-binding domain but the putative protein
is unknown with regard to function and expression pattern (Ottolenghi
et al., 2002). Expression of the putative protein encoded by chromosome
3 open reading frame 22 (C3ORF22) is also found in spermatogonia and
shows a cytoplasmic expression pattern.
Antibody-based profiling of the testis
For the remaining highly enriched genes where cell type previously was
unknown (n ¼ 305), the expression patterns were evaluated, for the
corresponding proteins with available antibodies (n ¼ 260), using the
online Human Protein Atlas (HPA) database. The HPA provides immunohistochemically stained TMA cores from 46 different normal human
tissues, including three cores of human testicular tissue. The IHC staining
was visually evaluated with regard to testis specificity and cellular distribution. The antibody-based profiling allows further determination of localization within different subcellular compartments and cell types within
the testis, such as different morphologically defined stages during spermatogenesis, i.e. spermatogonia, spermatocytes, spermatids and sperm,
and other testis specific cell types such as Sertoli cells and Leydig cells.
The expression pattern for genes with fully congruent RNA-Seq and
protein expression data (n ¼ 62) was analyzed in detail, and a large
number of genes expressed in early phases of spermatogenesis were
identified (Fig. 3C and Supplementary data, Table SVI). The protein expression pattern in testis corresponding to these highly testis-enriched
genes are displayed in Supplementary data, Fig. S4. Almost half of
these genes showed expression in the earliest stages of spermatogenesis,
including spermatogonia (38%) and spermatocytes (11%). Only a few
genes were found to be localized to Sertoli cells and none of the highly
enriched genes where cell type previously was unknown in testis was
found to be expressed in Leydig cells.
Genes expressed in spermatocytes
Spermatocytes, derived from type B spermatogonia, can be subdivided
into primary spermatocytes that enter the first meiosis and secondary
spermatocytes that enter the second meiosis to produce haploid
spermatids. Several of the testis-specific proteins localized to spermatocytes were involved in testicular differentiation, proliferation and
meiosis. Examples of proteins expressed in spermatocytes as revealed
by the immunohistochemistry analysis are shown in Fig. 4B. The
deleted-in-azoospermia-like (DAZL) gene encodes a well-known RNAbinding protein that is essential for gametogenesis in both males and
females. In the testis, this protein is localized to the nucleus of
spermatogonia, but relocates to the cytoplasm in spermatocytes
during meiosis (Reijo et al., 2000). The testis expressed 101 (TEX101)
gene, has previously been reported to be associated with various
phases of spermatogenesis, but more precise knowledge regarding
the specific function and expression pattern is unknown (Teng et al.,
2006). Two additional examples of highly testis-enriched genes showing
cytoplasmic expression predominantly in spermatocytes are the two
cancer testis antigens transmembrane phosphatase with tensin homology
(TPTE) (Tapparel et al., 2003) and fragile X mental retardation 1 neighbor (FMR1NB) with previously no information regarding what cell type in
testis this gene is expressed in as well as the uncharacterized SHC
SH2-domain binding protein 1-like (SHCBP1L) gene.
Genes expressed in spermatogonia
Genes expressed in spermatids
Spermatogonia are diploid cells that form the basal layer of the seminiferous duct and present the initial phase of the spermatogenesis. The
spermatogonia can be divided into two subtypes, type A cells that
have stem cell-like properties and maintain the spermatogonia population and type B cells that produce primary spermatocytes. Both type
A and B cells result from mitotic divisions (de Rooij and Russell 2000).
In Fig. 4A, five examples of proteins with specific expression in
spermatogonia are shown as revealed by the immunohistochemistry
analysis. Doublesex- and mab-3-related transcription factor 1 (DMRT1)
encodes a protein that is described to primarily be expressed in the
nuclei of spermatogonia (Jørgensen et al., 2012). This transcription
factor plays a key role in male sex determination and differentiation by controlling testis development and male germ cell proliferation (Matson et al.,
2010). Also PAS domain-containing 1 (PASD1), P antigen family member
1 (PAGE1) and DMRT-like family B with proline-rich C-terminal,
1 (DMRTB1) show nuclear expression restricted to spermatogonia
located along the basement membrane of seminiferous ducts. The function of PAGE1 is unknown, but the PAGE1 gene belongs to a family of
genes that has been described to be expressed in a variety of tumors,
but with a testis restricted expression in normal tissues (cancer testis antigens) (Brinkmann et al., 1998). For the PASD1 and DMRTB1 genes
there is previously only evidence of existence at the transcript level.
Spermatids are derived from secondary spermatocytes and these cells
are subdivided into early, round spermatids that are transcriptionally
active and late, elongated spermatids that are transcriptionally inert
(Kierszenbaum and Tres, 1975). The formation of the acrosome, a testisspecific organelle required for sperm– egg interactions, is formed during
the meiotic development of the late spermatid into mature spermatozoa.
The highly testis-enriched proteins with known function and predominant expression in spermatids are involved in conversion of nucleosomal
chromatin and sperm development and maturation. Five examples of
proteins showing expression in spermatids, including two proteins with
expression restricted to the acrosome, are shown in Fig. 4C. The transition protein 1 (TNP1) gene encodes a spermatid-specific product of
the haploid genome which replaces histone and is itself replaced in the
mature sperm by the protamines. TNP1 protein is expressed in late
stage, elongating spermatids and is involved in the conversion of nucleosomal chromatin to the compact, non-nucleosomal form found in the
sperm nucleus (Wouters-Tyrou et al., 1998). The proteins encoded by
sperm mitochondria-associated cysteine-rich protein (SMCP) and actinlike protein 7B (ACTL7B) show a similar expression pattern in spermatids.
SMCP encodes a sperm mitochondria-associated cysteine-rich protein
that localizes to the capsule associated with the mitochondrial outer
membranes and is thought to function in the organization and
484
Djureinovic et al.
Figure 4 Examples of immunohistochemistry-based protein profiling of highly testis-enriched genes, showing expression patterns in cell types representing different stages of spermatogenesis. Tissue sections showing parts of normal seminiferous ducts with proteins expressed in spermatogonia (A), spermatocytes (B), spermatids (C) and sperm (D). Note differences in expression pattern between DMRT1, PASD1, PAGE1 and DMRTB1 (nuclei) and
C3ORF22 (cytoplasm) in spermatogonia. In spermatids TNP1, SMCP and ACTL7B are expressed in the nuclei, whereas ACRV1 and TMCO5A show a
distinct acrosomal pattern of expression.
Figure 5 IHC analysis of highly enriched genes in testis that are expressed in Sertoli cells and Leydig cells. Membranous positivity of Sertoli cells shown by
antibody towards SLCO6A1, whereas FATE1 and DEFB119 show cytoplasmic staining pattern in both spermatogenic cells and Sertoli cells. INSL3 shows
cytoplasmic expression restricted to the Leydig cells.
485
The human testis-specific proteome
stabilization of the helical structure of the sperm’s mitochondrial sheath
(Hawthorne et al., 2006). The ACTL7B gene, with evidence of existence
at the transcript level, encodes a putative protein of unknown function
that is suggested to be a member of a family of actin-related proteins
which share significant amino acid sequence identity to conventional
actins (Chadwick et al., 1999). The two examples of proteins showing
a distinct acrosomal expression pattern, include the well-characterized
acrosomal vesicle protein 1 (ACRV1) and the uncharacterized transmembrane and coiled-coil domains 5A (TMCO5A) gene which only has
evidence of existence at the transcript level. The ACRV1 gene encodes
an acrosomal vesicle protein, which arises within the acrosomal vesicle
during spermatogenesis, and is associated with the acrosomal membranes and matrix of mature sperm (Herr et al., 1990). ACRV1 may be
involved in binding or penetration of the oocyte zona pellucidum by
the mature sperm.
Genes expressed in sperm
Following the maturation of late stage spermatids, the testicular end stage
of spermatogenesis results in the formation of sperm. Sperm consists of
uniflagellar sperm cells that are localized along the lumen of seminiferous
ducts. Specific proteins localized to mature sperm are involved in functions including sperm motility and male fertility. Examples of proteins
highly enriched in testis that are expressed in sperm are shown in
Fig. 4D. The protamine 2 (PRM2) gene shows the highest expression
level (2447 FPKM) of all the 364 highly testis-enriched genes (Wykes
et al., 1995). Protamines substitute for histones in sperm and represent
the major DNA-binding proteins in sperm nuclei. The function of protamines is to package sperm DNA into a highly condensed, stable and inactive complex. The glyceraldehyde-3-phosphate dehydrogenase,
spermatogenic (GAPDHS) gene, with specific expression in the testis,
has a similar function as the ubiquitously expressed GAPDH in regulating
glycolysis. The encoded enzyme catalyzes an important energy-yielding
step in carbohydrate metabolism and during spermatogenesis, this
enzyme may play an important role in regulating the switch between different energy-producing pathways, required for sperm motility (Welch
et al., 2000) The A kinase (PRKA) anchor protein 4 (AKAP4) gene
encodes for a protein, suggested to be a major structural component
of sperm fibrous sheath, which also plays a role in sperm motility
(Turner et al., 2001). The additional two examples include the chromosome 2 open reading frame 57 (C2orf 57) and chromosome 10 open
reading frame 40 (C10orf40), both genes of unknown function and only
evidence of existence at the transcript level.
Genes expressed in Sertoli and Leydig cells
The interaction between germinal cells and Sertoli cells is essential both
for the development of testis and for the progress of the spermatogenesis (Griswold, 1998). Of the highly testis-enriched genes in testis, only
a few were specifically localized to Sertoli or Leydig cells when compared
with the germ cell-specific proteins (Fig. 5). The solute carrier organic
anion transporter family, member 6A1 (SLCO6A1) with currently only
evidence at transcript level has previously been reported to be strongly
expressed in the human testis but not specifically in Sertoli cells. For the
remaining examples of genes we found to be associated with Sertoli cells,
the expression did not seem to be restricted to this particular cell type.
For example, fetal and adult testis expressed (FATE1) protein is shown to
be expressed in spermatogonia, primary spermatocytes and Sertoli cells
which also has been reported previously (Yang et al., 2005). Another
example was defensin, beta 119 (DEFB119), a member of the betadefensin family assumed to be involved in antibacterial activity. The
secreted DEFB119 protein, previously reported to be expressed in the
testis and epididymis, showed an expression pattern suggesting that
this protein also is expressed in Sertoli cells (Radhakrishnan et al.,
2005). Only one of the highly enriched genes in testis, the insulin-like 3
(Leydig cell) (INSL3) gene, showed specific expression in the androgenproducing Leydig cells. The INSL3 gene encodes a well-known marker of
Leydig cells, suggested to be involved in the development of urogenital
tract and intra-abdominal testicular descent (Burkhardt et al., 1994).
Discussion
Here we demonstrate the power of combining genome-wide RNA and
protein analysis to define the tissue-specific proteome of human testis.
The analysis shows that 77% of all putative protein coding genes
are expressed in the testis, which is more than in any of the other
26 tissues analyzed. In addition, the testis also has by far more
tissue-enriched genes than any other human tissue with .1000 genes
showing an at least 5-fold higher expression. One explanation to the
large abundance of enriched genes in testis, when compared with
other tissue types, could be that testis harbors the only male cell type
that undergoes meiosis to generate haploid cells. Meiosis involves
several specific events such as reduction of the number of chromosomes,
condensation of nucleus and removal of excess cytoplasm within the
sperm. In addition, the sperm cells can be described as an independent
unicellular organism that has acquired unique morphological features and
structures that are necessary for sperm motility and fertilization of the
egg. It is not unlikely that many of the genes here identified as testis specific could show shared expression with the cells developing into the
haploid egg in females. An important extension of this could therefore
be to include additional specialized tissue types from females including
an enrichment of oocytes. More work is needed to explore the expression pattern of the development of the egg in females to establish the differences between the development of the haploid cell in males and
females.
It is noteworthy that the term “testis specific” is used for the genes with
enriched or enhanced expression in testis, but in reality a vast majority of
these genes have transcripts present also in other tissues. A tissuespecific score has therefore been obtained for all genes to define the
ratio of mRNA levels in testis when compared with the maximum
mRNA levels in all other tissues. In this analysis we have focused on
the highly enriched genes with a ratio over 50 and found that most of
these were expressed in germ cells of the seminiferous epithelium,
whereas the two somatic cell types in testis, Leydig and Sertoli cells,
only expressed a few highly enriched genes. This supports the assumption that the cell types specifically involved in spermatogenesis account
for the large amount of specific genes. It is therefore likely that the expression profiles of Sertoli and Leydig cells more resemble other somatic cell
types and hence do not contain as many specific genes as germ cells. For
example, suggested Sertoli cell markers such as the anti-Mullerian
hormone (AMH) (Josso et al., 1977) was also expressed in the brain,
the follicle-stimulating hormone receptor (Heckert and Griswold,
2002) was expressed in the ovary and the inhibin alpha (Merchenthaler
et al., 1987) was expressed in the adrenal gland, ovary and placenta
and accordingly these genes were categorized as group enriched or
486
enhanced in testis. It should also be pointed out that genes with low levels
of expression may still exist with a specific expression in Sertoli or Leydig
cells, but given that 90% of the mRNA from testicular tissue is derived
from germ cells, such genes may not readily be detected. A more
in-depth analysis of the testis genes detected in the categories moderately enriched, group enriched and enhanced would be of interest to generate a more complete view of expression patterns in testicular tissue.
Altogether the tissue-specific genes, including testis-enriched, groupenriched and testis-enhanced genes, constitute 10% of all the human
genes and 14% of the transcript pool. An analysis of the most expressed
genes in testis also support that few of the testis-specific genes are
expressed at high mRNA levels. Despite the high number of tissuespecific genes in testis, an in-depth analysis of the mRNA pool derived
from testicular tissue demonstrates that 80% of the transcripts are
encoded by genes expressed in all 27 tissues (‘house-keeping’) (Fagerberg et al., 2014), well consistent with that a large fraction of all expressed
proteins is shared between different tissues (Pontén et al., 2009). The list
of all these genes with elevated expression in testis can be downloaded
from the Human Protein Atlas portal (www.proteinatlas.org) and are
also provided as Supplementary data, Table SIV.
An interesting category of proteins is the genes with shared elevated
expression with at least another tissue (group enriched). The network
analysis demonstrates that the tissues with most shared tissue-enriched
genes are the brain (n ¼ 48) and the lung (n ¼ 33). A functional analysis
shows that several of the genes shared with the lung are involved in functions involving motility of cilia and sperm, while the genes shared with the
brain show less enrichment for any specific functions. A subset of genes
expressed in the testis and brain are RNA-binding proteins evidently
involved in the regulation of transcription and another subset are
expressed during development, such as the AMH gene that encodes
a growth factor involved in male sexual differentiation by causing the
regression of Mullerian ducts.
It has earlier been suggested that tissue-specific genes are overrepresented for genes coding for secreted and transmembrane spanning proteins, summarized as the inside –outside rule (Ramsköld et al., 2009;
Fagerberg et al., 2010). In contrast, the analysis of the testis-specific
proteins shows that a large majority (76%) of the tissue-specific genes
with known subcellular localization are intracellular. This suggests that
the inside –outside rule for tissue specificity does not apply for the
testis and that most of the testis-specific functions are expressed inside
the cell which is not surprising since meiosis, histone replacement and
sperm differentiation mainly reflect intracellular events.
Combining the data from previously well-characterized testis-specific
proteins with present data regarding subcompartment localization in
testis suggests that approximately half of the testis-specific genes are
expressed in late stage spermatogenesis, sperm (30%) and spermatids
(21%), whereas only a few are expressed in earlier stages, spermatogonia
(3%) and spermatocytes (4%). Interestingly, a majority of the genes
analyzed here with previous unknown knowledge regarding cell-type
localization were expressed in early stage spermatogenesis. A substantial
subset of these proteins belong to the family of cancer testis antigens. At
present .100 cancer testis antigens genes have been identified and a
similar study including data from cancer tissues could be of interest to
identify additional cancer testis antigens. These genes are important
since they are restricted to germ cells in normal tissues, but frequently
expressed in various cancer tissues. In addition, the corresponding
proteins appear immunogenic and recognized by the immune system
Djureinovic et al.
of cancer patients and thus believed to be of use in the development
of antigen-specific cancer vaccines (van der Bruggen et al., 1991; Chen
et al., 1997).
In conclusion, the analysis has identified genes of interest to study
testis biology. These genes are interesting starting points for studying
mechanisms for infertility and basic cell biology. In this work we have
focused our protein analysis on genes with corresponding unambiguously staining antibodies and further analysis is needed for the identified
testis-specific genes not yet characterized on the protein level. The
ultimate aim of such an effort should be to provide a complete map of
expression patterns of all genes expressed in a tissue-specific manner.
Supplementary data
Supplementary data are available at http://molehr.oxfordjournals.org/.
Acknowledgements
Pathologists and staff at the Department of Clinical Pathology, Uppsala
University Hospital are greatly acknowledged for providing the tissues
used in the study, in particular, the authors would like to thank Simin
Tahmasebpoor for excellent help with preparing frozen tissues for
RNA extraction. We thank Christer Höög for valuable comments on
the manuscript. The authors also wish to thank the staff of the Human
Protein Atlas project in both Sweden and India for their efforts generating
the Human Protein Atlas.
Authors’ roles
F.P. and M.U. conceived and designed the study. F.P., M.U. and D.D.
wrote the paper. L.F. and B.H. performed the bioinformatics analysis.
D.D. and C.L. performed the antibody-based protein profiling and
data comparison. A.D. prepared the RNA samples.
Funding
This work was supported by the Knut and Alice Wallenberg Foundation.
Conflict of interest
None declared.
References
Anders S, Huber W. Differential expression analysis for sequence count data.
Genome Biol 2010;11:R106.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP,
Dolinski K, Dwight SS, Eppig JT et al. Gene ontology: tool for the unification
of biology. The Gene Ontology Consortium. Nat Genet 2000;25:25 – 29.
Brinkmann U, Vasmatzis G, Lee B, Yerushalmi N, Essand M, Pastan I. PAGE-1,
an X chromosome-linked GAGE-like gene that is expressed in normal and
neoplastic prostate, testis, and uterus. Proc Natl Acad Sci USA 1998;
95:10757 – 10762.
Burkhardt E, Adham IM, Brosig B, Gastmann A, Mattei MG, Engel W.
Structural organization of the porcine and human genes coding for a
Leydig cell-specific insulin-like peptide (LEY I-L) and chromosomal
localization of the human gene (INSL3). Genomics 1994;20:13 – 19.
The human testis-specific proteome
Chadwick BP, Mull J, Helbling LA, Gill S, Leyne M, Robbins CM, Pinkett HW,
Makalowska I, Maayan C, Blumenfeld A et al. Cloning, mapping, and
expression of two novel actin genes, actin-like-7A (ACTL7A) and
actin-like-7B (ACTL7B), from the familial dysautonomia candidate
region on 9q31. Genomics 1999;58:302– 309.
Chen YT, Scanlan MJ, Sahin U, Türeci O, Gure AO, Tsang S, Williamson B,
Stockert E, Pfreundschuh M, Old LJ. A testicular antigen aberrantly
expressed in human cancers detected by autologous antibody screening.
Proc Natl Acad Sci USA 1997;94:1914– 1918.
De Rooij DG, Russel LD. All you wanted to know about spermatogonia but
were afraid to ask. J Androl 2000;21:776 – 798.
Eden E, Navon R, Steinfeld I, Lipson D, Yakhini Z. GOrilla: a tool for
discovery and visualization of enriched GO terms in ranked gene lists.
BMC Bioinformatics 2009;10:48.
Fagerberg L, Jonasson K, von Heijne G, Uhlén M, Berglund L. Prediction of the
human membrane proteome. Proteomics 2010;10:1141 –1149.
Fagerberg L, Hallström BM, Oksvold P, Kampf C, Djureinovic D, Odeberg J,
Habuka M, Tahmasebpoor S, Danielsson A, Edlund K et al. Analysis of
the human tissue-specific expression by genome-wide integration of
transcriptomics and antibody-based proteomics. Mol Cell Proteomics
2014;13:397 – 406.
Flicek P, Amode MR, Barrell D, Beal K, Brent S, Carvalho-Silva D, Clapham P,
Coates G, Fairley S, Fitzgerald S et al. Ensembl 2012. Nucleic Acids Res 2012;
40(Database issue): D84 –D90.
Griswold MD. The central role of Sertoli cells in spermatogenesis. Semin
Cell Dev Biol 1998;9:411 – 416.
Hawthorne SK, Goodarzi G, Bagarova J, Gallant KE, Busanelli RR, Olend WJ,
Kleene KC. Comparative genomics of the sperm mitochondria-associated
cysteine-rich protein gene. Genomics 2006;87:382 – 391.
Hebenstreit D, Fang M, Gu M, Charoensawan V, van Oudenaarden A,
Teichmann SA. RNA sequencing reveals two major classes of gene
expression levels in metazoan cells. Mol Syst Biol 2011;7:497.
Heckert LL, Griswold MD. The expression of the follicle-stimulating
hormone receptor in spermatogenesis. Recent Prog Horm Res 2002;
57:129– 148.
Herr JC, Flickinger CJ, Homyk M, Klotz K, John E. Biochemical and
morphological characterization of the intra-acrosomal antigen SP-10
from human sperm. Biol Reprod 1990;42:181 – 193.
Hoog C. Expression of a large number of novel testis-specific genes during
spermatogenesis coincides with the functional reorganization of the
male germ cell. Int J Dev Biol 1995;39:719 – 726.
Jørgensen A, Nielsen JE, Jensen MB, Græm N, Rajpert-De Meyts E.
Analysis of meiosis regulators in human gonads: a sexually dimorphic
spatio-temporal expression pattern suggests involvement of DMRT1 in
meiotic entry. Mol Hum Reprod 2012;18:523 –534.
Josso N, Picard JY, Tran D. The anti-Mullerian hormone. Birth Defects Orig
Artic Ser 1977;13:59 – 84.
Kampf C, Olsson I, Ryberg U, Sjöstedt E, Pontén F. Production of tissue
microarrays, immunohistochemistry staining and digitalization within the
human protein atlas. J Vis Exp 2012. doi: 10.3791/3620.
Kerr SM, Vambrie S, McKay SJ, Cooke HJ. Analysis of cDNA sequences from
mouse testis. Mamm Genome 1994;5:557 – 565.
Kierszenbaum AL, Tres LL. Structural and transcriptional features of the
mouse spermatid genome. J Cell Biol 1975;65:258– 270.
Liggins AP, Brown PJ, Asker K, Pulford K, Banham AH. A novel diffuse large
B-cell lymphoma-associated cancer testis antigen encoding a PAS domain
protein. Br J Cancer 2004;91:141 – 149.
Matson CK, Murphy MW, Griswold MD, Yoshida S, Bardwell VJ,
Zarkower D. The mammalian doublesex homolog DMRT1 is a
transcriptional gatekeeper that controls the mitosis versus meiosis
decision in male germ cells. Dev Cell 2010;19:612 – 624.
487
Merchenthaler I, Culler MD, Petrusz P, Negro-Vilar A. Immunocytochemical
localization of inhibin in rat and human reproductive tissues. Mol Cell
Endocrinol 1987;54:239– 243.
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and
quantifying mammalian transcriptomes by RNA-Seq. Nat Methods 2008;
5:621 – 628.
Ottolenghi C, Fellous M, Barbieri M, McElreavey K. Novel paralogy relations
among human chromosomes support a link between the phylogeny of
doublesex-related genes and the evolution of sex determination.
Genomics 2002;79:333 – 343.
Pawlak A, Toussaint C, Lévy I, Bulle F, Poyard M, Barouki R, Guellaën G.
Characterization of a large population of mRNAs from human testis.
Genomics 1995;26:151 – 158.
Penttilä TL, Yuan L, Mali P, Höög C, Parvinen M. Haploid gene expression:
temporal onset and storage patterns of 13 novel transcripts during rat
and mouse spermiogenesis. Biol Reprod 1995;53:499 – 510.
Pontén F, Jirström K, Uhlen M. The Human Protein Atlas—a tool for
pathology. J Pathol 2008;216:387– 393.
Pontén F, Gry M, Fagerberg L, Lundberg E, Asplund A, Berglund L, Oksvold P,
Björling E, Hober S, Kampf C. A global view of protein expression in human
cells, tissues, and organs. Mol Syst Biol 2009;5:337.
Pontén F, Schwenk JM, Asplund A, Edqvist PH. The Human Protein Atlas as a
proteomic resource for biomarker discovery. J Intern Med 2011;
270:428– 446.
Radhakrishnan Y, Hamil KG, Yenugu S, Young SL, French FS, Hall SH.
Identification, characterization, and evolution of a primate beta-defensin
gene cluster. Genes Immun 2005;6:203 – 210.
Ramsköld D, Wang ET, Burge CB, Sandberg R. An abundance of ubiquitously
expressed genes revealed by tissue transcriptome sequence data. PLoS
Comput Biol 2009;5:e1000598.
RCoreTeam. A Language and Environment for Statistical Computing. Vienna,
Austria: R Foundation for Statistical Computing, 2013.
Reijo RA, Dorfman DM, Slee R, Renshaw AA, Loughlin KR, Cooke H,
Page DC. DAZ family proteins exist throughout male germ cell
development and transit from nucleus to cytoplasm at meiosis in
humans and mice. Biol Reprod 2000;63:1490 – 1496.
Russell LD, Ettlin RA, Sinha Hikim AP, Clegg ED. Histological and
Histopathological Evaluation of the Testis. Clearwater, FL, USA: Cache
River Press, 1990.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N,
Schwikowski B, Ideker T. Cytoscape: a software environment for
integrated models of biomolecular interaction networks. Genome Res
2003;13:2498 – 2504.
Tapparel C, Reymond A, Girardet C, Guillou L, Lyle R, Lamon C, Hutter P,
Antonarakis SE. The TPTE gene family: cellular expression, subcellular
localization and alternative splicing. Gene 2003;323:189 – 199.
Teng X, Yang J, Xie Y, Ni Z, Hu R, Shi L, Lin Z, Hu L, Zhao G, Ding X et al. A
novel spermatogenesis-specific uPAR gene expressed in human and
mouse testis. Biochem Biophys Res Commun. 2006;342:1223 – 1227.
Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ,
Salzberg SL, Wold BJ, Pachter L. Transcript assembly and quantification
by RNA-Seq reveals unannotated transcripts and isoform switching
during cell differentiation. Nat Biotechnol 2010;28:511 – 515.
Turner RM, Musse MP, Mandal A, Klotz K, Jayes FC, Herr JC, Gerton GL,
Moss SB, Chemes HE. Molecular genetic analysis of two human sperm
fibrous sheath proteins, AKAP4 and AKAP3, in men with dysplasia of
the fibrous sheath. J Androl 2001;22:302 – 315.
Uhlén M, Björling E, Agaton C, Szigyarto CA, Amini B, Andersen E,
Andersson AC, Angelidou P, Asplund A, Asplund C et al. A human
protein atlas for normal and cancer tissues based on antibody
proteomics. Mol Cell Proteomics 2005;4:1920– 1932.
488
Uhlen M, Oksvold P, Fagerberg L, Lundberg E, Jonasson K, Forsberg M,
Zwahlen M, Kampf C, Wester K, Hober S et al. Towards a knowledgebased Human Protein Atlas. Nat Biotechnol 2010;28:1248 – 1250.
van der Bruggen P, Traversari C, Chomez P, Lurquin C, De Plaen E,
Van den Eynde BJ, Knuth A, Boon T. A gene encoding an antigen
recognized by cytolytic T lymphocytes on a human melanoma. Science
1991;254:1643 – 1647.
Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for
transcriptomics. Nat Rev Genet 2009;10:57 – 63.
Welch JE, Brown PL, O’Brien DA, Magyar PL, Bunch DO, Mori C,
Eddy EM. Human glyceraldehyde 3-phosphate dehydrogenase-2 gene
Djureinovic et al.
is expressed specifically in spermatogenic cells. J Androl 2000;
21:328 – 338.
Wouters-Tyrou D, Martinage A, Chevaillier P, Sautière P. Nuclear basic
proteins in spermiogenesis. Biochimie 1998;80:117 – 128.
Wykes SM, Nelson JE, Visscher DW, Djakiew D, Krawetz SA. Coordinate
expression of the PRM1, PRM2, and TNP2 multigene locus in human
testis. DNA Cell Biol 1995;14:155 – 161.
Yang XA, Dong XY, Qiao H, Wang YD, Peng JR, Li Y, Pang XW, Tian C,
Chen WF. Immunohistochemical analysis of the expression of FATE/
BJ-HCC-2 antigen in normal and malignant tissues. Lab Invest 2005;
85:205 – 213.