Download Complex Germline Architecture: Two Genes

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Public health genomics wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Epigenetics in learning and memory wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Mutation wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Epigenomics wikipedia , lookup

Gene therapy wikipedia , lookup

Molecular cloning wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Cell-free fetal DNA wikipedia , lookup

Oncogenomics wikipedia , lookup

Cancer epigenetics wikipedia , lookup

Copy-number variation wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Primary transcript wikipedia , lookup

Gene desert wikipedia , lookup

Pathogenomics wikipedia , lookup

Genetic engineering wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Extrachromosomal DNA wikipedia , lookup

Gene expression programming wikipedia , lookup

Metagenomics wikipedia , lookup

Transposable element wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Ridge (biology) wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Point mutation wikipedia , lookup

Genomic imprinting wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Human genome wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

RNA-Seq wikipedia , lookup

Genomic library wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Microsatellite wikipedia , lookup

Genomics wikipedia , lookup

Genome (book) wikipedia , lookup

Non-coding DNA wikipedia , lookup

Gene expression profiling wikipedia , lookup

Minimal genome wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Gene wikipedia , lookup

History of genetic engineering wikipedia , lookup

Genome editing wikipedia , lookup

Genome evolution wikipedia , lookup

Microevolution wikipedia , lookup

Helitron (biology) wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Designer baby wikipedia , lookup

Transcript
Complex Germline Architecture: Two Genes Intertwined on Two Loci
Shiuhyang Kuo, Wei-Jen Chang, and Laura F. Landweber
Department of Ecology and Evolutionary Biology, Princeton University
The germline micronuclear genome of some ciliated protists can be scrambled, with coding segments disordered relative to
the expressed macronuclear genome. Here, we report a surprisingly complex pair of genes that assemble from interwoven
segments on two germline loci in the ciliate Uroleptus. This baroque organization requires two scrambled genes to be
disentangled from each other from two clusters in the genome, one containing segments 1-2-4-5-6-8-11-13-15-16 and the
other 7-9-3-10-12-14, with pieces 1–5 comprising the first gene and 6–16 the second gene. Both genes remain linked in the
somatic genome on a 1.5-kb ‘‘nanochromosome.’’ This study is the first to reveal that two genes can become scrambled
during evolution with their coding segments intertwined. These twin scrambled genes underscore the beauty and exceptions of protist genome architecture, pointing to the critical need for evolutionary biologists to survey protist genomes
broadly.
Introduction
Ciliates possess two types of nuclei, a germline micronucleus and a somatic macronucleus that develop from
a copy of the germline after cell mating. In spirotrichous
ciliates, massive deletion and rearrangement of the 1- to
2.5-Gb germline genome constructs an ;50-Mb set of
approximately 2-kb tiny chromosomes (Prescott 1994),
sometimes called nanochromosomes (Doak et al. 2003) because of their size and because they typically contain just
one gene each. These together comprise the gene-dense somatic genome. The process of deletion of up to 98% of the
germline DNA removes internal eliminated segments (IES)
that interrupt genes, as well as transposons and intergenic
DNA. The remaining macronuclear-destined segments
(MDS), containing mostly coding DNA with limited regulatory sequences and introns, join together to form the nanochromosomes. In some cases, the germline order of coding
segments is permuted, requiring their decryption (unscrambling) to create functional genes in the macronucleus
(Prescott 2000). Scrambled segments can be present on either strand within a locus or even dispersed over unlinked
loci in the germline (Ardell, Lozupone, and Landweber
2003). While three scrambled genes have been extensively
studied to date (a-telomere–binding protein [Mitcham,
Lynn, and Prescott 1992]; DNA polymerase a [Hoffman
and Prescott 1996; Landweber, Kuo, and Curtis 2000];
and actin I [Greslin et al. 1989; Hogan et al. 2001;
Dalby and Prescott 2004]), no new scrambled genes have
been reported for almost a decade.
To search for new scrambled genes, we constructed
a small macronuclear library from the spirotrichous ciliate
Uroleptus, known to have scrambled genes (e.g., Dalby and
Prescott 2004; Chang et al. 2005), and randomly selected 11
clones whose sequences were used to search for their counterparts in the germline micronuclear DNA by polymerase
chain reaction (PCR). One of the clones contained a 1,554bp macronuclear chromosome, including 36-bp telomeric
repeats (C4A4) at each end, with two predicted open reading
frames that we confirmed by 5#- and 3#-rapid amplification
of cDNA ends (RACE) (fig. 1D). Because few twoKey words: micronucleus, scrambled gene, Uroleptus, hypotrich,
spirotrich, ciliate.
E-mail: [email protected].
Mol. Biol. Evol. 23(1):4–6. 2006
doi:10.1093/molbev/msj017
Advance Access publication September 14, 2005
Ó The Author 2005. Published by Oxford University Press on behalf of
the Society for Molecular Biology and Evolution. All rights reserved.
For permissions, please e-mail: [email protected]
gene chromosomes have been described (e.g., Seegmiller,
Williams, and Herrick 1997), we examined this molecule
further. The upstream mRNA putatively encodes a 77-aa
peptide of unknown function (positions 146–379), which
shares no significant database matches at the protein or
nucleotide level. The downstream mRNA encodes a protein
of 198 aa with high similarity to eukaryotic 60S ribosomal
protein L13 (58% identical/75% similar to ribosomal
protein L13a in Homo sapiens).
We determined the germline organization of these
genes by a variation of walking PCR (Myrick and Gelbart
2002) that is successful on ciliate micronuclear DNA. The
genetic map splits this gene into two regions (fig. 1C and
table 1). The overall germline architecture is quite surprising: while both genes are located in tandem on a single
1.5-kb macronuclear nanochromosome, their 16 segments
are scrambled and comingled on two clusters. One locus
contains 10 segments in the order 1-2-4-5-6-8-11-13-1516 and the other contains 6 segments in the order 7-9-310-12-14, with pieces 1–5 comprising the first gene and
6–16 the second gene. Pointer sequence repeats of 3–
14 bp are always present at the end of segment n and
the beginning of segment n 1 1, with one copy of the repeat retained in the macronuclear sequence (table 1) and
longer pointers between scrambled segments that have to
be reordered to make a functional copy. These pointer sequence repeats are thought to participate in homologous
DNA recombination during macronuclear development
(Prescott 2000).
Curiously, we also found a duplicated region of 1859
bp (91% identity) between these two loci (highlighted in
fig. 1C) that probably indicates where these two loci broke
apart during evolution (fig. 1A and B) because this region
contains paralogous copies of four segments for the first
gene. Three of these segments contain frameshifts and
are too divergent to contribute to a functional gene (table
1). So far, these two genes have not been found to be linked
in the macronucleus in three other spirotrichous ciliates
that contain scrambled genes (A. J. Li, W.-J. Chang, and
L. Landweber, unpublished data), suggesting that this unusual architecture may be specific to the Uroleptus lineage.
These twin scrambled genes underscore the complexities of ciliate genome architecture, with segments for
more than one gene woven together during germline evolution. With the genome project of a spirotrichous ciliate,
Two Scrambled Genes Intertwined on Two Loci 5
FIG. 1.—Schematic maps of the somatic and corresponding germline genes. Segments (macronuclear-destined segments [MDS]) 1–16 are marked in
each case and numbered according to (D). (A) and (B) present an evolutionary model for the origin of the architecture in (C), based on these data, with (A) the
inferred ancestral state, after the region became scrambled but before translocation of segment 3. (B) The inferred ancestral state containing both genes on the
same locus. (C) Germline map. Segments derived from the longer germline locus are white boxes, and the shorter loci are dark gray boxes. Internal eliminated segments and flanking DNA are not drawn to scale but as solid black lines with sizes (bp) in parentheses. The flanking DNA from the end of MDS 14
to another gene downstream (white arrow in opposite orientation) is subdivided into three sections with lengths indicated in brackets. Gray highlight indicates
that the middle section (1859 bp) overlaps both loci, containing paralogous copies of segments 1, 2, 4, and 5. The geometric representation reflects the
presence of pointer sequence repeats at the end of each segment that recombine with their copy at the beginning of the next segment (table 1). (D) Somatic
map. Hatched ends represent telomeres. Arrowed lines illustrate mRNA transcripts, with dots representing mRNA cap sites (positions 69 and 721) and
arrows representing poly-adenylation sites (437, 577, and 1420). Small vertical arrows mark inferred start and stop codons for both genes.
Oxytricha trifallax (Sterkiella histriomuscorum), currently
underway (Doak et al. 2003), we anticipate that more
surprises are yet to come.
Methods
Uroleptus sp. (similar to Uroleptus gallina) was isolated from soil in a meadow close to Plainsboro township
in Princeton, N.J., and grown as previously described
(Chang et al. 2004). Monoclonal culture of Uroleptus sp.
was achieved by isolating one single cell and growing
it into a population. Macronuclei and micronuclei were
isolated from vegetative Uroleptus cells as previously described (Chang et al. 2004). DNA was purified by phenol/
chloroform extractions and treated with RNase A to remove
RNA. Micronuclear DNA was further purified from residual macronuclear DNA by low melting point agarose gel
electrophoresis. Uroleptus macronuclear DNA was treated
with exonuclease I (New England BioLabs, Ipswich,
Mass.) to trim telomeric overhangs and with shrimp alkaline phosphatase (Promega, Madison, Wis.) to remove 5#phosphates. Blunt-ended DNA was then cloned into a
pCR4Blunt-TOPO vector (Invitrogen, Carlsbad, Calif.),
and 11 clones were randomly selected for sequencing.
The sequence in figure 1D (GenBank AY875978) was verified by several independent PCR clones, as in Chang et al.
(2004). Southern hybridization confirmed that there were
no alternatively processed versions of this macronuclear
6
Kuo et al.
Table 1
Features of DNA Segments for Both Scrambled Genes
5# Pointer Sequencea
5# Telomere addition site
GTT
AATTCTTATCATaT
CAGGATAATtAAA
ATAAATTTT
ATA
GTATGTTCgAAAAG
CACTTGCTC
AGAAACAAGCT
AAGTTCCACGAATT
CAACCCAA
TCAGTT
GTATTTTCT
TCAGAG
TTGGCtGCATGAA
TCTCGTcGAA
Segment (MDS)
number n
3# Pointer Sequencea
MDS Length
(bp)b
Percent Similarity
to MDS n#c
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
GTT
AATTCTTATCATtT
CAGGATAATaAAA
ATAAATTTT
ATA
GTATGTTCaAAAAG
CACTTGCTC
AGAAACAAGCT
AAGTTCCACGAATT
CAACCCAA
TCAGTT
GTATTTTTCT
TCAGAG
TTGGCcGCATGAA
TCTCGTtGAA
3# Telomere addition site
3
321
65
64
46
167
24
96
1
21
7
20
9
39
144
313
100
91.6
92.3
80.8
NOTE.—MDS: macronuclear-destined segments.
a
Lowercase nucleotides indicate pointer mismatches, with the retained somatic sequence underlined.
b
MDS length excludes pointer sequence repeats.
c
See figure 1C.
chromosome (data not shown). Micronuclear organizations
of these two genes were determined by using traditional
PCR against 20 ng micronuclear DNA. Sequences flanking
these micronuclear PCR fragments were obtained by Universal Fast Walking (UFW) PCR (Myrick and Gelbart
2002). Sequences of the major (accession number
AY875979) and minor (accession number AY875980) micronuclear loci including flanking sequences are deposited
in GenBank. RNA isolation and 5#- and 3#-RACE were as
described (Chang et al. 2004). All primer sequences are
available in Supplementary Material online.
Supplementary Material
Primer sequences and additional detail of methods are
available at Molecular Biology and Evolution online (http://
www.mbe.oxfordjournals.org/).
Acknowledgments
We thank Jingmei Wang for ciliate culture, Mark
Daley and Ian McQuillan for suggesting the representation
in fig. 1C, and Glenn Herrick for suggesting and Kyl Myrick
for discussing UFW PCR. This study was supported by
National Institute of General Medical Sciences grant
GM59708 and National Science Foundation grant
0121422. S.K. thanks H. Nick for comments.
Literature Cited
Ardell, D. H., C. A. Lozupone, and L. F. Landweber. 2003.
Polymorphism, recombination and alternative unscrambling
in the DNA polymerase alpha gene of the ciliate Stylonychia
lemnae (Alveolata; class Spirotrichea). Genetics 165:
1761–1777.
Chang, W. J., P. D. Bryson, H. Liang, M. K. Shin, and L. F.
Landweber. 2005. The evolutionary origin of a complex
scrambled gene. Proc. Natl. Acad. Sci. USA (in press).
Chang, W. J., N. A. Stover, V. M. Addis, and L. F. Landweber.
2004. A micronuclear locus containing three protein-coding
genes remains linked during macronuclear development in
the spirotrichous ciliate Holosticha. Protist 155:245–255.
Dalby, A. B., and D. M. Prescott. 2004. The scrambled actin I gene
in Uroleptus pisces. Chromosoma 112:247–254.
Doak, T. G., A. R. O. Cavalcanti, N. A. Stover, D. M. Dunn, R.
Weiss, G. Herrick, and L. F. Landweber. 2003. Sequencing the
Oxytricha trifallax macronuclear genome: a pilot project.
Trends Genet. 19:603–607.
Greslin, A. F., D. M. Prescott, Y. Oka, S. H. Loukin, and J. C.
Chappell. 1989. Reordering of nine exons is necessary to form
a functional actin gene in Oxytricha nova. Proc. Natl. Acad.
Sci. USA 86:6264–6268.
Hoffman, D. C., and D. M. Prescott. 1996. The germline gene
encoding DNA polymerase alpha in the hypotrichous ciliate
Oxytricha nova is extremely scrambled. Nucleic Acids Res.
24:3337–3340.
Hogan, D. J., E. A. Hewitt, K. E. Orr, D. M. Prescott, and K. M.
Muller. 2001. Evolution of IESs and scrambling in the actin I
gene in hypotrichous ciliates. Proc. Natl. Acad. Sci. USA
98:15101–15106.
Landweber, L. F., T. C. Kuo, and E. A. Curtis. 2000. Evolution
and assembly of an extremely scrambled gene. Proc. Natl.
Acad. Sci. USA 97:3298–3303.
Mitcham, J. L., A. J. Lynn, and D. M. Prescott. 1992. Analysis of
a scrambled gene: the gene encoding alpha-telomere-binding
protein in Oxytricha nova. Genes Dev. 6:788–800.
Myrick, K. V., and W. M. Gelbart. 2002. Universal fast walking
for direct and versatile determination of flanking sequence.
Gene 284:125–131.
Prescott, D. M. 1994. The DNA of ciliated protozoa. Microbiol.
Rev. 58:233–267.
———. 2000. Genome gymnastics: unique modes of DNA evolution and processing in ciliates. Nat. Rev. Genet. 1:191–198.
Seegmiller, A., K. R. Williams, and G. Herrick. 1997. Two twogene macronuclear chromosomes of the hypotrichous ciliates
Oxytricha fallax and O. trifallax generated by alternative
processing of the 81 locus. Dev. Genet. 20:348–357.
Jennifer Wernegreen, Associate Editor
Accepted August 30, 2005