Download An archaebacterial homolog of pelota, a meiotic cell division protein

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Metalloprotein wikipedia , lookup

Gene nomenclature wikipedia , lookup

Paracrine signalling wikipedia , lookup

Promoter (genetics) wikipedia , lookup

Non-coding DNA wikipedia , lookup

Gene wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Protein wikipedia , lookup

Gene regulatory network wikipedia , lookup

Magnesium transporter wikipedia , lookup

Evolution of metal ions in biological systems wikipedia , lookup

Western blot wikipedia , lookup

Interactome wikipedia , lookup

Protein structure prediction wikipedia , lookup

Expression vector wikipedia , lookup

Transcriptional regulation wikipedia , lookup

Homology modeling wikipedia , lookup

Endogenous retrovirus wikipedia , lookup

Ancestral sequence reconstruction wikipedia , lookup

Gene expression wikipedia , lookup

Protein–protein interaction wikipedia , lookup

Silencer (genetics) wikipedia , lookup

Point mutation wikipedia , lookup

Proteolysis wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Two-hybrid screening wikipedia , lookup

Transcript
MICROBIOLOGY
LETTERS
ELSEVIER
FEMS Microbiology Letters 144 (1996) 151-155
An archaebacterial homolog of pelota, a meiotic
cell division protein in eukaryotes
Mark A. Ragan ajbl*, John M. Logsdon Jr. c, Christoph W. Sensen alb,
Robert L. Charlebois a,d, W. Ford Doolittle asc
’ Program in Evolutionary Biology, Canadian Institute for Advanced Research, National Research Council of Canada, 1411 Oxford Street,
Halifax, Nova Scotia B3H 321. Canada
b Institute for Marine Biosciences, National Research Council of Canada, 1411 Oxford Street, Halifax, Nova Scotia B3H 321, Canada
’ Department of Biochemistry, Sir Charles Tupper Medical Building, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada
’ Department of Biology, University of Ottawa, 30 Marie Curie, Ottawa, Ontario KlN 6N5, Canada
Received8 August 1996; revised 28 August 1996; accepted29 August 1996
Abstract
An open reading frame (pelA) specifying a homolog of pelota and DOM34, proteins required for meiotic cell division in
melanogaster and Saccharomyces
cerevisiae, respectively, has been cloned, sequenced and identified from the
archaebacterium Sulfolobus solfataricus. The S. solfataricus PelA protein is about 20% identical with pelota, DOM34 and the
hypothetical protein R74.6 of Caenorhabditis elegans. The presence of a pelota homolog in archaebacteria implies that the
meiotic functions of the eukaryotic protein were co-opted from, or added to, other functions existing before the emergence of
eukaryotes. The nuclear localization signal and negatively charged carboxy-terminus characteristic of eukaryotic pelota-like
proteins are absent from the S. solfataricus homolog, and hence may be indicative of the acquired eukaryotic function(s).
Drosophila
Keywords: (Sulfolobus solfataricus) ;
Archaea; Archaebacterium; Meiosis; Pelota; DOM34
1. Introduction
Archaebacteria
(Archaea) are phylogenetically
distinct from eubacteria (Bacteria) and eukaryotes (Eukarya) [l-4]. Phylogenetic analyses of macromolecular sequences suggest that within the archaebacteria
there are two main lineages, Euryarchaeota
(methanogens, halobacteria
and some thermophiles)
and
Crenarchaeota
(other thermophiles, including Su2folobus) [l]. Although most molecular-sequence
anal* Corresponding author. Tel. : +l (902) 426-1674;
Fax: +l (902)-426-9413;E-mail: [email protected]
0378-1097 /96/$12.00 Copyright
PIISO378-1097(96)00351-5
0 1996 Federation
of European
yses indicate that the Archaebacteria
are monophyletic [2,3], there is some evidence that they may
be paraphyletic [4], with the crenarchaeotes
as the
sister-group of eukaryotes. In either case, archaebacteria and eukaryotes are thought to share a common
ancestor more recent than that shared with eubacteria [2,4].
The specific common ancestry of archaebacteria
and eukaryotes implied by quantitative
analyses of
macromolecular
sequences [2,4] extends to gene content. ‘Eukaryotic’ gene products encoded by archaebacterial
(but not eubacterial)
genomes include
TFIIB- and TBP-like transcription factors [5,6], subMicrobiological
Societies. Published
by Elsevier Science B.V
152
M.A.
Ragan
rl al. I FEMS
Microbiology
units of RNA polymerase [7], translation initiation
factors [8], ribosomal proteins [9], and a VCP-like
two-domain ATPase that in eukaryotes is involved
in cell-cycle regulation [lo]. Thus, an appropriate
archaebacterial genome could be a better ‘prokaryotic model of the eukaryotic genome’ than could any
eubacterial genome.
Sulfolobus solfataricus has one of the larger genames known among archaea ([1 I] and unpublished), and some experimental genetics has been developed [12]. Because Sulfolobus species thrive in
sulfur hot springs at 70-87°C pH 24 [13], all proteins in these organisms must be resistant to denaturation at this temperature and, by extension, under
other extreme conditions. These organisms can also
oxidize elemental sulfur to sulfuric acid, metabolize
organosulfur
compounds,
oxidize mineral sulfides,
and carry out a variety of other biochemically interesting reactions, including several of potential industrial interest [13,14]. All of these factors make S.
solfataricus a promising candidate for complete genome characterization;
we have now sequenced
more 1 Mbp (more than one-third) of its genome
[9]. In the course of this work, we discovered an
open reading frame (pelA) homologous with several
eukaryotic sequences, two of which (the pelota gene
of Drosophila melanogaster [ 151 and the dom34 gene
of Saccharomyces cerevisiae [ 16,171) are involved in
meiotic cell division. Herein we describe pelA, the
first prokaryotic homolog of pelota and dom34.
2. Materials and methods
A genomic library was prepared by digesting
DNA from S. solfataricus DSM 1617 with Hind111
to a mean size of 40-45 kb, and cloning into Escherichia coli using the cosmid vector Tropist3 [ 181. Cosmids were sorted by the restriction landmark strategy to yield a provisional
set of minimally
overlapping cosmid clones [9]. Two overlapping cosmids were selected, individually subcloned into pUC
18 by nebulization
and blunt-end ligation [9], and
sequenced using automated fluorescent techniques.
Contigs were linked, ambiguities resolved, and database searches conducted as detailed elsewhere [9].
Predicted proteins were aligned using ClustalW [19]
followed by modest visual adjustment.
Letters
144 (1996)
151-155
3. Results and discussion
Nucleotide sequences of the two cosmid inserts
were assembled into a single 56 105base pair contig
containing 71 open reading frames (ORFs) at least
300 base pairs in length [9]. Approximately
38 of
these ORFs could be identified, either solidly on
the basis of strong matches with well-identified database entries, or provisionally based on matches with
less well-identified entries, weaker matches, and/or
the presence of characteristic motifs. One of the provisionally identified ORFs, ORF ~04039, is located
between positions 33 827 and 32 793 on the complementary strand of this 56-kb contig [9].
Blastx analysis [20] of ORF co4039 against the
molecular-sequence
databases returned match probabilities of about 9 X lOe-16 with pelota protein
from
D.
melanogaster
[15], 4 X lOe- 13 with
DOM34 protein from S. cerevisiae [ 16,171, and
1 X lOe-13
with the hypothetical
protein R74.6
from Caenorhabditis elegans [21], levels significantly
greater than expected by chance for proteins of this
length. No matches were recorded against eubacterial entries, although the eubacterial database includes
the complete genomes of Haemophilus influenzae and
Mycoplasma genitalium. We conclude that pelota
homologs are either absent from eubacteria, or are
so highly changed as to be unrecognizable.
Within the multiple alignment (Fig. l), pairwise
comparison
of the S. solfataricus co4039 protein
(PelA) with pelota shows 76 identities (22.2%) and
73 conservative changes (21.3%) over 343 alignable
positions, with eight gaps. Comparing PelA with the
three full-length eukaryotic proteins (pelota, DOM34
and R74.6), 33 positions (9.6%) are identical in all
four, and a further 38 (11.1%) are identical in PelA
and two of the three others. These identical residues
are distributed fairly uniformly along the length of
the proteins, giving further support for homology of
pelA and pelota. By comparison,
89 of 378 positions
(23.5%) are identical among the three complete eukaryotic sequences, with no gaps.
In addition to the three complete eukaryotic pelota homologs, several other related sequences were
identified. Searches of the Expressed Sequence Tag
(EST) division of GenBank revealed five sequences
from Homo sapiens, one from Rattus sp. and one
from Arabidopsis thaliana. Four of the five Homo
M.A. Ragan et al. IFEMS Microbiology Letters 144 (1996) 151-155
153
344
Wh,ta F..~.LQ.A.SQ.I..L.ISDN.FRCQWSL.I.S.~...KI~~~.~....A.L...R~
Cae"~lhb&s
R74.6 F..E1W.NR.EIPEL.T..L..~.FRI\QDI.T.RXYV.L".~.I(.~I~S~~.~...CPI.....~~
sacchuomyc*r Ix*134 W..EltE.VI(.A.Y..ISYL.~.H..NLI\P_ ~Sv.SNG.K..CQ....AC..RIPL~
EST
...K..?K.N-AM.~.L.ISDE.FRWCIWAT.SRW.LVDSVKEN ?.T.RI?SSLWSDEQ?SP..."A.....PVPELSDPEGDSSSE~
Homo
+ + ++ * ++r+X+IY*#*+
+
w *+X4+*
+ x
* + +X+
.x
Droropikih
.*
3%
381
386
. *
Fig. 1. Amino acid alignment of pelota homologs: Sulfolobus solfataricus PelA, GenBank accession number U67942; Drosophila melanogarter pelota [15], accession number U27197; Caenorhabditis elegans R74.6 [21], accession number 236238; Saccharomyces cerevisiae
M)M34 [16,17], accession number X771 14; Ratrus sp.-Homo sapiens EST (fused accession numbers H31427, T30453, T19552 and
T19553); Arabidopsis thaliana EST (accession number T20628); and Homo sapiens EST (accession number D20778). Gaps introduced for
alignment are indicated by dashes (-). A dot c) indicates identity with the corresponding position in PelA (top row). An asterisk (*) under
a column indicates that the same amino acid occurs in PelA and all three full-length eukaryotic sequences (pelota, R74.6 and DOM34); a
pound sign () indicates identity of PelA and two of the three eukaryotic sequences. A plus (+) indicates positions identical in the three
full-length eukaryotic sequences but not PelA. The putative nuclear localization signal (NLS) is indicated at pelota positions 168-172.
Question marks (?) in EST sequences are shown where the nucleotide sequences ambiguously predict the protein sequence; residues at the
termini of ESTs of dubious alignment were omitted. Cumulative amino acid positions are given to the right of the complete sequences.
ESTs and the Rattus EST sequences were significantly overlapping and, in these regions, were nearly
identical in nucleotide sequence (data not shown).
We assembled these five sequences into a single representative sequence shown in Fig. 1 as ‘RattusHomo’. The remaining
Homo sequence could not
be assembled with Rattus-Homo; it may represent a
distinct gene, and is presented separately in Fig. 1.
One additional
S. cerevisiae homolog, XYZ3, was
revealed by tBlastn [20] searches of GenBank; it contains numerous single-nucleotide
insertions and deletions, and may be a pseudogene copy of the dom34
locus (cited in [16]). We PCR-amplified,
cloned and
sequenced XYZ3 from S. cerevisiae (data not shown)
and confirmed it to be a probable psuedogene [16].
S. solfataricus PelA is predicted to be a 39.6-kDa
protein with pZ 5.10. Structural predictions using the
methods of Gamier [22] and GGBSM [23] reveal it
to be a rather nondescript,
mostly helical protein
interrupted by four or more regions of coil. In one
turn- or coil-rich region of PelA (Fig. 1, pelota positions 168-172), a canonical eukaryotic nuclear localization signal PRKRK [ 151 is conserved precisely in
the Drosophila, Caenorhabditis and Homo sequences,
and is at least partially present in the Arubidopsis
protein, but is absent from the predicted PelA protein. The negatively charged carboxy-terminus
characteristic of all these eukaryotic pelota homologs is
likewise absent from PelA. Both of these features are
thus predicted to reflect the presence or function of
this protein
specifically within eukaryotic
cells.
Although there are no extensive regions of identity
between PelA and the eukaryotic proteins, one region of apparent conservation appears at the extreme
carboxy end of PelA, just prior to the negatively
charged terminus of the eukaryotic proteins; this region (in pelota, positions 374-383), which is even
more conserved among the eukaryotic
sequences,
could reflect a conserved ancestral function. These
and other residues that are conserved between PeiA
154
M.A. Rugan et al. I FEMS
Microbiology
and its eukaryotic homologs are potential targets for
site-directed mutagenesis and functional characterization.
Pelotu was first identified as a male-sterile mutant
in D. melanoguster, and subsequent work has demonstrated its involvement
in meiotic cell division
[15]. Strong mutant alleles of pelota result in meiotic
arrest of spermatocytes, although mitotic division in
spermatogenesis
appears normal [15]. On the other
hand, dom34 mutants of S. cerevisiae suggest both
mitotic and meiotic functions for the DOM34 protein, since they result in slowing of the mitotic cell
cycle, acceleration of meiotic divisions, and reduction of spore production (cited in [15]). Introduction
of the D. melanogaster pelota gene into S. cerevisiae
substantially rescues growth defects arising from mutations in dom34 (cited in [15]), incidentally providing further evidence that pelota and dom34 are
homologs. It might be more difficult to examine
whether S. solfataricus pelA can likewise rescue
dom34 or pelota mutants, as the putative nuclear
localization signal (and perhaps other necessary features of the protein, such as the negatively charged
carboxy-terminal
extension) are absent, and because
the PelA protein is presumably adapted to function
at elevated temperature;
functional characterization
might better be approached by mutagenesis experiments in cultures of S. solfataricus.
The discovery of a pelota homolog in an archaebacterium provides evidence that some of the machinery involved in meiosis predates the origin of
meiosis itself, which presumably arose after the divergence of eukaryotes from archaebacteria.
Thus,
the meiosis-specific role(s) of some eukaryotic proteins, including pelota, most likely arose within the
eukaryotic lineage, being recruited from some other
function(s).
Examination
of the function of such
meiotic-protein
homologs in prokaryotes may thus
allow a better understanding
not only of the role
of their eukaryotic counterparts in meiosis, but perhaps also of the evolutionary origin of meiosis itself.
To our knowledge, the only other identified archaebacterial homologs of meiotic genes are the radA
genes, themselves homologs of eubacterial recA [24]
and the meiosis-specific DMCI gene of S. cerevisiae
[25]. By contrast, no pelA homolog could be identified in eubacteria, suggesting that this gene, like cer-
Letters 144 (1996)
151-155
tain others [5-lo], may be specific to the archaebacterial-eukaryotic
lineage.
4. Note added in proof
After this paper was accepted, the complete genome sequence of the euryarchaeote Methanococcus
jannaschii was reported (Bult et al. (1996) Science
273, 105%1073) revealing a pelota homolog which
is 34% identical to S. solfataricus reported here.
Acknowledgments
We gratefully acknowledge the expert assistance of
G. Allard, C.C.-Y. Chan, T. Gaasterland, Q.Y. Liu,
S.L. Penny, M.E. Schenk, R.K. Singh and F. Young,
and critical review by S.L. Baldauf and M.E. Reith.
J. Chew and M. Dobson kindly provided S. cerevisiae DNA. This project was supported by the Canadian Genome Analysis and Technology Program, the
Canadian Institute for Advanced Research, the National Research Council of Canada, and the Medical
Research Council of Canada. J.M.L. is supported by
a Postdoctoral
Research Fellowship in Molecular
Evolution from the Alfred P. Sloan Foundation.
References
[I] Woese, CR., Kandler, 0. and Wheelis, M. (1990) Towards a
natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc. Nat1 Acad. Sci. USA 87,
45764519.
[2] Brown, J.R. and Doolittle, W.F. (1995) Root of the universal
tree of life based on ancient aminoacyl-tRNA
synthetase gene
duplications. Proc. Natl Acad. Sci. USA 92, 244-2445.
[3] Olsen, G.J., Woese, C.R. and Overbeek, R. (1994) The winds
of (evolutionary) change: breathing new life into microbiology. J. Bacterial. 176: 1-6.
[4] Baldauf, S.L., Palmer, J.D. and Doolittle, W.F. (1996) The
root of the universal tree and the origin of eukaryotes based
on elongation factor phylogeny. Proc. Natl Acad. Sci. USA
93, 7149-7754.
[5] Rowlands, T., Baumann, P. and Jackson, S.P. (1994) The
TATA-binding
protein: a genera1 transcription
factor in eukaryotes and archaebacteria.
Science 264, 1326-1329.
[6] Creti, R., Londei, P. and Cammarano,
P. (1993) Complete
nucleotide sequence of an archaeal (Pyrococcus woesei) gene
MA. Ragan et al. IFEMS Microbiology Letters 144 (1996) 151-155
encoding a homolog of eukaryotic transcription
factor IIB
(TFIIB). Nucl. Acids Res. 21, 2942.
[71 Langer, D., Hain, .I., Thuriaux, P. and Zillig, W. (1995) Transcription in Archaea: similarity to that in Eucarya. Proc. Nat1
Acad. Sci. USA 92, 5768-5172.
PI Keeling, P.J. and Doolittle, W.F. (1995) Archaea: narrowing
the gap between prokaryotes
and eukaryotes.
Proc. Nat1
Acad. Sci. USA 92, 5761-5764.
191 Sensen, C.W., Klenk, H.-P., Singh, R.K., Allard, G., Chan,
C.C.-Y., Liu, Q.Y., Penny, S.L., Young, F., Schenk, M.E.,
Gaasterland,
T., Doolittle, W.F., Ragan, M.A. and Charlebois, R.L. (1996) Organizational
characteristics and information content of an archaeal genome: 156 kbp of sequence
from Sulfolobus solfataricus P2. Mol. Microbial., in press.
1101 Confalonieri, F., Marsault, .I. and Duguet, M. (1994) SAV, an
archaebacterial
gene with extensive homology to a family of
highly conserved eukayotic ATPases. J. Mol. Biol. 235, 396
401.
Ull Kondo, S., Yamagishi, A. and Oshima, T. (1993) A physical
map of the sulfur-dependent
archaebacterium
Sulfolobus acidocaldarius 7 chromosome. J. Bacterial. 175, 1532-1536.
WI S&leper, C., Kubo, K. and Zillig, W. (1992) The particle
SSVl from the extremely thermophilic archaeon Sulfolobus
is a virus: demonstration
of infectivity and of transfection
with viral DNA. Proc. Nat1 Acad. Sci. USA 89, 7645-7649.
P31 Segerer, A.H. and Stetter, K.O. (1992) in: The Prokaryotes
(Balows, A., Trtlper, H.G., Dworkin, M., Harder, W. and
Schleifer, K.-H., Eds.), Vol. I, pp. 684701, Springer-Verlag,
New York.
[I41 Daniels, L. (1993) Biochemistry of methanogenesis. In: The
Biochemistry of Archaea (Archaebacteria)
(Kates, M., Kushner, D.J. and Matheson, A.T., Eds.), pp. 41-112, Elsevier,
Amsterdam.
u51 Eberhart, C.G. and Wasserman, S.A. (1995) The pelota locus
encodes a protein required for meiotic cell division: an analysis of GslM arrest in Drosophila spermatogenesis.
Development 121, 3477-3486.
WI Lalo, D., Stettler, S., Mariotte, S., Gendreau, E. and Thuriaux, P. (1994) Organization of chromosome XIV in Saccharomyces cerevisiae. Yeast 10, 523-533.
1171 VerHasselt, P., Aert, R., Voet, M. and Volckaert, G. (1994)
Nucleotide sequence analysis of the left arm of yeast chomo-
155
some XIV, encompassing the centromere sequence. Yeast 10,
945-951,
[18] De Smet, K.A.L., Jamil, S. and Stoker, N.G. (1993) Tropist3:
a cosmid vector for simplified mapping of both G+C-rich and
A+T-rich genomic DNA. Gene 136, 215-219.
[19] Thompson,
J.D., Higgins, D.G. and Gibson, T.J. (1994)
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positionspecific gap penalties and weight matrix choice. Nucl. Acids
Res. 22, 46734680.
[20] Altschul, SF., Gish, W., Miller, W., Myers, E.W. and Lipman, D.J. (1990) Basic local alignment search tool. I. Mol.
Biol. 215, 403-410.
1211 Wilson, R., Ainscough, R., Anderson, K., Baynes, C., Berks,
M., Bonfield, J., Burton, J., Connell, M., Copsey, T., Cooper,
J., Coulson, A., Craxton, M., Dear, S., Du, Z., Durbin, R.,
Favello, A., Fraser, A., Fulton, L., Gardner, A., Green, P.,
Hawkins, T., Hillier, L., Jier, M., Johnston, L., Jones, M.,
Kershaw, J., Kirsten, J., Laisster, N., Latreille, P., Lightning,
J., Lloyd, C., Mortimore, B., G’Callaghan,
M., Parsons, J.,
Percy, C., Rifken, L., Roopra, A., Saunders, D., Shownkeen,
R., Sims, M., Smaldon, N., Smith, A., Smith, M., Sonnhammer, E., Staden, R., Sulston, J., Thierry-Mieg, J., Thomas, K.,
Vaudin, M., Vaughan, K., Waterston, R., Watson, A., Weinstock, L., Wilkinson-Sproat,
J. and Wohldman, P. (1994) 2.2
Mb of contiguous nucleotide sequence from chromosome III
of C. elegans. Nature 368, 32-38.
[22] Gamier, J., Osguthorpe, D.J. and Robson, B. (1978) Analysis
of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. J. Mol.
Biol. 120, 97-120.
[23] Gascuel, 0. and Gohnard, J.L. (1988) A simple method for
predicting the secondary structure of globular proteins: implications and accuracy. CABIOS 4, 357-365.
[24] Sandler, S.J., Satin, L.H., Samra, H.S. and Clark, A.J. (1996)
recA-like genes from three archaean species with putative protein products similar to Rad51 and Dmcl proteins of the yeast
Saccharomyces cerevisiae. Nucl. Acids Res. 24, 2125-2132.
[25] Bishop, D.K., Park, D., Xu, L. and Kleckner, N. (1992)
DMCl: a meiosis-specific yeast homolog of E. coli recA required for recombination,
synaptonemal complex formation,
and cell cycle progression. Cell 69, 439456.