Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
JOURNAL OF BACTERIOLOGY, JUlY 1992, p. 4525-4529 Vol. 174, No. 0021-9193/92/144525-05$02.00/0 14 Copyright C 1992, American Society for Microbiology MINIREVIEW Short, Interspersed Repetitive DNA Sequences in Prokaryotic Genomes JAMES R. LUPSKI1,2 AND GEORGE M. WEINSTOCK3* Institute for Molecular Genetics' and Department of Pediatrics,2 Baylor College of Medicine, Houston, Texas 77030, and Department of Biochemistry and Molecular Biology, The University of Texas Health Science Center, P.O. Box 20708, 6431 Fannin Street, Houston, Texas 772253 Repeated sequences are present in the genomes of all organisms. The DNA sequence organization of eukaryotic genomes consists of numerous repeats interspersed with single-copy sequences. This organization has been elucidated through renaturation rate studies of denatured DNA (2). The best-characterized eukaryotic interspersed repetitive DNA sequences are the repeats in the Alu family that have been characterized in mammalian genomes (29, 55). In the human genome of 3 x 109 bp, the 300-bp Alu sequence represents approximately 3 to 6% of the mass, indicating that there are approximately 300,000 copies per haploid genome (55). Interspersed repetitive sequences in numerous prokaryotic and eukaryotic microorganisms have recently been described (Table 1). The ubiquitous nature and seemingly random chromosomal distribution of these repeats in prokaryotic genomes suggests that like more-complex eukaryotes, the DNA sequence arrangement in the genomes of eubacteria may consist of short repeats interspersed with longer single-copy sequences. Although the precise functions of repetitive sequences in prokaryotic genomes are obscure, their presence can be exploited for several applications and molecular genetic manipulations. Prokaryotic genomes contain a variety of low-copy-number repeated sequences, such as insertion elements, rRNA operons, tRNA genes, and other genes such as those belonging to the rhs gene family (32, 51, 52). These sequences may contribute to the evolution of chromosome structure through DNA rearrangements such as chromosomal deletions, duplications, and inversions (44). Repeated genes also provide mechanisms to enhance bacterial virulence, such as antigenic variation in Neissena gonorrhoeae (21) and other pathogens. The subject of this minireview is an additional group of repeated sequences, the interspersed repetitive sequence elements which are characterized as being short (usually <200 bp), noncoding, intercistronic, and apparently widely distributed throughout prokaryotic genomes. The repeated sequence that was first described and most intensively studied is the repetitive extragenic palindrome (REP), or palindromic unit (PU) sequence (12, 24), initially identified in Salmonella typhimurium and Escherichia coli. The REP sequence was identified through DNA sequence comparisons of intercistronic regions of different operons (12, 24). The REP sequence structure consists of a 38nucleotide consensus sequence which is a palindrome and can form a stable stem-loop structure with a 5-bp variable loop in the central region of the consensus sequence (58). In * Corresponding author. 4525 REP sequences in which a base pair change from the sequence has occurred in the region of dyad symmetry, a compensatory change is usually found in the symmetric portion of the sequence to maintain the palindrome. The initial analysis of REP sequences in E. coli showed that the REP sequence was frequently present in complex clusters of a few copies of the sequence (58). These clusters are present and transcribed in about 25% of transcription units and may account for as much as 1% of the total genome. It was estimated that there are between 500 and 1,000 copies of the REP sequence organized into clusters on the E. coli chromosome (58). Subsequent studies showed these clusters could contain as many as 10 elements (34). Recently it has been reported that the clusters could also contain other repeated elements in specific arrangements (17, 18). These latter complex repeats were termed bacterial interspersed mosaic elements (BIMEs). The E. coli chromosome was estimated to have about 500 BIME structures, always found in extragenic locations, or about one BIME for every six genes (17, 18). The REP element has been shown to be located between genes within an operon or at the end of an operon, in different orientations and in tandem arrays, and in operons distributed throughout the E. coli genome (12, 58). Hybridization experiments, using the E. coli Gene Mapping Membrane (41) containing an ordered library of the E. coli genome (30), in conjunction with analysis of GenBank and the E. coli sequence data base EcoSeqS (47, 48), more accurately mapped the distribution of REP sequences throughout the genome (6a). This study detected fewer total REP sequences (563 sequences in 295 clusters). It is difficult to know which numbers are more accurate, since the hybridization and computer search procedures were known to miss some sequences, while in earlier studies some REP sequences were duplicated in the data bases used. In any case, it is clear that REP sequences are distributed throughout the E. coli genome. In some but not all cases, the REP element is present in the same intergenic region of analogous operons from different bacterial species (5, 25). Although REP sequences were initially described in two members of the family Enterobacteriaceae, S. typhimurium and E. coli, for which substantial sequence information is available in the DNA sequence data base, REP-like sequences have been shown to exist throughout the eubacterial kingdom, although consensus sequences may differ among different bacteria (6a, 11, 63). By using DNA-DNA hybridization and the polymerase chain reaction (PCR), REP-like sequences were documented in 7 of the 10 consensus 4526 J. BAcTERIOL. MINIREVIEW TABLE 1. Microorganisms shown to carry interspersed repetitive sequences in their genomes Organism(s) Reference(s) Agrobacterium spp .....................9 19 Bordetella pertussis.................... Brucella spp .................... 22 54 Candida albicans .................... Cyanobacteria .................... Deinococcus radiodurans .................... Escherichia coli.................... TABLE 2. Well-defined interspersed repetitive sequences in prokaryotic genomes 35 31 12, 24, 27, 56 45 Halobacterium spp .................... 53 73 Leptospira interrogans .................... 8, 46 Mycobacterium tuberculosis .................... 68 Mycoplasma pneumoniae .................... 10 Myxococcus xanthus .................... Neisseria gonorrhoeae .....................4 Neisseria meningitidis .....................4 60 Pneumocystis carinii .................... Rhizobium spp .........................9 50 Rhodomicrobium vannielili .................... Salmonella typhimurium .................... 12, 24, 27, 56 42 Spiroplasma spp .................... Eukaryotic microorganisms .................... phyla defined by Woese (69) on the basis of analysis of 16S rRNA sequences. Postulated functions for the REP or PU sequence have included roles in gene regulation by differential translation within polycistronic operons and retroregulation by stabilization of translationally active mRNA (39, 40, 58, 59). These functions may be a consequence of the formation of a stem-loop structure in a specific chromosome location. Although the 3' ends of some mRNAs are found near REP sequences, the REP sequence does not appear to function as a transcription terminator (16, 58, 59). One possible exception to this is the report that the REP sequence acts as a bidirectional transcription terminator between the convergentpheA and tyrA genes (26). It has been proposed that this REP element is a member of a subclass of REP sequences termed PU* sequences that have a slightly different sequence and may be transcription terminators (16). The fact that REP elements are not always present in the same intergenic regions in different species indicates that they are not an essential element in the mechanism of gene expression. However, when present they can influence the amount or regulation of expression of a gene or operon. Additional proposed functions have related to chromosome structure (13, 14, 58). It has been shown that the REP sequence binds DNA gyrase (70) and DNA polymerase I (15) and that the HU protein stimulates binding of gyrase to the REP sequence (71). These studies have led to the proposal that the PU or REP sequence is involved in the folding of the bacterial nucleoid into independent supercoiled looped domains (13, 58). The REP sequence has also been implicated in chromosomal rearrangements (12, 58) and has been identified at the junction of tandem duplications (57). Any proposed function(s) must explain the conserved primary sequence, conserved palindromic structure, location in noncoding regions, inclusion in transcribed sequences, distribution throughout the bacterial chromosome, presence in different species, and ubiquitous nature of the REP sequence. These characteristics, as well as the lack of a single unified proposed function, are reminiscent of the Alu family of repetitive sequences in mammalian genomes. An additional interspersed repetitive DNA sequence ele- Size (bp) Sequence REP (PU) 38 ERIC (IRU) 126 (124-127) Ng-rep 26 Dr-rep 150-192 Aft-rep 87 300 RepMP1 SDC1 400 Organism in which originally identified E. coli S. typhimurium E. coli S. typhimurium N. gonnorhoeae N. meningitidis D. radiodurans M. xanthus M. pneumoniae M. pneumoniae Reference(s) 12, 24 27, 56 4 31 10 68 3 ment in prokaryotic genomes has been identified in E. coli, S. typhimurium, and other members of the family Enterobacteriaceae. This repetitive element has alternately been called an intergenic repeat unit (IRU) (56) or enterobacterial repetitive intergenic consensus (ERIC) (27) sequence. The ERIC or IRU sequence is approximately 126 bp in length. Like the REP sequence, it appears from analysis of DNA sequence information in the data bases that the ERIC or IRU sequence is located in noncoding transcribed regions of the chromosome, in either orientation with respect to transcription, and includes a conserved inverted repeat. The chromosomal locations of the ERIC sequence can differ in different species. In analogous locations, regardless of whether or not the IRU or ERIC sequence is present in different species, the surrounding sequence is not disturbed, suggesting either a precise deletion or insertion relative to the other species (56). No base pair duplication of surrounding sequence can be identified, arguing against a classic transposition mechanism for dispersion of these repetitive sequences. ERIC-like repetitive sequence elements have recently been demonstrated throughout the eubacterial kingdom (63). Several other small, dispersed repeated sequences have been characterized in bacteria (Table 2). A 26-bp repetitive sequence was identified from N. gonorrhoeae by utilizing a two-dimensional S1 nuclease heteroduplex mapping protocol. A consensus sequence, with five variant positions, was proposed on the basis of the nucleotide sequences of this repeat determined from four different locations (4). By hybridization experiments using a 21-nucleotide oligomer, sequences related to this repetitive sequence (Ng-rep) were demonstrated in several N. gonorrheae strains as well as in the species Neisseria meningitidis (4). By using synthetic oligonucleotides matching the consensus sequence for the repeat as primers in the PCR, Ng-rep-like sequences were detected in many microorganisms (64a). Control experiments indicate that this result was more specific than an arbitrarily primed PCR assay (66, 67); thus, the Ng-rep sequence is a candidate for another widely dispersed repeated sequence element. Additional repetitive sequence elements that have recently been identified from prokaryotic genomes have been found by hybridization experiments. Typically, a cloned DNA fragment is used as a probe of total genomic DNA and produces a complex hybridization pattern, consistent with multiple copies of all or part of the cloned sequence being present in the genome. A highly conserved repetitive sequence was identified in this manner from the radioresistant bacterium Deinococcus radiodurans SARK (31). When an VOL. 174, 1992 EcoRI-digested D. radiodurans genomic library was constructed in the pUC18 vector, about 3% of all E. coli colonies hybridized with an insert containing the repetitive sequence. This repetitive element (Dr-rep) is variable in length, ranging from 150 to 192 bp, and contains dyad symmetries near each end based on the nucleotide sequence obtained from five different repeats. Hybridization experiments using a restriction fragment in the intergenic region between the ops and tps genes of Myxococcus xanthus as a probe of genomic blots identified many strongly hybridizing genomic fragments (10). The same restriction fragment was used to screen a M. xanthus cosmid library containing 30- to 40-kb inserts in the pHC79 vector, and about 3% of the clones hybridized. DNA sequence analysis of six cloned repeats revealed a consensus sequence with an 87-bp core. Interestingly, by sequence analysis one of the MX-rep repeats was located downstream from the rpoD gene (10) in a position analogous to that of a REP element in E. coli (12). At least five different repeated elements in Mycoplasma pneumoniae have been described (3, 49, 61, 68). All of these are present in 8 to 10 copies per cell and are larger than the elements discussed previously. The smaller elements are the 300-bp RepMP1 (68) and 400-bp SDC1 (3) elements; the other elements are from 1.1 to 2.2 kb in size. It is remarkable that at least 6% of the 840-kb M. pneumoniae genome consists of repeated sequences (49). In addition to the repetitive sequences REP (PU), ERIC (IRU), Ng-rep, Dr-rep, Mx-rep, and those of M. pneumoniae described above, simple sequence repeats have been identified in eubacteria. In the cyanobacterium Calothrix genome, three different simple heptanucleotide sequence repeats were identified and named short tandemly repeated repetitive (STRR) sequences (35). The STRR sequence was initially identified by hybridization experiments. A 413-bpXbaI restriction fragment located upstream of the cpeBA operon was used to probe an EcoRI-digested Calothrix genomic DNA blot. A smear corresponding to multiple hybridization bands was obtained, suggesting the occurrence of a highly repeated sequence. The STRR sequences were also found to be present in the genome of other cyanobacteria. These observations are reminiscent of the simple sequence repeats, consisting of tandemly amplified sequences, that have been identified in the human genome. Hypervariable minisatellite regions (28) and variable-number-of-tandem-repeat markers (36) have been useful in fingerprinting human genomes. Short tandem repeats, or microsatellites such as (GT)n (33, 65), as well as trimeric and tetrameric tandem repeats (7), are highly polymorphic and useful in genetic mapping and DNA typing in humans. In addition, tandemly reiterated heptanucleotide sequences have been shown to be a structural sequence component of chromosome telomeres in eukaryotes (1, 72). Repetitive sequences are ubiquitous in prokaryotic genomes. Their precise function has not been determined. How repetitive sequences have become dispersed throughout genomes and how their high degree of sequence similarity is maintained are not known. A postulated mechanism for their propagation and dissemination is as "selfish" DNA by gene conversion (13, 25). Regardless of how these repetitive sequences are maintained and dispersed, their presence and widespread distribution in both prokaryotic and eukaryotic genomes strongly suggests that they are important to the structure and evolution of genomes. A functional role in genome structure for a repetitive sequence from the mouse has been proposed on the basis of the interaction with MINIREVIEW 4527 nuclear polypeptides by nitrocellulose filter binding assays (38). The hypothesized role involves loop anchorage to the nuclear matrix. This is analogous to some of the proposed roles for the REP sequence discussed above. The highly conserved primary sequence structure and widespread distribution of these repetitive sequences in prokaryotic genomes can be exploited for several applications and molecular genetic manipulations, as has been done with the short repeated sequences in mammalian genomes (20, 23, 37, 43). Recently it has been demonstrated that synthetic oligonucleotides matching the consensus sequence for REP and ERIC sequences, with the base inosine placed at the nonconserved positions, could be used as primers in the PCR using prokaryotic genomic DNA as a template (63). In this technique, known as rep-PCR (for repetitive sequence element PCR), the primers bind to the repetitive sequences in the prokaryotic genome, and if those primer binding sites are in the proper orientation and within a distance that can be spanned by Taq polymerase extension, an amplification product is obtained. Size fractionation of the amplification products by agarose gel electrophoresis reveals a specific pattern or genomic DNA fingerprint. These fingerprints appear to be species and even strain specific (63). Multiple colonies isolated from the same culture, as well as repeated isolates over time from the same strain, reveal a specific pattern (64), demonstrating that the fingerprint is stable and specific to a given bacterial strain. Fingerprinting by rep-PCR may have multiple applications in epidemiological studies of microorganisms as well as for quality assurance or quality control of microorganisms used in medical, agricultural, and industrial applications (6, 64). In addition to enabling the generation of strain-specific fingerprints, primers to repetitive sequences in prokaryotic genomes may be useful for the mapping of insertion sequences, such as mutations caused by transposon insertions. By using a REP primer that corresponds to either half of the REP palindrome, a TnS-specific primer (both of which are oriented to allow synthesis to proceed into the neighboring genomic DNA), and chromosomal DNA from a TnS insertion mutant strain as a template, PCR amplification products that contained an unique sequence located between the transposon and an adjacent REP sequence were obtained (62). The unique sequence amplification product was used as a probe for the TnS-mutated locus by screening an ordered E. coli genomic library (41), and the insertion was mapped quickly to a specific region of the chromosome (62). This rapid physical mapping technique should complement already useful methods for genetic mapping in E. coli. This procedure should also be important in other bacteria that do not have as sophisticated genetic techniques for mapping as E. coli. In conclusion, it has become clear that the genomes of prokaryotic and simple eukaryotic microorganisms contain numerous classes of short repeated sequences, just as was found a quarter of a century ago for the genomes of higher organisms. The functions of these sequences are not yet clear, and notions as to their origin and mechanism of dispersion are likewise only speculative. Nevertheless, these sequences are beginning to provide new methodologies for the identification and genetic analysis of microorganisms and are likely to play an increasingly important role in studies of microorganisms in the future, as they have in higher eukaryotes. 4528 MINIREVIEW REFERENCES 1. Blackburn, E. H. 1990. Telomeres: structure and synthesis. J. Biol. Chem. 265:5919-5921. 2. Britten, R. J., and D. E. Kohne. 1968. Repeated sequences in DNA. Science 161:529-540. 3. Colman, S. D., P. C. Hu, and K. F. Bott. 1990. Prevalence of novel repeat sequences in and around the P1 operon in the genome of Mycoplasma pneumoniae. Gene 87:91-96. 4. Correia, F. F., S. Inouye, and M. Inouye. 1986. A 26-base-pair repetitive sequence specific for Neisseria gonorrhoeae and Neisseria meningitidis genomic DNA. J. Bacteriol. 167:10091015. 5. Dahl, M. K., E. Francoz, W. Saurin, W. Booz, M. D. Manson, and M. Hofnung. 1989. Comparison of sequences from the malB regions of Salmonella typhimurium and Enterobacter aerogenes with Escherichia coli K12: a potential new regulatory site in the interoperonic region. Mol. Gen. Genet. 218:199-207. 6. de Bruijn, F. J. 1992. Use of repetitive (repetitive extragenic element and enterobacterial repetitive intergeneric consensus) sequences and the polymerase chain reaction to fingerprint the genomes of Rhizobium meliloti isolates and other soil bacteria. Appl. Environ. Microbiol. 58:2180-2187. 6a.Dimri, G. P., K. E. Rudd, M. K. Morgan, H. Bayat, and G. F.-L. Ames. 1992. Physical mapping of repetitive extragenic palindromic sequences in Escherichia coli and phylogenetic distribution among Escherichia coli strains and other enteric bacteria. J. Bacteriol. 174:4583-4593. 7. Edwards, A., A. Civitello, H. A. Hammond, and C. T. Caskey. 1991. DNA typing and genetic mapping with trimeric and tetrameric tandem repeats. Am. J. Hum. Genet. 49:746-756. 8. Eisenach, K. D., M. D. Cave, J. H. Bates, and J. T. Crawford. 1990. Polymerase chain reaction amplification of a repetitive DNA sequence specific for Mycobacterium tuberculosis. J. Infect. Dis. 161:977-981. 9. Flores, M., V. Gonzalez, S. Brom, E. Martinez, D. Pifiero, D. Romero, G. Dafvila, and R. Palacios. 1987. Reiterated DNA sequences in Rhizobium and Agrobacterium spp. J. Bacteriol. 169:5782-5788. (Erratum, 170:2002, 1988.) 10. Fujitani, S., T. Komano, and S. Inouye. 1991. A unique repetitive DNA sequence in the Myxococcus xanthus genome. J. Bacteriol. 173:2125-2127. 11. Gilson, E., S. Bachellier, S. Perrin, D. Perrin, P. A. Grimont, F. Grimont, and M. Hofnung. 1990. Palindromic unit highly repetitive DNA sequences exhibit species specificity within Enterobacteriaceae. Res. Microbiol. 141:1103-1116. 12. Gilson, E., J.-M. Clement, D. Brutlag, and M. Hofnung. 1984. A family of dispersed repetitive extragenic palindromic DNA sequences in E. coli. EMBO J. 3:1417-1421. 13. Gilson, E., J.-M. Clement, S. Perrin, and M. Hofnung. 1987. Palindromic units: a case of highly repetitive DNA sequences in bacteria. Trends Genet. 3:226-230. 14. Gilson, E., D. Perrin, J. M. Clement, S. Szmelcman, E. Dassa, and M. Hofnung. 1986. Palindromic units from E. coli as binding sites for a chromoid-associated protein. FEBS Lett. 206:323328. 15. Gilson, E., D. Perrin, and M. Hofnung. 1990. DNA polymerase I and a protein complex bind specifically to E. coli palindromic unit highly repetitive DNA: implications for bacterial chromosome organization. Nucleic Acids Res. 18:3941-3952. 16. Gilson, E., J. P. Rousset, J.-M. Clement, and M. Hofnung. 1986. A subfamily of E. coli palindromic units implicated in transcription termination? Ann. Inst. Pasteur/Microbiol. (Paris) 137B: 259-270. 17. Gilson, E., W. Saurin, D. Perrin, S. Bachellier, and M. Hofnung. 1991. The BIME family of bacterial highly repetitive sequences. Res. Microbiol. 142:217-222. 18. Gilson, E., W. Saurin, D. Perrin, S. Bachellier, and M. Hofnung. 1991. Palindromic units are part of a new bacterial interspersed mosaic element (BIME). Nucleic Acids Res. 19:1375-1383. 19. Glare, E. M., J. C. Paton, R. R. Premier, A. J. Lawrence, and I. T. Nisbet. 1990. Analysis of a repetitive DNA sequence from Bordetella pertussis and its application to the diagnosis of pertussis using the polymerase chain reaction. J. Clin. Micro- J. BACTIEFRIOL. biol. 28:1982-1987. 20. Guzzetta, V., R. Montes de Oca-Luna, J. R. Lupski, and P. I. Patel. 1990. Isolation of region-specific and polymorphic markers from chromosome 17 by restricted Alu polymerase chain reaction. Genomics 9:31-36. 21. Haas, R., and T. F. Meyer. 1986. The repertoire of silent pilus genes in Neisseria gonorrhoeae: evidence for gene conversion. Cell 44:107-115. 22. Hailing, S. M., and E. S. Zehr. 1990. Polymorphism in Brucella spp. due to highly repeated DNA. J. Bacteriol. 172:6637-6640. 23. Herman, G. E., J. H. Nadeau, and S. C. Hardies. 1992. Dispersed repetitive elements in mouse genome analysis. Mamm. Genome 2:207-214. 24. Higgins, C. F., G. F.-L. Ames, W. M. Barnes, J. M. Clement, and M. Hofnung. 1982. A novel intercistronic regulatory element of prokaryotic operons. Nature (London) 298:760-762. 25. Higgins, C. F., R. S. McLaren, and S. F. Newbury. 1988. Repetitive extragenic palindromic sequences, mRNA stability and gene expression: evolution by gene conversion?-a review. Gene 72:3-14. 26. Hudson, G. S., and B. E. Davidson. 1984. Nucleotide sequence and transcription for the phenylalanine and tyrosine operons of E. coli K12. J. Mol. Biol. 180:1023-1051. 27. Hulton, C. S., C. F. Higgins, and P. M. Sharp. 1991. ERIC sequences: a novel family of repetitive elements in the genomes of Escherichia coli, Salmonella typhimurium and other enterobacteria. Mol. Microbiol. 5:825-834. 28. Jeffreys, A. J., V. Wilson, and S. L. Thein. 1985. Hypervariable minisatellite regions in human DNA. Nature (London) 314:6773. 29. Jelinek, W. R., T. P. Toomey, L. Leinwand, C. H. Duncan, P. A. Biro, P. V. Choudary, S. M. Weissman, C. M. Rubin, C. M. Houck, P. L. Deininger, and C. W. Schmid. 1980. Ubiquitous, interspersed repeated sequences in mammalian genomes. Proc. Natl. Acad. Sci. USA 77:1398-1402. 30. Kohara, Y., K. Akiyama, and K. Isono. 1987. The physical map of the whole E. coli chromosome: application of a new strategy for rapid analysis and sorting of a large genomic library. Cell 50:495-508. 31. Lennon, E., P. D. Gutman, H. L. Yao, and K. W. Minton. 1991. A highly conserved repeated chromosomal sequence in the radioresistant bacterium Deinococcus radiodurans SARK. J. Bacteriol. 173:2137-2140. 32. Lin, R.-J., M. Capage, and C. W. Hill. 1984. A repetitive DNA sequence, rhs, responsible for duplications within the Escherichia coli K-12 chromosome. J. Mol. Biol. 177:1-18. 33. Litt, M., and J. A. Luty. 1989. A hypervariable microsatellite revealed by in vitro amplification of a dinucleotide repeat within the cardiac muscle actin gene. Am. J. Hum. Genet. 44:397-401. 34. Makino, K., S. K. Kim, H. Shinagawa, M. Amemura, and A. Nakata. 1991. Molecular analysis of the cryptic and functional phn operons for phosphonate use in Escherichia coli K-12. J. Bacteriol. 173:2665-2672. 35. Mazel, D., J. Houmard, A. M. Castets, and N. Tandeau de Marsac. 1990. Highly repetitive DNA sequences in cyanobacterial genomes. J. Bacteriol. 172:2755-2761. 36. Nakamura, Y., M. Leppert, P. O'Connell, R. Wolff, T. Holm, M. Culver, C. Martin, E. Fujimoto, M. Hoff, E. Kumlin, and R. White. 1987. Variable number of tandem repeat (VNTR) markers for human gene mapping. Science 235:1616-1622. 37. Nelson, D. L., S. A. Ledbetter, L. Corbo, M. F. Victoria, R. Ramirez-Solis, T. D. Webster, D. H. Ledbetter, and C. T. Caskey. 1989. Alu polymerase chain reaction: a method for rapid isolation of human-specific sequences from complex DNA sources. Proc. Natl. Acad. Sci. USA 86:6686-6690. 38. Neuer-Nitsche, B., X. Lu, and D. Werner. 1988. Functional role of a highly repetitive DNA sequence in anchorage of the mouse genome. Nucleic Acids Res. 16:8351-8360. 39. Newbury, S. F., N. H. Smith, and C. F. Higgins. 1987. Differential mRNA stability controls relative gene expression within a polycistronic operon. Cell 51:1131-1143. 40. Newbury, S. F., N. H. Smith, E. C. Robinson, I. D. Hiles, and C. F. Higgins. 1987. Stabilization of translationally active VOL. 174, 1992 mRNA by prokaryotic REP sequences. Cell 48:297-310. 41. Noda, A., J. B. Courtwright, P. F. Denor, G. Webb, Y. Kohara, and A. Ishihama. 1991. Rapid identification of specific genes in E. coli by hybridization to membranes containing the ordered set of phage clones. BioTechniques 10:474-477. 42. Nur, I., D. J. LeBlanc, and J. G. Tully. 1987. Short, interspersed, and repetitive DNA sequences in Spiroplasma species. Plasmid 17:110-116. 43. Patel, P. I., C. Garcia, R. Montes de Oca-Luna, R. I. Malamut, B. Franco, S. Slaugenhaupt, A. Chakravarti, and J. R. Lupski. 1990. Isolation of a marker linked to the Charcot-Marie-Tooth disease type IA gene by differentialAlu-PCR of human chromosome 17-retaining hybrids. Am. J. Hum. Genet. 47:926-934. 44. Petes, T. D., and C. W. Hill. 1988. Recombination between repeated genes in microorganisms. Annu. Rev. Genet. 22:147168. 45. Riley, D. E., M. Samadpour, and J. N. Krieger. 1991. Detection of variable DNA repeats in diverse eukaryotic microorganisms by a single set of polymerase chain reaction primers. J. Clin. Microbiol. 29:2746-2751. 46. Ross, B. C., K. Raios, K. Jackson, and B. Dwyer. 1992. Molecular cloning of a highly repeated DNA element from Mycobacterium tuberculosis and its use as an epidemiological tool. J. Clin. Microbiol. 30:942-946. 47. Rudd, K. E. Alignment of E. coli DNA sequences to a revised, integrated genomic restriction map, in press. In J. Miller (ed.), A short course in bacterial genetics: a laboratory manual and handbook for Escherichia coli and related bacteria. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 48. Rudd, K. E., W. Miller, C. Werner, J. Ostell, C. Tolstoshev, and S. G. Satterfield. 1991. Mapping sequenced E. coli genes by computer: software, strategies, and examples. Nucleic Acids Res. 19:637-647. 49. Ruland, K., R. Wenzel, and R. Herrmann. 1990. Analysis of three different repeated DNA elements present in the P1 operon of Mycoplasma pneumoniae: size, number and distribution on the genome. Nucleic Acids Res. 18:6311-6317. 50. Russell, G. C., and N. H. Mann. 1986. Analysis of inverted repeat DNA in the genome of Rhodomicrobium vannielii. J. Gen. Microbiol. 132:325-330. 51. Sadosky, A. B., A. Davidson, R.-J. Lin, and C. W. Hill. 1989. rhs gene family of Escherichia coli K-12. J. Bacteriol. 171:636-642. 52. Sadosky, A. B., J. A. Gray, and C. W. Hill. 1991. The RhsD-E subfamily of Escherichia coli K-12. Nucleic Acids Res. 19:71777183. 53. Sapienza, C., and W. F. Doolittle. 1982. Unusual physical organization of the Halobacterium genome. Nature (London) 295:384-389. 54. Scherer, S., and D. A. Stevens. 1988. A Candida albicans dispersed, repeated gene family and its epidemiologic applications. Proc. Natl. Acad. Sci. USA 85:1452-1456. 55. Schmid, C. W., and W. R. Jelinek. 1982. The Alu family of dispersed repetitive sequences. Science 216:1065-1070. 56. Sharples, G. J., and R. G. Lloyd. 1990. A novel repeated DNA sequence located in the intergenic regions of bacterial chromosomes. Nucleic Acids Res. 18:6503-6508. 57. Shyamala, V., E. Schneider, and G. F. Ames. 1990. Tandem MINIREVIEW 4529 chromosomal duplications: role of REP sequences in the recombination event at the join-point. EMBO J. 9:939-946. 58. Stern, M. J., G. F.-L. Ames, N. H. Smith, E. C. Robinson, and C. F. Higgins. 1984. Repetitive extragenic palindromic sequences: a major component of the bacterial genome. Cell 37:1015-1026. 59. Stern, M. J., E. Prossnitz, and G. F.-L. Ames. 1988. Role of the intercistronic region in post-transcriptional control of gene expression in the histidine transport operon of Salmonella typhimurium: involvement of REP sequences. Mol. Microbiol. 2:141-152. 60. Stringer, S. L., S.-T. Hong, D. Giuntoli, and J. R. Stringer. 1991. Repeated DNA in Pneumocystis carinii. J. Clin. Microbiol. 29:1194-1201. 61. Su, C. J., A. Chavoya, and J. B. Baseman. 1988. Regions of Mycoplasma pneumoniae cytadhesin P1 structural gene exist as multiple copies. Infect. Immun. 56:3157-3161. 62. Subramanian, P. S., J. Versalovic, E. R. B. McCabe, and J. R. Lupski. 1992. Rapid mapping of Escherichia coli::TnS insertion mutations by REP-TnS PCR. PCR Methods Appl. 1:187-194. 63. Versalovic, J., T. Koeuth, and J. R. Lupski. 1991. Distribution of repetitive DNA sequences in eubacteria and application to fingerprinting of bacterial genomes. Nucleic Acids Res. 19: 6823-31. 64. Versalovic, J., T. Koeuth, Y.-H. Zhang, E. R. B. McCabe, and J. R. Lupski. Quality control for bacterial inhibition assays: DNA fingerprinting of microorganisms by rep-PCR. Screening, in press. 64a.Versalovic, J., and J. R. Lupski. Submitted for publication. 65. Weber, J. L., and P. E. May. 1989. Abundant class of human DNA polymorphisms which can be typed using the polymerase chain reaction. Am. J. Hum. Genet. 44:388-396. 66. Welsh, J., and M. McClelland. 1990. Fingerprinting genomes using PCR with arbitrary primers. Nucleic Acids Res. 18:72137218. 67. Welsh, J., and M. McClelland. 1991. Genomic fingerprinting using arbitrarily primed PCR and a matrix of pairwise combinations of primers. Nucleic Acids Res. 19:5275-5279. 68. Wenzel, R., and R. Herrmann. 1988. Repetitive DNA sequences in Mycoplasma pneumoniae. Nucleic Acids Res. 16:8337-8350. 69. Woese, C. R. 1987. Bacterial evolution. Microbiol. Rev. 51:221271. 70. Yang, Y., and G. F.-L. Ames. 1988. DNA gyrase binds to the family of prokaryotic repetitive extragenic palindromic sequences. Proc. Natl. Acad. Sci. USA 85:8850-8854. 71. Yang, Y., and G. F.-L. Ames. 1990. The family of repetitive extragenic palindromic sequences: interaction with DNA gyrase and histonelike protein HU, p. 211-225. In K. Drlica and M. Riley (ed.), The bacterial chromosome. American Society for Microbiology, Washington, D.C. 72. Zakian, V. A. 1989. Structure and function of telomeres. Annu. Rev. Genet. 23:579-604. 73. Zuerner, R. L., and C. A. Bolin. 1988. Repetitive sequence element cloned from Leptospira interrogans serovar hardjo type hardjo-bovis provides a sensitive diagnostic probe for bovine leptospirosis. J. Clin. Microbiol. 26:2495-2500.