Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
International Journal of Systematic and Evolutionary Microbiology (2014), 64, 384–391 Review DOI 10.1099/ijs.0.057091-0 A polyphasic strategy incorporating genomic data for the taxonomic description of novel bacterial species Dhamodharan Ramasamy, Ajay Kumar Mishra, Jean-Christophe Lagier, Roshan Padhmanabhan, Morgane Rossi, Erwin Sentausa, Didier Raoult and Pierre-Edouard Fournier Correspondence Pierre-Edouard Fournier pierre-edouard.fournier@ univ-amu.fr Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes URMITE, Institut Hospitalo-Universitaire Méditerranée-Infection, Aix-Marseille Université, UMR63, CNRS 7278, IRD 198, INSERM U1095, Faculté de Médecine, 27 Bd Jean Moulin, 13005 Marseille, France Currently, bacterial taxonomy relies on a polyphasic approach based on the combination of phenotypic and genotypic characteristics. However, the current situation is paradoxical in that the genetic criteria that are used, including DNA–DNA hybridization, 16S rRNA gene sequence nucleotide similarity and phylogeny, and DNA G+C content, have significant limitations, but genome sequences that contain the whole genetic information of bacterial strains are not used for taxonomic purposes, despite the decreasing costs of sequencing and the increasing number of available genomes. Recently, we diversified bacterial culture conditions with the aim of isolating uncultivated bacteria. To classify the putative novel species that we cultivated, we used a polyphasic strategy that included phenotypic as well as genomic criteria (genome characteristics as well as genomic sequence similarity). Herein, we review the pros and cons of genome sequencing for taxonomy and propose that the incorporation of genome sequences in taxonomic studies has the advantage of using reliable and reproducible data. This strategy, which we name taxono-genomics, may contribute to the taxonomic classification of bacteria. Introduction Taxonomic information is essential, as it enables scientists to understand the biodiversity and relationships among living organisms from different ecosystems (Gevers et al., 2005). For prokaryotes, taxonomy plays an essential role in enabling the reliable identification of microbial strains from clinical or environmental specimens (Moore et al., 2010). Bacterial taxonomy was initiated in the late 19th century. Initially, bacteria were classified on the basis of basic phenotypic markers such as morphology, growth requirements or pathogenic potential (Lehmann & Neumann, 1896). Later, physiological and biochemical properties of bacteria were also used for this purpose (Orla-Jensen, 1909; Buchanan, 1955). Between the 1960s and the 1980s, chemotaxonomy (Minnikin et al., 1975), numerical taxonomy and DNA–DNA hybridization techniques (Brenner et al., 1969; Johnson, 1991) were used. In the 1980s, the advent of DNA amplification and sequencing techniques, in particular of the 16S rRNA gene, constituted a major step forward by facilitating bacterial classification (Gürtler & Mayall, 2001; Coenye & Vandamme, 2004; Konstantinidis & Abbreviations: ANI, average nucleotide identity; DDH, DNA–DNA hybridization; HGT, horizontal gene transfer. 384 Tiedje, 2007). Then, starting in the mid-1990s, wholegenome sequencing constituted a revolution by giving access to the complete genetic information of a strain (Janssen et al., 2003). Despite this tremendous progress and the various genome-based methods that were developed and proposed for taxonomic purposes, including multilocus sequence analysis and average nucleotide identity (ANI), wholegenome analysis (Stackebrandt et al., 2002; Rosselló-Mora, 2005; Goris et al., 2007; Konstantinidis & Tiedje, 2007; Yarza et al., 2008) has not as yet been accepted as a source of taxonomic information. Therefore, currently, routine identification of a bacterial strain is most often obtained by comparing its phenotypic and/or molecular characteristics with those of type strains of previously described species. In the case of an unusual strain, several minimal standards are used to determine whether it fulfils the requirements to be assigned to a novel taxon. However, the validity of this system is debated, as several of the criteria used were selected empirically, and some of the methods used are time- and moneyconsuming, are not accessible to all laboratories and lack intra- and inter-laboratory reproducibility (Stackebrandt & Ebers, 2006; Rosselló-Mora, 2006). Downloaded from www.microbiologyresearch.org by 057091 G 2014 IUMS IP: 88.99.165.207 On: Mon, 19 Jun 2017 01:44:53 Printed in Great Britain Use of genomics in taxonomic classification of bacteria Current methods used in prokaryote taxonomy Currently, taxonomy of prokaryotes relies on polyphasic combinations of phenotypic, chemotaxonomic and genotypic characteristics (Vandamme et al., 1996; Stackebrandt et al., 2002; Tindall et al., 2010). Phenotypic methods applied to prokaryotic taxonomy are not described in this article, but include a wide array of methods the aim of which is to describe the main physical characteristics of micro-organisms, including morphology, staining and antigenic properties, ultrastructure and chemical composition of the cell wall and outer membrane, metabolic potential, protein composition, pathogenesis and habitat (Sneath & Sokal, 1973; Schleifer & Stackebrandt, 1983; Smibert & Krieg, 1994; Palleroni, 2010; Welker & Moore, 2011). Among the genotypic criteria, DNA–DNA hybridization (DDH), DNA G+C content and 16S rRNA gene sequence analysis have been widely used in bacterial taxonomy (Tindall et al., 2010; Stackebrandt & Ebers, 2006). Though it was initially designed empirically, DDH is a widely used technique to estimate the genetic relatedness between micro-organisms and is still considered as the ‘gold standard’ criterion for species delineation of prokaryotes (Wayne et al., 1987). A DDH value ¡70 % indicates that the tested bacteria belong to distinct species. However, DDH suffers from various limitations, notably that: (i) the cut-off values are not applicable to all prokaryote genera [in particular, determining the taxonomic status of an isolate is impossible when the phylogenetically closest species have DDH values .70 %, as is the case for most species of the genus Rickettsia (Fournier & Raoult, 2009)]; (ii) determining DDH requires special facilities available in a limited number of laboratories; and (iii) it is a labour-intensive and expensive method that lacks reproducibility and cannot be used to establish a comparative reference database incrementally (Tindall et al., 2010). Therefore, in the current era of genomics, DDH appears to be outdated and needs to be replaced by easier and more reliable methods. The 16S rRNA gene is an effective molecular marker for taxonomic purposes because it is ubiquitous, functionally stable, highly conserved and poorly subject to horizontal gene transfer (HGT). 16S rRNA gene sequence variations observed in archaea and bacteria were used as the backbone for the classification of prokaryotes (Woese et al., 1990; Fox et al., 1992; Olsen & Woese 1993; Brenner et al., 2001; Stackebrandt et al., 2002). Therefore, sequencing and phylogenetic analysis of the 16S rRNA gene has been considered as a suitable method for determining the classification of bacterial isolates at various taxonomic levels. 16S rRNA gene sequence identity values of 97 and 95 % have been used as cut-offs to classify bacterial isolates as novel taxa at the genus and species levels, respectively, when compared with their phylogenetically closest neighbours with validly published names (Stackebrandt et al., 2002). The former value was increased to 98.7 % in 2006 (Stackebrandt & Ebers, 2006). Such values usually being correlated with DDH results (Rosselló-Móra, 2006), Stackebrandt and colleagues proposed that 16S rRNA gene http://ijs.sgmjournals.org sequence identity might even replace DDH (Stackebrandt et al., 2002; Stackebrandt & Ebers, 2006). However, the 16S rRNA gene also exhibits several limitations as a taxonomic marker including: (i) its high degree of conservation in some genera, as is the case for species of the genus Brucella, which do not differ by more than 1 % (Gándara et al., 2001); (ii) the presence of nucleotide variations among multiple rRNA operons in a single genome (Rainey et al., 1996; Acinas et al., 2004) and (iii) the possibility of 16S rRNA genes being acquired by HGT that may distort relationships between taxa in phylogenetic trees (Jain et al., 1999). The DNA G+C content of prokaryotes is often used to grossly classify prokaryotes, as is the case for the phylum Actinobacteria, also referred to as the high-G+C-content Gram-positive bacteria, and the Firmicutes, referred to as the low-G+C-content Gram-positive bacteria. Although intra-genomically variable, due to HGT, differences of .5 and .10 % between strains were used to classify them within distinct species or genera, respectively (Goodfellow et al., 1997). However, such values do not apply to all bacterial genera. As an example, all species within the genus Rickettsia exhibit less than 5 mol% difference in DNA G+C content (Fournier & Raoult, 2009). Genomics and taxonomy As early as 1999, Fitz Gibbon & House (1999) proposed that the presence or absence of genes within genomes might be used to assess taxonomic relationships among prokaryotes. Later studies also suggested that genomic sequences may represent a source of taxonomic parameters including chromosomal gene order and metabolic pathways (Snel et al., 1999; Huson & Steel, 2004), comparison of orthologous genes (Coenye & Vandamme, 2003) and the presence of indels or single nucleotide polymorphisms (SNPs) in conserved genes (Gupta, 2001). Phylogenetic studies based on the comparison of orthologous genes and the presence or absence of genes further demonstrated good congruence with studies built by comparison of 16S rRNA gene sequences (Zhi et al., 2012). However, in the decade following the pioneering sequencing of the genome of Haemophilus influenzae in 1995 (Fleischmann et al., 1995), genome sequencing remained labour- and money-consuming and thus poorly adapted to routine use. The decreasing costs and high throughput of next-generation sequencing methods have enabled thousands of genomes to be sequenced (Soon et al., 2013). As of 30 July 2013, more than 6000 prokaryotic genome sequences were available in public databases [National Center for Biotechnology Information (http://www.ncbi.nlm.nih. gov/genome), European Molecular Biology Laboratory (http://www.ebi.ac.uk/genomes) and Genomes Online Database (http://www.genomesonline.org)]. Such a large number of available genomes facilitates whole-genome sequence comparison of new isolates (http://img.jgi.doe.gov/cgi-bin/ w/main.cgi; Markowitz et al., 2012). Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 19 Jun 2017 01:44:53 385 D. Ramasamy and others Over the past few years, several authors have suggested that genome sequences may be used as a source of taxonomic information. In particular, some of them studied the correlation between the percentage of nucleotide sequence similarity at the core genome level and DDH results. Konstantinidis and colleagues suggested that the ANI, i.e. the mean nucleotide sequence identity of shared genes between two strains, was a valid alternative to DDH (Konstantinidis & Tiedje, 2005; Goris et al., 2007). These authors even proposed that an ANI value ¢95 % between genomes corresponded to a DDH value of ¢70 % (Goris et al., 2007). In 2009, Richter & Rosselló-Móra (2009) proposed that ANI might be used as an alternative to DDH for species delineation, and might be obtained from comparison of only 20 % of any given genome sequence. To date, the ANI method has been used to describe several novel bacterial genera and/or species, including Dehalococcoides mccartyi (Loffler et al., 2012), Sphaerochaeta globosa and S. pleomorpha (Ritalahti et al., 2012) and Vibrio caribbeanicus (Hoffmann et al., 2012). However, Ozen et al. (2012) have argued that ANI results are not systematically consistent with current taxonomy and thus should not be used as a single tool for prokaryote classification. Other parameters based on genome sequences have been proposed to discriminate species, such as the maximal unique matches index (genomic distance index based on DNA conservation of the core genome and the proportion shared DNA by two genomes; Deloger et al., 2009), tetranucleotide regression [differences between observed and expected values of the frequencies of all 256 possible tetranucleotide (A, T, G, C) combinations; Richter & Rosselló-Móra, 2009] and genome-to-genome comparison [total length of all high-scoring segment pairs (HSPs) identified by a BLAST search; Auch et al., 2010]. The latter parameter may be calculated using the genome-to-genome distance calculator (GGDC) available online (http://ggdc. dsmz.de/distcalc2.php). However, several authors have pointed out drawbacks of genomics for taxonomic purposes. In 2010, Klenk & Göker (2010) argued that sequenced genomes were lacking for many major lineages of prokaryotes, but the rapidly increasing number of available genome sequences, doubling almost every 2 years, should overcome this limitation. Zhi et al. (2012) pointed out that the increasing number of sequenced genomes remains biased towards prokaryotes of medical or biotechnological importance, and Tindall et al. (2010) observed that the genome sequences of species of interest are not always those of type strains. Ricker et al. (2012) also contested the quality of genome sequences available in public databases, which may be available as complete sequences, draft assemblies, scaffolds or contigs. Klassen & Currie (2012) argued that draft genomes may be less informative than complete genomes for taxonomic purposes, and that there is a need to define minimal Bacterial isolate If score lower than 2 with species available in the database MALDI-TOF MS If nucleotide sequence identity is lower than 98.7% with phylogenetically closest species with validly published name AND This value is lower than or equal to the highest similarity observed among species of the same genus with standing in nomenclature Comparison of phenotypic 16S rRNA gene sequence analysis Putative novel bacterial species Genome sequencing and annotation and chemotaxonomic characteristics Gram staining, cell size and shape, motility, sporulation, oxygen requirement, biochemical characteristics, MALDI-TOF spectrum Determination of AGIOS values between the new isolate and other closely related species Comparison of genome characteristics with closest phylogenetic neighbours for which genomes are available Description of novel bacterial species If in the range of those observed among closely related species Deposit in two culture collections Fig. 1. Polyphasic strategy used in our laboratory for the taxonomic classification of bacterial isolates. 386 Downloaded from www.microbiologyresearch.org by International Journal of Systematic and Evolutionary Microbiology 64 IP: 88.99.165.207 On: Mon, 19 Jun 2017 01:44:53 http://ijs.sgmjournals.org Table 1. Novel species identified and described using our polyphasic approach incorporating genome analysis Species Phylum Proteobacteria ‘Enterobacter massiliensis’ ‘Herbaspirillum massiliense’ Oxygen requirement Genome size (Mb) DNA G+C content (mol%) Accession no. Genome 16S rRNA gene Reference ACS-065V-Col13 MM10403188 phR JC122 JC66 ph5 ph1 JC140 JC401 JC6 JC13 AP8 ph2 Anaerobic Aerobic Anaerobic Anaerobic Facultatively anaerobic Anaerobic Anaerobic Anaerobic Anaerobic Aerobic Anaerobic Aerobic Anaerobic 1.79 4.632 5.05 3.89 5.58 2.1 1.77 1.84 1.75 4.98 3.57 4.61 1.77 28.56 37.3 53.1 26.8 48.2 33.9 30.01 32.2 30.7 37.6 40 34.1 51.4 AEXM00000000 CAET00000000 CAGW00000000 CAEV00000000 CAES00000000 CAGX00000000 CAHB00000000 CAEL00000000 CAEL00000000 CAHJ00000000 CAEN00000000 CAPG00000000 CAHC00000000 JF824805 JF824810 JN837488 JF824801 JF824808 JN837491 JN837495 JF824803 JN657222 JF824800 JF824807 JX101689 JN837487 Lagier et al. (2012b) Kokcha et al. (2012a) Hugon et al. (2013a) Mishra et al. (2012b) Mishra et al. (2012d) Mishra et al. (2012e) Mishra et al. (2013a) Mishra et al. (2013b) Mishra et al. (2012c) Ramasamy et al. (2013a) Ramasamy et al. (2013b) Mishra et al. (2013d) Hugon et al. (2013c) ph8 JC50 JC136 Anaerobic Anaerobic Anaerobic 3.16 4.01 3.49 58.6 58.4 58.82 CAHA00000000 CAHI00000000 CAEG00000000 JN837494 JF824804 JF824799 Hugon et al. (2013b) Mishra et al. (2012a) Lagier et al. (2012c) JC14 JC43 JC225 JC110 phI JC301 Aerobic Aerobic Aerobic Anaerobic Anaerobic Primarily aerobic, facultatively anaerobic 3.32 3.42 3.4 2.83 2.28 3.01 72.49 70 71.22 85.7 62 61.4 CAHG00000000 CAHK00000000 CAHD00000000 CAEM00000000 CAGZ00000000 CAHH00000000 JF824798 JF824806 JN657218 JF824809 JN837493 JN657220 Ramasamy et al. (2012) Kokcha et al. (2012b) Lagier et al. (2012e) Lagier et al. (2013a) Mishra et al. (2013c) Mishra et al. (2013d) JC163 JC206 Aerobic Aerobic 4.92 4.18 55.1 59.73 CAEO00000000 CAHF00000000 JN657217 JN657219 Lagier et al. (2013b) Lagier et al. (2012d) 387 Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 19 Jun 2017 01:44:53 Use of genomics in taxonomic classification of bacteria Phylum Firmicutes ‘Anaerococcus senegalensis’ ‘Bacillus timonensis’ Brevibacillus massiliensis ‘Clostridium senegalense’ ‘Paenibacillus senegalensis’ ‘Peptoniphilus grossensis’ ‘Peptoniphilus obesi’ ‘Peptoniphilus senegalensis’ ‘Peptoniphilus timonensis’ ‘Bacillus massiliosenegalensis’ ‘Dielma fastidiosa’ ‘Bacillus massilioanorexius’ ‘Kallipyga massiliensis’ Phylum Bacteroidetes ‘Alistipes obesi’ ‘Alistipes senegalensis’ ‘Alistipes timonensis’ Phylum Actinobacteria ‘Aeromicrobium massiliense’ Brevibacterium senegalense ‘Cellulomonas massiliensis’ ‘Senegalemassilia anaerobia’ ‘Enorma massiliensis’ ‘Timonella senegalensis’ Type strain D. Ramasamy and others sequencing quality for genomes to be included in taxonomic analyses. Taxono-genomics In our laboratory, in order to isolate bacteria that were unable to grow under routine culture conditions, Lagier et al. (2012a) implemented a strategy named ‘culturomics’, in which they diversified culture conditions (culture medium composition, temperature, atmosphere, incubation length). This strategy enabled the identification of 31 putative novel species, for which genome sequences were systematically determined. In this context, we proposed to use a polyphasic approach (Fig. 1) to describe these novel bacterial species using both phenotypic characteristics such as habitat, Gram-stain reaction, culture and metabolic characteristics, proteic spectrum and, when applicable, pathogenicity, and their genomic characteristics. In this polyphasic approach, bacterial isolates are first identified using MALDI-TOF MS as described previously (Seng et al., 2009) (Fig. 1). For all isolates that exhibit an MS score ,2 with the reference spectra of species whose names have standing in nomenclature, we perform amplification and sequencing of the complete 16S rRNA gene. The resulting sequences are then compared to GenBank (http://www.ncbi.nlm.nih.gov/genbank/), and their closest phylogenetic neighbours identified. Among those, we select only the species and/or genera that have validly published names, and retrieve the 16S rRNA gene sequences for the corresponding type strains from the List of Prokaryotic Names with Standing in Nomenclature website (http:// www.bacterio.net/index.html). Percentages of 16S rRNA gene sequence identity are then calculated between all pairs of studied taxa. However, rather than using only a universal 16S rRNA gene sequence similarity cut-off (Stackebrandt & Ebers, 2006), we also consider the variability of conservation of this gene among genera. Therefore, isolates that exhibit 16S rRNA gene sequence identity ,98.7 % with their closest phylogenetic neighbour, as recommended by Stackebrandt & Ebers (2006), are considered to represent putative novel species if this identity value is equal to or lower than the highest identity observed among species with names with standing in nomenclature within the same genus. The next step consists of characterizing the strains phenotypically and chemotaxonomically using laboratory tools that are widely available in microbiology laboratories and may be easily used to reproduce the data, including staining properties, cell shape and size, motility, sporulation, oxygen and other growth requirements, biochemical characteristics using commercially available strips and MALDI-TOF mass spectra. Their genome sequence is obtained by using a 454 GS-FLX Titanium platform (Roche Diagnostics) and annotated. Genome sequences of the strains of interest and their phylogenetically closest neighbours are then compared in terms of size, DNA G+C content, percentage of coding sequences, gene content, gene distribution in COG categories (Tatusov et al., 2001), presence of mobile genetic elements, 388 numbers of RNA genes, signal peptides and transmembrane helices. In addition, we determine the average of genomic identity of orthologous gene sequences (AGIOS) of studied strains by comparison with their closest phylogenetic neighbours as well as between members of the genus of interest. The AGIOS parameter is obtained by first identifying with BLASTP the set of orthologous proteins between pairs of compared genomes using minimal coverage of 50 % and a degree of amino acid identity of 30 %. We then determine the mean percentage of nucleotide sequence identity between these orthologous genes. The AGIOS and ANI parameters differ from each other in that, for the latter, orthologous genes are identified using BLASTN, which is less sensitive for this purpose than BLASTP. We acknowledge the fact that a drawback of our taxono-genomics method is that genome sequences from some strains from species with validly published names are currently not available, and our results may be modified slightly when these data become available. Type strains are also deposited in two reference culture collections. To date, we have described 24 novel species using this polyphasic method (Table 1), including Brevibacterium senegalense (Kokcha et al., 2012b, 2013) and Brevibacillus massiliensis (Hugon et al., 2013a, d), the names of which were validly published by inclusion in Validation List no. 153. Conclusion With the decreasing costs of high-throughput sequencing and the increasing number of prokaryotic genomes that are sequenced, and given the drawbacks of the current genetic gold standards that are used as taxonomic tools, should genomic sequences be excluded from taxonomic descriptions? We strongly believe that genomic information should be included among taxonomic criteria in addition to, but not instead of, phenotypic and chemotaxonomic parameters. We also suggest that putative novel taxa should always be compared with their phylogenetic neighbours with validly published names, and the genetic parameters obtained among members of the closest genus or family should be taken into account in the description of the novel species, rather than trying to define universal cut-offs that may be applicable to only a limited proportion of prokaryotes. However, we agree that there is no current consensus on the way genomic data should be integrated into the taxonomic classification of prokaryotes, but such a discussion should take place. References Acinas, S. G., Marcelino, L. A., Klepac-Ceraj, V. & Polz, M. F. (2004). Divergence and redundancy of 16S rRNA sequences in genomes with multiple rrn operons. J Bacteriol 186, 2629–2635. Auch, A. F., Klenk, H. P. & Göker, M. (2010). Standard operating procedure for calculating genome-to-genome distances based on high-scoring segment pairs. Stand Genomic Sci 2, 142–148. Brenner, D. J., Fanning, G. R., Rake, A. V. & Johnson, K. E. (1969). Batch procedure for thermal elution of DNA from hydroxyapatite. Anal Biochem 28, 447–459. Downloaded from www.microbiologyresearch.org by International Journal of Systematic and Evolutionary Microbiology 64 IP: 88.99.165.207 On: Mon, 19 Jun 2017 01:44:53 Use of genomics in taxonomic classification of bacteria Brenner, D. J., Staley, J. T. & Krieg, N. R. (2001). Classification of procaryotic organisms and the concept of bacterial speciation. In Bergey’s Manual of Systematic Bacteriology, 2nd edn, vol. 1, pp. 27–31. Edited by D. R. Boone, R. W. Castenholz & G. M. Garrity. New York: Springer. Buchanan, R. E. (1955). Taxonomy. Annu Rev Microbiol 9, 1–20. Coenye, T. & Vandamme, P. (2003). Extracting phylogenetic information from whole-genome sequencing projects: the lactic acid bacteria as a test case. Microbiology 149, 3507–3517. Coenye, T. & Vandamme, P. (2004). Use of the genomic signature in Hugon, P., Ramasamy, D., Robert, C., Couderc, C., Raoult, D. & Fournier, P. E. (2013c). Non-contiguous finished genome sequence and description of Kallipyga massiliensis gen. nov., sp. nov., a new member of the family Clostridiales Incertae Sedis XI. Stand Genomic Sci 8, 500–515. Hugon, P., Mishra, A. K., Lagier, J. C., Nguyen, T. T., Couderc, C., Raoult, D. & Fournier, P. E. (2013d). Brevibacillus massiliensis sp. nov. In List of new names and new combinations previously effectively, but not validly, published, Validation List no. 153. Int J Syst Evol Microbiol 63, 3131–3134. bacterial classification and identification. Syst Appl Microbiol 27, 175– 185. Huson, D. H. & Steel, M. (2004). Phylogenetic trees based on gene Deloger, M., El Karoui, M. & Petit, M. A. (2009). A genomic distance Jain, R., Rivera, M. C. & Lake, J. A. (1999). Horizontal gene transfer content. Bioinformatics 20, 2044–2049. based on MUM indicates discontinuity between most bacterial species and genera. J Bacteriol 191, 91–99. among genomes: the complexity hypothesis. Proc Natl Acad Sci U S A 96, 3801–3806. Fitz Gibbon, S. T. & House, C. H. (1999). Whole genome-based phylogenetic analysis of free-living microorganisms. Nucleic Acids Res 27, 4218–4222. Janssen, P., Audit, B., Cases, I., Darzentas, N., Goldovsky, L., Kunin, V., Lopez-Bigas, N., Peregrin-Alvarez, J. M., Pereira-Leal, J. B. & other authors (2003). Beyond 100 genomes. Genome Biol 4, 402. Fleischmann, R. D., Adams, M. D., White, O., Clayton, R. A., Kirkness, E. F., Kerlavage, A. R., Bult, C. J., Tomb, J. F., Dougherty, B. A. & other authors (1995). Whole-genome random sequencing and assembly of Johnson, J. L. (1991). DNA reassociation experiments. In Nucleic Acid Techniques in Bacterial Systematics, pp. 21–44. Edited by E. Stackebrandt & M. Goodfellow. Chichester: Wiley. Haemophilus influenzae Rd. Science 269, 496–512. Klassen, J. L. & Currie, C. R. (2012). Gene fragmentation in bacterial Fournier, P. E. & Raoult, D. (2009). Current knowledge on phylogeny and taxonomy of Rickettsia spp. Ann N Y Acad Sci 1166, 1–11. draft genomes: extent, consequences and mitigation. BMC Genomics 13, 14. Fox, G. E., Wisotzkey, J. D. & Jurtshuk, P., Jr (1992). How close is Klenk, H. P. & Göker, M. (2010). En route to a genome-based close: 16S rRNA sequence identity may not be sufficient to guarantee species identity. Int J Syst Bacteriol 42, 166–170. classification of Archaea and Bacteria? Syst Appl Microbiol 33, 175– 182. Gándara, B., Merino, A. L., Rogel, M. A. & Martı́nez-Romero, E. (2001). Limited genetic diversity of Brucella spp. J Clin Microbiol 39, Kokcha, S., Mishra, A. K., Lagier, J. C., Million, M., Leroy, Q., Raoult, D. & Fournier, P. E. (2012a). Non contiguous-finished genome 235–240. sequence and description of Bacillus timonensis sp. nov. Stand Genomic Sci 6, 346–355. Gevers, D., Cohan, F. M., Lawrence, J. G., Spratt, B. G., Coenye, T., Feil, E. J., Stackebrandt, E., Van de Peer, Y., Vandamme, P. & other authors (2005). Opinion: Re-evaluating prokaryotic species. Nat Rev Kokcha, S., Ramasamy, D., Lagier, J. C., Robert, C., Raoult, D. & Fournier, P. E. (2012b). Non-contiguous finished genome sequence Microbiol 3, 733–739. and description of Brevibacterium senegalense sp. nov. Stand Genomic Sci 7, 233–245. Goodfellow, M., Manfio, G. P. & Chun, J. (1997). Towards a practical species concept for cultivable bacteria. In Species: The Units of Biodiversity, pp. 25–29. Edited by M. F. Clarridge & H. A. Dawah. London: Chapman and Hall. Goris, J., Konstantinidis, K. T., Klappenbach, J. A., Coenye, T., Vandamme, P. & Tiedje, J. M. (2007). DNA–DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol 57, 81–91. Gupta, R. S. (2001). The branching order and phylogenetic placement of species from completed bacterial genomes, based on conserved indels found in various proteins. Int Microbiol 4, 187–202. Gürtler, V. & Mayall, B. C. (2001). Genomic approaches to typing, taxonomy and evolution of bacterial isolates. Int J Syst Evol Microbiol 51, 3–16. Hoffmann, M., Monday, S. R., Allard, M. W., Strain, E. A., Whittaker, P., Naum, M., McCarthy, P. J., Lopez, J. V., Fischer, M. & Brown, E. W. (2012). Vibrio caribbeanicus sp. nov., isolated from the marine sponge Scleritoderma cyanea. Int J Syst Evol Microbiol 62, 1736– 1743. Hugon, P., Mishra, A. K., Lagier, J. C., Nguyen, T. T., Couderc, C., Raoult, D. & Fournier, P. E. (2013a). Non-contiguous finished genome sequence and description of Brevibacillus massiliensis sp. nov. Stand Genomic Sci 8, 1–14. Hugon, P., Ramasamy, D., Lagier, J. C., Rivet, R., Couderc, C., Raoult, D. & Fournier, P. E. (2013b). Non contiguous-finished genome sequence and description of Alistipes obesi sp. nov. Stand Genomic Sci 7, 427–439. http://ijs.sgmjournals.org Kokcha, S., Ramasamy, D., Lagier, J. C., Robert, C., Raoult, D. & Fournier, P. E. (2013). Brevibacterium senegalense sp. nov. In List of new names and new combinations previously effectively, but not validly, published, Validation List no. 153. Int J Syst Evol Microbiol 63, 3131– 3134. Konstantinidis, K. T. & Tiedje, J. M. (2005). Genomic insights that advance the species definition for prokaryotes. Proc Natl Acad Sci U S A 102, 2567–2572. Konstantinidis, K. T. & Tiedje, J. M. (2007). Prokaryotic taxonomy and phylogeny in the genomic era: advancements and challenges ahead. Curr Opin Microbiol 10, 504–509. Lagier, J. C., Armougom, F., Million, M., Hugon, P., Pagnier, I., Robert, C., Bittar, F., Fournous, G., Gimenez, G. & other authors (2012a). Microbial culturomics: paradigm shift in the human gut microbiome study. Clin Microbiol Infect 18, 1185–1193. Lagier, J. C., El Karkouri, K., Nguyen, T. T., Armougom, F., Raoult, D. & Fournier, P. E. (2012b). Non-contiguous finished genome sequence and description of Anaerococcus senegalensis sp. nov. Stand Genomic Sci 6, 116–125. Lagier, J. C., Armougom, F., Mishra, A. K., Nguyen, T. T., Raoult, D. & Fournier, P. E. (2012c). Non-contiguous finished genome sequence and description of Alistipes timonensis sp. nov. Stand Genomic Sci 6, 315–324. Lagier, J. C., Gimenez, G., Robert, C., Raoult, D. & Fournier, P. E. (2012d). Non-contiguous finished genome sequence and description of Herbaspirillum massiliense sp. nov. Stand Genomic Sci 7, 200–209. Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 19 Jun 2017 01:44:53 389 D. Ramasamy and others Lagier, J. C., Ramasamy, D., Rivet, R., Raoult, D. & Fournier, P. E. (2012e). Non contiguous-finished genome sequence and description Mishra, A. K., Pfleiderer, A., Lagier, J. C., Robert, C., Raoult, D. & Fournier, P. E. (2013d). Non contiguous-finished genome sequence of Cellulomonas massiliensis sp. nov. Stand Genomic Sci 7, 258–270. and description of Bacillus massilioanorexicus sp. nov. Stand Genomic Sci 8, 465–479. Lagier, J. C., Elkarkouri, K., Rivet, R., Couderc, C., Raoult, D. & Fournier, P. E. (2013a). Non contiguous-finished genome sequence and description of Senegalemassilia anaerobia gen. nov., sp. nov. Stand Genomic Sci 7, 343–356. Lagier, J. C., El Karkouri, K., Mishra, A. K., Robert, C., Raoult, D. & Fournier, P. E. (2013b). Non contiguous-finished genome sequence and description of Enterobacter massiliensis sp. nov. Stand Genomic Sci 7, 399–412. Lehmann, K. B. & Neumann, R. (1896). Atlas und Grundriss der Bakteriologie und Lehrbuch der speziellen bakteriologischen Diagnostik. München: J. F. Lehmann (in German). Loffler, F. E., Yan, J., Ritalahti, K. M., Adrian, L., Edwards, E. A., Konstantinidis, K. T., Muller, J. A., Fullerton, H., Zinder, S. H. & Spormann, A. M. (2012). Dehalococcoides mccartyi gen. nov., sp. nov., obligately organohalide-respiring anaerobic bacteria relevant to halogen cycling and bioremediation, belong to a novel bacterial class, Dehalococcoidia classis nov., order Dehalococcoidales ord. nov. and family Dehalococcoidaceae fam. nov., within the phylum Chloroflexi. Int J Syst Evol Microbiol 63, 625–635. Markowitz, V. M., Chen, I.-M. A., Palaniappan, K., Chu, K., Szeto, E., Grechkin, Y., Ratner, A., Jacob, B., Huang, J. & other authors (2012). IMG: the Integrated Microbial Genomes database and comparative analysis system. Nucleic Acids Res 40 (Database issue), D115–D122. Minnikin, D. E., Alshamaony, L. & Goodfellow, M. (1975). Differentiation of Mycobacterium, Nocardia, and related taxa by thin-layer chromatographic analysis of whole-organism methanolysates. J Gen Microbiol 88, 200–204. Mishra, A. K., Gimenez, G., Lagier, J. C., Robert, C., Raoult, D. & Fournier, P. E. (2012a). Non-contiguous finished genome sequence and description of Alistipes senegalensis sp. nov. Stand Genomic Sci 6, 304–314. Mishra, A. K., Lagier, J. C., Robert, C., Raoult, D. & Fournier, P. E. (2012b). Non-contiguous finished genome sequence and description of Clostridium senegalense sp. nov. Stand Genomic Sci 6, 386–395. Mishra, A. K., Lagier, J. C., Robert, C., Raoult, D. & Fournier, P. E. (2012c). Non contiguous-finished genome sequence and description of Peptoniphilus timonensis sp. nov. Stand Genomic Sci 7, 1–11. Mishra, A. K., Lagier, J. C., Rivet, R., Raoult, D. & Fournier, P. E. (2012d). Non-contiguous finished genome sequence and description of Paenibacillus senegalensis sp. nov. Stand Genomic Sci 7, 70–81. Mishra, A. K., Hugon, P., Robert, C., Raoult, D. & Fournier, P. E. (2012e). Non contiguous-finished genome sequence and description of Peptoniphilus grossensis sp. nov. Stand Genomic Sci 7, 320–330. Mishra, A. K., Hugon, P., Lagier, J. C., Nguyen, T. T., Robert, C., Couderc, C., Raoult, D. & Fournier, P. E. (2013a). Non contiguous- finished genome sequence and description of Peptoniphilus obesi sp. nov. Stand Genomic Sci 7, 357–369. Moore, E. R., Mihaylova, S. A., Vandamme, P., Krichevsky, M. I. & Dijkshoorn, L. (2010). Microbial systematics and taxonomy: relevance for a microbial commons. Res Microbiol 161, 430–438. Olsen, G. J. & Woese, C. R. (1993). Ribosomal RNA: a key to phylogeny. FASEB J 7, 113–123. S. (1909). Die Hauptlinien des naturalischen Bakteriensystems nebst einer Ubersicht der Garungsphenomene. Zentralbl Bakteriol Parasitenkd Abt II 22, 305–346 (in German). Orla-Jensen, Ozen, A. I., Vesth, T. & Ussery, D. W. (2012). From genome sequence to taxonomy – a skeptic’s view. In The Prokaryotes, 4th edn, pp. 209–227. Edited by E. Rosenberg, E. F. DeLong, E. Stackebrandt, S. Lory & S. D. Thompson. Berlin: Springer. Palleroni, N. J. (2010). The Pseudomonas story. Environ Microbiol 12, 1377–1383. Rainey, F. A., Ward-Rainey, N. L., Janssen, P. H., Hippe, H. & Stackebrandt, E. (1996). Clostridium paradoxum DSM 7308T contains multiple 16S rRNA genes with heterogeneous intervening sequences. Microbiology 142, 2087–2095. Ramasamy, D., Kokcha, S., Lagier, J. C., N’Guyen, T. T., Raoult, D. & Fournier, P. E. (2012). Genome sequence and description of Aeromicrobium massilense sp. nov. Stand Genomic Sci 7, 246– 257. Ramasamy, D., Lagier, J.-C., Gorlas, A., Raoult, D. & Fournier, P.-E. (2013a). Non contiguous-finished genome sequence and description of Bacillus massiliosenegalensis sp. nov. Stand Genomic Sci 8, 264– 278. Ramasamy, D., Lagier, J. C., Nguyen, T. T., Raoult, D. & Fournier, P. E. (2013b). Non contiguous-finished genome sequence and description of Dielma fastidiosa gen. nov., sp. nov., a new member of the family Erysipelotrichaceae. Stand Genomic Sci 8, 336–351. Richter, M. & Rosselló-Móra, R. (2009). Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci U S A 106, 19126–19131. Ricker, N., Qian, H. & Fulthorpe, R. R. (2012). The limitations of draft assemblies for understanding prokaryotic adaptation and evolution. Genomics 100, 167–175. Ritalahti, K. M., Justicia-Leon, S. D., Cusick, K. D., RamosHernandez, N., Rubin, M., Dornbush, J. & Löffler, F. E. (2012). Sphaerochaeta globosa gen. nov., sp. nov. and Sphaerochaeta pleomorpha sp. nov., free-living, spherical spirochaetes. Int J Syst Evol Microbiol 62, 210–216. Rosselló-Mora, R. (2005). Updating prokaryotic taxonomy. J Bac- teriol 187, 6255–6257. Rosselló-Mora, R. (2006). DNA-DNA reassociation methods applied Mishra, A. K., Lagier, J. C., Nguyen, T. T., Raoult, D. & Fournier, P. E. (2013b). Non contiguous-finished genome sequence and description to microbial taxonomy and their critical evaluation. In Molecular Identification, Systematics, and Population Structure of Prokaryotes, pp. 23–50. Edited by E. Stackebrandt. Berlin: Springer. of Peptoniphilus senegalensis sp. nov. Stand Genomic Sci 7, 370–381. Schleifer, K. H. & Stackebrandt, E. (1983). Molecular systematics of Mishra, A. K., Hugon, P., Lagier, J. C., Nguyen, T. T., Couderc, C., Raoult, D. & Fournier, P. E. (2013c). Non contiguous-finished genome sequence and description of Enorma massiliensis gen. nov., sp. nov., a new member of the family Coriobacteriaceae. Stand Genomic Sci 8, 290–305. Mishra, A. K., Lagier, J. C., Robert, C., Raoult, D. & Fournier, P. E. (2013d). Genome sequence and description of Timonella senegalensis gen. nov., sp. nov., a new member of the suborder Micrococcinae. Stand Genomic Sci 8, 318–335. 390 prokaryotes. Annu Rev Microbiol 37, 143–187. Seng, P., Drancourt, M., Gouriet, F., La Scola, B., Fournier, P. E., Rolain, J. M. & Raoult, D. (2009). Ongoing revolution in bacteriology: routine identification of bacteria by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Clin Infect Dis 49, 543–551. Smibert, R. M. & Krieg, N. R. (1994). Phenotypic characterization. In Methods for General and Molecular Bacteriology, pp. 607–654. Edited by P. Gerhardt, R. G. E. Murray, W. A. Wood & N. R. Krieg. Washington, DC: American Society for Microbiology. Downloaded from www.microbiologyresearch.org by International Journal of Systematic and Evolutionary Microbiology 64 IP: 88.99.165.207 On: Mon, 19 Jun 2017 01:44:53 Use of genomics in taxonomic classification of bacteria Sneath, P. H. A. & Sokal, R. R. (1973). Numerical Taxonomy, the Principles and Practice of Numerical Classification. San Francisco: W. H. Freeman. Snel, B., Bork, P. & Huynen, M. A. (1999). Genome phylogeny based on gene content. Nat Genet 21, 108–110. Soon, W. W., Hariharan, M. & Snyder, M. P. (2013). High-throughput sequencing for biology and medicine. Mol Syst Biol 9, 640. Stackebrandt, E. & Ebers, J. (2006). Taxonomic parameters revisited: tarnished gold standards. Microbiol Today 33, 152–155. Stackebrandt, E., Frederiksen, W., Garrity, G. M., Grimont, P. A., Kämpfer, P., Maiden, M. C., Nesme, X., Rosselló-Mora, R., Swings, J. & other authors (2002). Report of the ad hoc committee for the re- evaluation of the species definition in bacteriology. Int J Syst Evol Microbiol 52, 1043–1047. Tatusov, R. L., Natale, D. A., Garkavtsev, I. V., Tatusova, T. A., Shankavaram, U. T., Rao, B. S., Kiryutin, B., Galperin, M. Y., Fedorova, N. D. & Koonin, E. V. (2001). The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res 29, 22–28. Tindall, B. J., Rosselló-Móra, R., Busse, H.-J., Ludwig, W. & Kämpfer, P. (2010). Notes on the characterization of prokaryote strains for taxonomic purposes. Int J Syst Evol Microbiol 60, 249–266. http://ijs.sgmjournals.org Vandamme, P., Pot, B., Gillis, M., de Vos, P., Kersters, K. & Swings, J. (1996). Polyphasic taxonomy, a consensus approach to bacterial systematics. Microbiol Rev 60, 407–438. Wayne, L. G., Brenner, D. J., Colwell, R. R., Grimont, P. A. D., Kandler, O., Krichevsky, L., Moore, L. H., Moore, W. C., Murray, R. G. E. & other authors (1987). Report of the ad hoc committee on reconciliation of approaches to bacterial systematics. Int J Syst Bacteriol 37, 463–464. Welker, M. & Moore, E. R. (2011). Applications of whole-cell matrix- assisted laser-desorption/ionization time-of-flight mass spectrometry in systematic microbiology. Syst Appl Microbiol 34, 2–11. Woese, C. R., Kandler, O. & Wheelis, M. L. (1990). Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci U S A 87, 4576–4579. Yarza, P., Richter, M., Peplies, J., Euzéby, J., Amann, R., Schleifer, K. H., Ludwig, W., Glöckner, F. O. & Rosselló-Móra, R. (2008). The All-Species Living Tree project: a 16S rRNA-based phylogenetic tree of all sequenced type strains. Syst Appl Microbiol 31, 241– 250. Zhi, X. Y., Zhao, W., Li, W. J. & Zhao, G. P. (2012). Prokaryotic systematics in the genomics era. Antonie van Leeuwenhoek 101, 21– 34. Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 19 Jun 2017 01:44:53 391