Download A polyphasic strategy incorporating genomic data for the taxonomic

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Zinc finger nuclease wikipedia , lookup

Exome sequencing wikipedia , lookup

Microsatellite wikipedia , lookup

Helitron (biology) wikipedia , lookup

Transcript
International Journal of Systematic and Evolutionary Microbiology (2014), 64, 384–391
Review
DOI 10.1099/ijs.0.057091-0
A polyphasic strategy incorporating genomic data
for the taxonomic description of novel bacterial
species
Dhamodharan Ramasamy, Ajay Kumar Mishra, Jean-Christophe Lagier,
Roshan Padhmanabhan, Morgane Rossi, Erwin Sentausa, Didier Raoult
and Pierre-Edouard Fournier
Correspondence
Pierre-Edouard Fournier
pierre-edouard.fournier@
univ-amu.fr
Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes URMITE, Institut
Hospitalo-Universitaire Méditerranée-Infection, Aix-Marseille Université, UMR63, CNRS 7278,
IRD 198, INSERM U1095, Faculté de Médecine, 27 Bd Jean Moulin, 13005 Marseille, France
Currently, bacterial taxonomy relies on a polyphasic approach based on the combination of
phenotypic and genotypic characteristics. However, the current situation is paradoxical in that the
genetic criteria that are used, including DNA–DNA hybridization, 16S rRNA gene sequence
nucleotide similarity and phylogeny, and DNA G+C content, have significant limitations, but
genome sequences that contain the whole genetic information of bacterial strains are not used for
taxonomic purposes, despite the decreasing costs of sequencing and the increasing number of
available genomes. Recently, we diversified bacterial culture conditions with the aim of isolating
uncultivated bacteria. To classify the putative novel species that we cultivated, we used a
polyphasic strategy that included phenotypic as well as genomic criteria (genome characteristics
as well as genomic sequence similarity). Herein, we review the pros and cons of genome
sequencing for taxonomy and propose that the incorporation of genome sequences in taxonomic
studies has the advantage of using reliable and reproducible data. This strategy, which we name
taxono-genomics, may contribute to the taxonomic classification of bacteria.
Introduction
Taxonomic information is essential, as it enables scientists
to understand the biodiversity and relationships among
living organisms from different ecosystems (Gevers et al.,
2005). For prokaryotes, taxonomy plays an essential role in
enabling the reliable identification of microbial strains
from clinical or environmental specimens (Moore et al.,
2010). Bacterial taxonomy was initiated in the late 19th
century. Initially, bacteria were classified on the basis of
basic phenotypic markers such as morphology, growth requirements or pathogenic potential (Lehmann & Neumann,
1896). Later, physiological and biochemical properties of
bacteria were also used for this purpose (Orla-Jensen, 1909;
Buchanan, 1955). Between the 1960s and the 1980s,
chemotaxonomy (Minnikin et al., 1975), numerical taxonomy and DNA–DNA hybridization techniques (Brenner
et al., 1969; Johnson, 1991) were used. In the 1980s, the
advent of DNA amplification and sequencing techniques, in
particular of the 16S rRNA gene, constituted a major step
forward by facilitating bacterial classification (Gürtler &
Mayall, 2001; Coenye & Vandamme, 2004; Konstantinidis &
Abbreviations: ANI, average nucleotide identity; DDH, DNA–DNA
hybridization; HGT, horizontal gene transfer.
384
Tiedje, 2007). Then, starting in the mid-1990s, wholegenome sequencing constituted a revolution by giving access
to the complete genetic information of a strain (Janssen et al.,
2003). Despite this tremendous progress and the various
genome-based methods that were developed and proposed
for taxonomic purposes, including multilocus sequence
analysis and average nucleotide identity (ANI), wholegenome analysis (Stackebrandt et al., 2002; Rosselló-Mora,
2005; Goris et al., 2007; Konstantinidis & Tiedje, 2007; Yarza
et al., 2008) has not as yet been accepted as a source of
taxonomic information.
Therefore, currently, routine identification of a bacterial
strain is most often obtained by comparing its phenotypic
and/or molecular characteristics with those of type strains
of previously described species. In the case of an unusual
strain, several minimal standards are used to determine
whether it fulfils the requirements to be assigned to a novel
taxon. However, the validity of this system is debated, as
several of the criteria used were selected empirically, and
some of the methods used are time- and moneyconsuming, are not accessible to all laboratories and lack
intra- and inter-laboratory reproducibility (Stackebrandt &
Ebers, 2006; Rosselló-Mora, 2006).
Downloaded from www.microbiologyresearch.org by
057091 G 2014 IUMS
IP: 88.99.165.207
On: Mon, 19 Jun 2017 01:44:53
Printed in Great Britain
Use of genomics in taxonomic classification of bacteria
Current methods used in prokaryote taxonomy
Currently, taxonomy of prokaryotes relies on polyphasic
combinations of phenotypic, chemotaxonomic and genotypic characteristics (Vandamme et al., 1996; Stackebrandt
et al., 2002; Tindall et al., 2010).
Phenotypic methods applied to prokaryotic taxonomy are not
described in this article, but include a wide array of methods
the aim of which is to describe the main physical characteristics
of micro-organisms, including morphology, staining and
antigenic properties, ultrastructure and chemical composition
of the cell wall and outer membrane, metabolic potential,
protein composition, pathogenesis and habitat (Sneath &
Sokal, 1973; Schleifer & Stackebrandt, 1983; Smibert & Krieg,
1994; Palleroni, 2010; Welker & Moore, 2011).
Among the genotypic criteria, DNA–DNA hybridization
(DDH), DNA G+C content and 16S rRNA gene sequence
analysis have been widely used in bacterial taxonomy (Tindall
et al., 2010; Stackebrandt & Ebers, 2006). Though it was
initially designed empirically, DDH is a widely used technique
to estimate the genetic relatedness between micro-organisms
and is still considered as the ‘gold standard’ criterion for
species delineation of prokaryotes (Wayne et al., 1987). A
DDH value ¡70 % indicates that the tested bacteria belong to
distinct species. However, DDH suffers from various limitations, notably that: (i) the cut-off values are not applicable to
all prokaryote genera [in particular, determining the taxonomic status of an isolate is impossible when the phylogenetically closest species have DDH values .70 %, as is the case
for most species of the genus Rickettsia (Fournier & Raoult,
2009)]; (ii) determining DDH requires special facilities
available in a limited number of laboratories; and (iii) it is a
labour-intensive and expensive method that lacks reproducibility and cannot be used to establish a comparative reference
database incrementally (Tindall et al., 2010). Therefore, in the
current era of genomics, DDH appears to be outdated and
needs to be replaced by easier and more reliable methods.
The 16S rRNA gene is an effective molecular marker for
taxonomic purposes because it is ubiquitous, functionally
stable, highly conserved and poorly subject to horizontal
gene transfer (HGT). 16S rRNA gene sequence variations
observed in archaea and bacteria were used as the backbone
for the classification of prokaryotes (Woese et al., 1990;
Fox et al., 1992; Olsen & Woese 1993; Brenner et al.,
2001; Stackebrandt et al., 2002). Therefore, sequencing
and phylogenetic analysis of the 16S rRNA gene has
been considered as a suitable method for determining the
classification of bacterial isolates at various taxonomic
levels. 16S rRNA gene sequence identity values of 97 and
95 % have been used as cut-offs to classify bacterial isolates
as novel taxa at the genus and species levels, respectively,
when compared with their phylogenetically closest neighbours with validly published names (Stackebrandt et al.,
2002). The former value was increased to 98.7 % in 2006
(Stackebrandt & Ebers, 2006). Such values usually being
correlated with DDH results (Rosselló-Móra, 2006), Stackebrandt and colleagues proposed that 16S rRNA gene
http://ijs.sgmjournals.org
sequence identity might even replace DDH (Stackebrandt
et al., 2002; Stackebrandt & Ebers, 2006). However, the 16S
rRNA gene also exhibits several limitations as a taxonomic
marker including: (i) its high degree of conservation in
some genera, as is the case for species of the genus Brucella,
which do not differ by more than 1 % (Gándara et al., 2001);
(ii) the presence of nucleotide variations among multiple
rRNA operons in a single genome (Rainey et al., 1996;
Acinas et al., 2004) and (iii) the possibility of 16S rRNA
genes being acquired by HGT that may distort relationships
between taxa in phylogenetic trees (Jain et al., 1999).
The DNA G+C content of prokaryotes is often used to
grossly classify prokaryotes, as is the case for the phylum
Actinobacteria, also referred to as the high-G+C-content
Gram-positive bacteria, and the Firmicutes, referred to as
the low-G+C-content Gram-positive bacteria. Although
intra-genomically variable, due to HGT, differences of .5
and .10 % between strains were used to classify them
within distinct species or genera, respectively (Goodfellow
et al., 1997). However, such values do not apply to all
bacterial genera. As an example, all species within the genus
Rickettsia exhibit less than 5 mol% difference in DNA
G+C content (Fournier & Raoult, 2009).
Genomics and taxonomy
As early as 1999, Fitz Gibbon & House (1999) proposed
that the presence or absence of genes within genomes
might be used to assess taxonomic relationships among
prokaryotes. Later studies also suggested that genomic
sequences may represent a source of taxonomic parameters
including chromosomal gene order and metabolic pathways (Snel et al., 1999; Huson & Steel, 2004), comparison
of orthologous genes (Coenye & Vandamme, 2003) and the
presence of indels or single nucleotide polymorphisms
(SNPs) in conserved genes (Gupta, 2001). Phylogenetic
studies based on the comparison of orthologous genes and
the presence or absence of genes further demonstrated
good congruence with studies built by comparison of 16S
rRNA gene sequences (Zhi et al., 2012).
However, in the decade following the pioneering sequencing of the genome of Haemophilus influenzae in 1995
(Fleischmann et al., 1995), genome sequencing remained
labour- and money-consuming and thus poorly adapted to
routine use. The decreasing costs and high throughput of
next-generation sequencing methods have enabled thousands of genomes to be sequenced (Soon et al., 2013). As
of 30 July 2013, more than 6000 prokaryotic genome sequences were available in public databases [National Center
for Biotechnology Information (http://www.ncbi.nlm.nih.
gov/genome), European Molecular Biology Laboratory
(http://www.ebi.ac.uk/genomes) and Genomes Online Database (http://www.genomesonline.org)]. Such a large number
of available genomes facilitates whole-genome sequence
comparison of new isolates (http://img.jgi.doe.gov/cgi-bin/
w/main.cgi; Markowitz et al., 2012).
Downloaded from www.microbiologyresearch.org by
IP: 88.99.165.207
On: Mon, 19 Jun 2017 01:44:53
385
D. Ramasamy and others
Over the past few years, several authors have suggested that
genome sequences may be used as a source of taxonomic
information. In particular, some of them studied the
correlation between the percentage of nucleotide sequence
similarity at the core genome level and DDH results.
Konstantinidis and colleagues suggested that the ANI, i.e.
the mean nucleotide sequence identity of shared genes
between two strains, was a valid alternative to DDH
(Konstantinidis & Tiedje, 2005; Goris et al., 2007). These
authors even proposed that an ANI value ¢95 % between
genomes corresponded to a DDH value of ¢70 % (Goris
et al., 2007). In 2009, Richter & Rosselló-Móra (2009)
proposed that ANI might be used as an alternative to DDH
for species delineation, and might be obtained from
comparison of only 20 % of any given genome sequence.
To date, the ANI method has been used to describe several
novel bacterial genera and/or species, including Dehalococcoides mccartyi (Loffler et al., 2012), Sphaerochaeta
globosa and S. pleomorpha (Ritalahti et al., 2012) and Vibrio
caribbeanicus (Hoffmann et al., 2012). However, Ozen et al.
(2012) have argued that ANI results are not systematically
consistent with current taxonomy and thus should not be
used as a single tool for prokaryote classification.
Other parameters based on genome sequences have been
proposed to discriminate species, such as the maximal
unique matches index (genomic distance index based on
DNA conservation of the core genome and the proportion
shared DNA by two genomes; Deloger et al., 2009), tetranucleotide regression [differences between observed and
expected values of the frequencies of all 256 possible
tetranucleotide (A, T, G, C) combinations; Richter &
Rosselló-Móra, 2009] and genome-to-genome comparison
[total length of all high-scoring segment pairs (HSPs)
identified by a BLAST search; Auch et al., 2010]. The latter
parameter may be calculated using the genome-to-genome
distance calculator (GGDC) available online (http://ggdc.
dsmz.de/distcalc2.php).
However, several authors have pointed out drawbacks of
genomics for taxonomic purposes. In 2010, Klenk & Göker
(2010) argued that sequenced genomes were lacking for
many major lineages of prokaryotes, but the rapidly
increasing number of available genome sequences, doubling almost every 2 years, should overcome this limitation.
Zhi et al. (2012) pointed out that the increasing number of
sequenced genomes remains biased towards prokaryotes of
medical or biotechnological importance, and Tindall et al.
(2010) observed that the genome sequences of species of
interest are not always those of type strains. Ricker et al.
(2012) also contested the quality of genome sequences
available in public databases, which may be available as
complete sequences, draft assemblies, scaffolds or contigs.
Klassen & Currie (2012) argued that draft genomes may be
less informative than complete genomes for taxonomic
purposes, and that there is a need to define minimal
Bacterial isolate
If score lower than 2 with species available in the database
MALDI-TOF MS
If nucleotide sequence identity is lower than 98.7% with
phylogenetically closest species with validly published name
AND
This value is lower than or equal to the highest similarity
observed among species of the same genus with standing in nomenclature
Comparison of phenotypic
16S rRNA gene sequence analysis
Putative novel bacterial species
Genome sequencing and annotation
and chemotaxonomic characteristics
Gram staining, cell size and shape, motility, sporulation,
oxygen requirement, biochemical characteristics,
MALDI-TOF spectrum
Determination of AGIOS values between
the new isolate and other closely related species
Comparison of genome
characteristics with closest
phylogenetic neighbours for
which genomes are available
Description of novel bacterial species
If in the range of those observed
among closely related species
Deposit
in two culture collections
Fig. 1. Polyphasic strategy used in our laboratory for the taxonomic classification of bacterial isolates.
386
Downloaded from www.microbiologyresearch.org by
International Journal of Systematic and Evolutionary Microbiology 64
IP: 88.99.165.207
On: Mon, 19 Jun 2017 01:44:53
http://ijs.sgmjournals.org
Table 1. Novel species identified and described using our polyphasic approach incorporating genome analysis
Species
Phylum Proteobacteria
‘Enterobacter massiliensis’
‘Herbaspirillum massiliense’
Oxygen requirement
Genome size
(Mb)
DNA G+C
content (mol%)
Accession no.
Genome
16S rRNA gene
Reference
ACS-065V-Col13
MM10403188
phR
JC122
JC66
ph5
ph1
JC140
JC401
JC6
JC13
AP8
ph2
Anaerobic
Aerobic
Anaerobic
Anaerobic
Facultatively anaerobic
Anaerobic
Anaerobic
Anaerobic
Anaerobic
Aerobic
Anaerobic
Aerobic
Anaerobic
1.79
4.632
5.05
3.89
5.58
2.1
1.77
1.84
1.75
4.98
3.57
4.61
1.77
28.56
37.3
53.1
26.8
48.2
33.9
30.01
32.2
30.7
37.6
40
34.1
51.4
AEXM00000000
CAET00000000
CAGW00000000
CAEV00000000
CAES00000000
CAGX00000000
CAHB00000000
CAEL00000000
CAEL00000000
CAHJ00000000
CAEN00000000
CAPG00000000
CAHC00000000
JF824805
JF824810
JN837488
JF824801
JF824808
JN837491
JN837495
JF824803
JN657222
JF824800
JF824807
JX101689
JN837487
Lagier et al. (2012b)
Kokcha et al. (2012a)
Hugon et al. (2013a)
Mishra et al. (2012b)
Mishra et al. (2012d)
Mishra et al. (2012e)
Mishra et al. (2013a)
Mishra et al. (2013b)
Mishra et al. (2012c)
Ramasamy et al. (2013a)
Ramasamy et al. (2013b)
Mishra et al. (2013d)
Hugon et al. (2013c)
ph8
JC50
JC136
Anaerobic
Anaerobic
Anaerobic
3.16
4.01
3.49
58.6
58.4
58.82
CAHA00000000
CAHI00000000
CAEG00000000
JN837494
JF824804
JF824799
Hugon et al. (2013b)
Mishra et al. (2012a)
Lagier et al. (2012c)
JC14
JC43
JC225
JC110
phI
JC301
Aerobic
Aerobic
Aerobic
Anaerobic
Anaerobic
Primarily aerobic,
facultatively anaerobic
3.32
3.42
3.4
2.83
2.28
3.01
72.49
70
71.22
85.7
62
61.4
CAHG00000000
CAHK00000000
CAHD00000000
CAEM00000000
CAGZ00000000
CAHH00000000
JF824798
JF824806
JN657218
JF824809
JN837493
JN657220
Ramasamy et al. (2012)
Kokcha et al. (2012b)
Lagier et al. (2012e)
Lagier et al. (2013a)
Mishra et al. (2013c)
Mishra et al. (2013d)
JC163
JC206
Aerobic
Aerobic
4.92
4.18
55.1
59.73
CAEO00000000
CAHF00000000
JN657217
JN657219
Lagier et al. (2013b)
Lagier et al. (2012d)
387
Downloaded from www.microbiologyresearch.org by
IP: 88.99.165.207
On: Mon, 19 Jun 2017 01:44:53
Use of genomics in taxonomic classification of bacteria
Phylum Firmicutes
‘Anaerococcus senegalensis’
‘Bacillus timonensis’
Brevibacillus massiliensis
‘Clostridium senegalense’
‘Paenibacillus senegalensis’
‘Peptoniphilus grossensis’
‘Peptoniphilus obesi’
‘Peptoniphilus senegalensis’
‘Peptoniphilus timonensis’
‘Bacillus massiliosenegalensis’
‘Dielma fastidiosa’
‘Bacillus massilioanorexius’
‘Kallipyga massiliensis’
Phylum Bacteroidetes
‘Alistipes obesi’
‘Alistipes senegalensis’
‘Alistipes timonensis’
Phylum Actinobacteria
‘Aeromicrobium massiliense’
Brevibacterium senegalense
‘Cellulomonas massiliensis’
‘Senegalemassilia anaerobia’
‘Enorma massiliensis’
‘Timonella senegalensis’
Type strain
D. Ramasamy and others
sequencing quality for genomes to be included in taxonomic analyses.
Taxono-genomics
In our laboratory, in order to isolate bacteria that were
unable to grow under routine culture conditions, Lagier
et al. (2012a) implemented a strategy named ‘culturomics’,
in which they diversified culture conditions (culture
medium composition, temperature, atmosphere, incubation
length). This strategy enabled the identification of 31 putative
novel species, for which genome sequences were systematically determined. In this context, we proposed to use a
polyphasic approach (Fig. 1) to describe these novel bacterial
species using both phenotypic characteristics such as habitat,
Gram-stain reaction, culture and metabolic characteristics,
proteic spectrum and, when applicable, pathogenicity, and
their genomic characteristics.
In this polyphasic approach, bacterial isolates are first
identified using MALDI-TOF MS as described previously
(Seng et al., 2009) (Fig. 1). For all isolates that exhibit an
MS score ,2 with the reference spectra of species whose
names have standing in nomenclature, we perform amplification and sequencing of the complete 16S rRNA gene.
The resulting sequences are then compared to GenBank
(http://www.ncbi.nlm.nih.gov/genbank/), and their closest
phylogenetic neighbours identified. Among those, we select
only the species and/or genera that have validly published
names, and retrieve the 16S rRNA gene sequences for the
corresponding type strains from the List of Prokaryotic
Names with Standing in Nomenclature website (http://
www.bacterio.net/index.html). Percentages of 16S rRNA
gene sequence identity are then calculated between all pairs
of studied taxa. However, rather than using only a universal
16S rRNA gene sequence similarity cut-off (Stackebrandt &
Ebers, 2006), we also consider the variability of conservation
of this gene among genera. Therefore, isolates that exhibit
16S rRNA gene sequence identity ,98.7 % with their closest
phylogenetic neighbour, as recommended by Stackebrandt
& Ebers (2006), are considered to represent putative novel
species if this identity value is equal to or lower than the
highest identity observed among species with names with
standing in nomenclature within the same genus. The next
step consists of characterizing the strains phenotypically and
chemotaxonomically using laboratory tools that are widely
available in microbiology laboratories and may be easily
used to reproduce the data, including staining properties,
cell shape and size, motility, sporulation, oxygen and other
growth requirements, biochemical characteristics using
commercially available strips and MALDI-TOF mass spectra.
Their genome sequence is obtained by using a 454 GS-FLX
Titanium platform (Roche Diagnostics) and annotated.
Genome sequences of the strains of interest and their
phylogenetically closest neighbours are then compared in
terms of size, DNA G+C content, percentage of coding
sequences, gene content, gene distribution in COG categories
(Tatusov et al., 2001), presence of mobile genetic elements,
388
numbers of RNA genes, signal peptides and transmembrane
helices. In addition, we determine the average of genomic
identity of orthologous gene sequences (AGIOS) of studied
strains by comparison with their closest phylogenetic
neighbours as well as between members of the genus of
interest. The AGIOS parameter is obtained by first identifying with BLASTP the set of orthologous proteins between
pairs of compared genomes using minimal coverage of 50 %
and a degree of amino acid identity of 30 %. We then
determine the mean percentage of nucleotide sequence
identity between these orthologous genes. The AGIOS and
ANI parameters differ from each other in that, for the latter,
orthologous genes are identified using BLASTN, which is less
sensitive for this purpose than BLASTP. We acknowledge the
fact that a drawback of our taxono-genomics method is that
genome sequences from some strains from species with validly
published names are currently not available, and our results
may be modified slightly when these data become available.
Type strains are also deposited in two reference culture
collections. To date, we have described 24 novel species using
this polyphasic method (Table 1), including Brevibacterium
senegalense (Kokcha et al., 2012b, 2013) and Brevibacillus
massiliensis (Hugon et al., 2013a, d), the names of which were
validly published by inclusion in Validation List no. 153.
Conclusion
With the decreasing costs of high-throughput sequencing
and the increasing number of prokaryotic genomes that are
sequenced, and given the drawbacks of the current genetic
gold standards that are used as taxonomic tools, should
genomic sequences be excluded from taxonomic descriptions? We strongly believe that genomic information
should be included among taxonomic criteria in addition
to, but not instead of, phenotypic and chemotaxonomic
parameters. We also suggest that putative novel taxa should
always be compared with their phylogenetic neighbours
with validly published names, and the genetic parameters
obtained among members of the closest genus or family
should be taken into account in the description of the
novel species, rather than trying to define universal cut-offs
that may be applicable to only a limited proportion of
prokaryotes. However, we agree that there is no current
consensus on the way genomic data should be integrated
into the taxonomic classification of prokaryotes, but such a
discussion should take place.
References
Acinas, S. G., Marcelino, L. A., Klepac-Ceraj, V. & Polz, M. F. (2004).
Divergence and redundancy of 16S rRNA sequences in genomes with
multiple rrn operons. J Bacteriol 186, 2629–2635.
Auch, A. F., Klenk, H. P. & Göker, M. (2010). Standard operating
procedure for calculating genome-to-genome distances based on
high-scoring segment pairs. Stand Genomic Sci 2, 142–148.
Brenner, D. J., Fanning, G. R., Rake, A. V. & Johnson, K. E. (1969).
Batch procedure for thermal elution of DNA from hydroxyapatite.
Anal Biochem 28, 447–459.
Downloaded from www.microbiologyresearch.org by
International Journal of Systematic and Evolutionary Microbiology 64
IP: 88.99.165.207
On: Mon, 19 Jun 2017 01:44:53
Use of genomics in taxonomic classification of bacteria
Brenner, D. J., Staley, J. T. & Krieg, N. R. (2001). Classification of
procaryotic organisms and the concept of bacterial speciation. In
Bergey’s Manual of Systematic Bacteriology, 2nd edn, vol. 1, pp. 27–31.
Edited by D. R. Boone, R. W. Castenholz & G. M. Garrity. New York:
Springer.
Buchanan, R. E. (1955). Taxonomy. Annu Rev Microbiol 9, 1–20.
Coenye, T. & Vandamme, P. (2003). Extracting phylogenetic
information from whole-genome sequencing projects: the lactic acid
bacteria as a test case. Microbiology 149, 3507–3517.
Coenye, T. & Vandamme, P. (2004). Use of the genomic signature in
Hugon, P., Ramasamy, D., Robert, C., Couderc, C., Raoult, D. &
Fournier, P. E. (2013c). Non-contiguous finished genome sequence
and description of Kallipyga massiliensis gen. nov., sp. nov., a new
member of the family Clostridiales Incertae Sedis XI. Stand Genomic
Sci 8, 500–515.
Hugon, P., Mishra, A. K., Lagier, J. C., Nguyen, T. T., Couderc, C.,
Raoult, D. & Fournier, P. E. (2013d). Brevibacillus massiliensis sp. nov.
In List of new names and new combinations previously effectively, but
not validly, published, Validation List no. 153. Int J Syst Evol Microbiol
63, 3131–3134.
bacterial classification and identification. Syst Appl Microbiol 27, 175–
185.
Huson, D. H. & Steel, M. (2004). Phylogenetic trees based on gene
Deloger, M., El Karoui, M. & Petit, M. A. (2009). A genomic distance
Jain, R., Rivera, M. C. & Lake, J. A. (1999). Horizontal gene transfer
content. Bioinformatics 20, 2044–2049.
based on MUM indicates discontinuity between most bacterial species
and genera. J Bacteriol 191, 91–99.
among genomes: the complexity hypothesis. Proc Natl Acad Sci U S A
96, 3801–3806.
Fitz Gibbon, S. T. & House, C. H. (1999). Whole genome-based
phylogenetic analysis of free-living microorganisms. Nucleic Acids Res
27, 4218–4222.
Janssen, P., Audit, B., Cases, I., Darzentas, N., Goldovsky, L., Kunin,
V., Lopez-Bigas, N., Peregrin-Alvarez, J. M., Pereira-Leal, J. B. &
other authors (2003). Beyond 100 genomes. Genome Biol 4, 402.
Fleischmann, R. D., Adams, M. D., White, O., Clayton, R. A., Kirkness,
E. F., Kerlavage, A. R., Bult, C. J., Tomb, J. F., Dougherty, B. A. & other
authors (1995). Whole-genome random sequencing and assembly of
Johnson, J. L. (1991). DNA reassociation experiments. In Nucleic Acid
Techniques in Bacterial Systematics, pp. 21–44. Edited by E. Stackebrandt
& M. Goodfellow. Chichester: Wiley.
Haemophilus influenzae Rd. Science 269, 496–512.
Klassen, J. L. & Currie, C. R. (2012). Gene fragmentation in bacterial
Fournier, P. E. & Raoult, D. (2009). Current knowledge on phylogeny
and taxonomy of Rickettsia spp. Ann N Y Acad Sci 1166, 1–11.
draft genomes: extent, consequences and mitigation. BMC Genomics
13, 14.
Fox, G. E., Wisotzkey, J. D. & Jurtshuk, P., Jr (1992). How close is
Klenk, H. P. & Göker, M. (2010). En route to a genome-based
close: 16S rRNA sequence identity may not be sufficient to guarantee
species identity. Int J Syst Bacteriol 42, 166–170.
classification of Archaea and Bacteria? Syst Appl Microbiol 33, 175–
182.
Gándara, B., Merino, A. L., Rogel, M. A. & Martı́nez-Romero, E.
(2001). Limited genetic diversity of Brucella spp. J Clin Microbiol 39,
Kokcha, S., Mishra, A. K., Lagier, J. C., Million, M., Leroy, Q., Raoult,
D. & Fournier, P. E. (2012a). Non contiguous-finished genome
235–240.
sequence and description of Bacillus timonensis sp. nov. Stand
Genomic Sci 6, 346–355.
Gevers, D., Cohan, F. M., Lawrence, J. G., Spratt, B. G., Coenye, T.,
Feil, E. J., Stackebrandt, E., Van de Peer, Y., Vandamme, P. & other
authors (2005). Opinion: Re-evaluating prokaryotic species. Nat Rev
Kokcha, S., Ramasamy, D., Lagier, J. C., Robert, C., Raoult, D. &
Fournier, P. E. (2012b). Non-contiguous finished genome sequence
Microbiol 3, 733–739.
and description of Brevibacterium senegalense sp. nov. Stand Genomic
Sci 7, 233–245.
Goodfellow, M., Manfio, G. P. & Chun, J. (1997). Towards a practical
species concept for cultivable bacteria. In Species: The Units of
Biodiversity, pp. 25–29. Edited by M. F. Clarridge & H. A. Dawah.
London: Chapman and Hall.
Goris, J., Konstantinidis, K. T., Klappenbach, J. A., Coenye, T.,
Vandamme, P. & Tiedje, J. M. (2007). DNA–DNA hybridization
values and their relationship to whole-genome sequence similarities.
Int J Syst Evol Microbiol 57, 81–91.
Gupta, R. S. (2001). The branching order and phylogenetic placement
of species from completed bacterial genomes, based on conserved
indels found in various proteins. Int Microbiol 4, 187–202.
Gürtler, V. & Mayall, B. C. (2001). Genomic approaches to typing,
taxonomy and evolution of bacterial isolates. Int J Syst Evol Microbiol
51, 3–16.
Hoffmann, M., Monday, S. R., Allard, M. W., Strain, E. A., Whittaker,
P., Naum, M., McCarthy, P. J., Lopez, J. V., Fischer, M. & Brown, E. W.
(2012). Vibrio caribbeanicus sp. nov., isolated from the marine
sponge Scleritoderma cyanea. Int J Syst Evol Microbiol 62, 1736–
1743.
Hugon, P., Mishra, A. K., Lagier, J. C., Nguyen, T. T., Couderc, C.,
Raoult, D. & Fournier, P. E. (2013a). Non-contiguous finished
genome sequence and description of Brevibacillus massiliensis sp. nov.
Stand Genomic Sci 8, 1–14.
Hugon, P., Ramasamy, D., Lagier, J. C., Rivet, R., Couderc, C., Raoult,
D. & Fournier, P. E. (2013b). Non contiguous-finished genome
sequence and description of Alistipes obesi sp. nov. Stand Genomic Sci
7, 427–439.
http://ijs.sgmjournals.org
Kokcha, S., Ramasamy, D., Lagier, J. C., Robert, C., Raoult, D. &
Fournier, P. E. (2013). Brevibacterium senegalense sp. nov. In List of
new names and new combinations previously effectively, but not validly,
published, Validation List no. 153. Int J Syst Evol Microbiol 63, 3131–
3134.
Konstantinidis, K. T. & Tiedje, J. M. (2005). Genomic insights that
advance the species definition for prokaryotes. Proc Natl Acad Sci
U S A 102, 2567–2572.
Konstantinidis, K. T. & Tiedje, J. M. (2007). Prokaryotic taxonomy
and phylogeny in the genomic era: advancements and challenges
ahead. Curr Opin Microbiol 10, 504–509.
Lagier, J. C., Armougom, F., Million, M., Hugon, P., Pagnier, I., Robert,
C., Bittar, F., Fournous, G., Gimenez, G. & other authors (2012a).
Microbial culturomics: paradigm shift in the human gut microbiome
study. Clin Microbiol Infect 18, 1185–1193.
Lagier, J. C., El Karkouri, K., Nguyen, T. T., Armougom, F., Raoult, D.
& Fournier, P. E. (2012b). Non-contiguous finished genome sequence
and description of Anaerococcus senegalensis sp. nov. Stand Genomic
Sci 6, 116–125.
Lagier, J. C., Armougom, F., Mishra, A. K., Nguyen, T. T., Raoult, D. &
Fournier, P. E. (2012c). Non-contiguous finished genome sequence
and description of Alistipes timonensis sp. nov. Stand Genomic Sci 6,
315–324.
Lagier, J. C., Gimenez, G., Robert, C., Raoult, D. & Fournier, P. E.
(2012d). Non-contiguous finished genome sequence and description
of Herbaspirillum massiliense sp. nov. Stand Genomic Sci 7, 200–209.
Downloaded from www.microbiologyresearch.org by
IP: 88.99.165.207
On: Mon, 19 Jun 2017 01:44:53
389
D. Ramasamy and others
Lagier, J. C., Ramasamy, D., Rivet, R., Raoult, D. & Fournier, P. E.
(2012e). Non contiguous-finished genome sequence and description
Mishra, A. K., Pfleiderer, A., Lagier, J. C., Robert, C., Raoult, D. &
Fournier, P. E. (2013d). Non contiguous-finished genome sequence
of Cellulomonas massiliensis sp. nov. Stand Genomic Sci 7, 258–270.
and description of Bacillus massilioanorexicus sp. nov. Stand Genomic
Sci 8, 465–479.
Lagier, J. C., Elkarkouri, K., Rivet, R., Couderc, C., Raoult, D. &
Fournier, P. E. (2013a). Non contiguous-finished genome sequence
and description of Senegalemassilia anaerobia gen. nov., sp. nov.
Stand Genomic Sci 7, 343–356.
Lagier, J. C., El Karkouri, K., Mishra, A. K., Robert, C., Raoult, D. &
Fournier, P. E. (2013b). Non contiguous-finished genome sequence
and description of Enterobacter massiliensis sp. nov. Stand Genomic Sci
7, 399–412.
Lehmann, K. B. & Neumann, R. (1896). Atlas und Grundriss der
Bakteriologie und Lehrbuch der speziellen bakteriologischen Diagnostik.
München: J. F. Lehmann (in German).
Loffler, F. E., Yan, J., Ritalahti, K. M., Adrian, L., Edwards, E. A.,
Konstantinidis, K. T., Muller, J. A., Fullerton, H., Zinder, S. H. &
Spormann, A. M. (2012). Dehalococcoides mccartyi gen. nov., sp. nov.,
obligately organohalide-respiring anaerobic bacteria relevant to
halogen cycling and bioremediation, belong to a novel bacterial class,
Dehalococcoidia classis nov., order Dehalococcoidales ord. nov. and
family Dehalococcoidaceae fam. nov., within the phylum Chloroflexi.
Int J Syst Evol Microbiol 63, 625–635.
Markowitz, V. M., Chen, I.-M. A., Palaniappan, K., Chu, K., Szeto, E.,
Grechkin, Y., Ratner, A., Jacob, B., Huang, J. & other authors (2012).
IMG: the Integrated Microbial Genomes database and comparative
analysis system. Nucleic Acids Res 40 (Database issue), D115–D122.
Minnikin, D. E., Alshamaony, L. & Goodfellow, M. (1975).
Differentiation of Mycobacterium, Nocardia, and related taxa by
thin-layer chromatographic analysis of whole-organism methanolysates. J Gen Microbiol 88, 200–204.
Mishra, A. K., Gimenez, G., Lagier, J. C., Robert, C., Raoult, D. &
Fournier, P. E. (2012a). Non-contiguous finished genome sequence
and description of Alistipes senegalensis sp. nov. Stand Genomic Sci 6,
304–314.
Mishra, A. K., Lagier, J. C., Robert, C., Raoult, D. & Fournier, P. E.
(2012b). Non-contiguous finished genome sequence and description
of Clostridium senegalense sp. nov. Stand Genomic Sci 6, 386–395.
Mishra, A. K., Lagier, J. C., Robert, C., Raoult, D. & Fournier, P. E.
(2012c). Non contiguous-finished genome sequence and description
of Peptoniphilus timonensis sp. nov. Stand Genomic Sci 7, 1–11.
Mishra, A. K., Lagier, J. C., Rivet, R., Raoult, D. & Fournier, P. E.
(2012d). Non-contiguous finished genome sequence and description
of Paenibacillus senegalensis sp. nov. Stand Genomic Sci 7, 70–81.
Mishra, A. K., Hugon, P., Robert, C., Raoult, D. & Fournier, P. E.
(2012e). Non contiguous-finished genome sequence and description
of Peptoniphilus grossensis sp. nov. Stand Genomic Sci 7, 320–330.
Mishra, A. K., Hugon, P., Lagier, J. C., Nguyen, T. T., Robert, C.,
Couderc, C., Raoult, D. & Fournier, P. E. (2013a). Non contiguous-
finished genome sequence and description of Peptoniphilus obesi sp.
nov. Stand Genomic Sci 7, 357–369.
Moore, E. R., Mihaylova, S. A., Vandamme, P., Krichevsky, M. I. &
Dijkshoorn, L. (2010). Microbial systematics and taxonomy: relevance
for a microbial commons. Res Microbiol 161, 430–438.
Olsen, G. J. & Woese, C. R. (1993). Ribosomal RNA: a key to
phylogeny. FASEB J 7, 113–123.
S. (1909). Die Hauptlinien des naturalischen
Bakteriensystems nebst einer Ubersicht der Garungsphenomene.
Zentralbl Bakteriol Parasitenkd Abt II 22, 305–346 (in German).
Orla-Jensen,
Ozen, A. I., Vesth, T. & Ussery, D. W. (2012). From genome sequence
to taxonomy – a skeptic’s view. In The Prokaryotes, 4th edn, pp.
209–227. Edited by E. Rosenberg, E. F. DeLong, E. Stackebrandt,
S. Lory & S. D. Thompson. Berlin: Springer.
Palleroni, N. J. (2010). The Pseudomonas story. Environ Microbiol 12,
1377–1383.
Rainey, F. A., Ward-Rainey, N. L., Janssen, P. H., Hippe, H. &
Stackebrandt, E. (1996). Clostridium paradoxum DSM 7308T
contains multiple 16S rRNA genes with heterogeneous intervening
sequences. Microbiology 142, 2087–2095.
Ramasamy, D., Kokcha, S., Lagier, J. C., N’Guyen, T. T., Raoult, D. &
Fournier, P. E. (2012). Genome sequence and description of
Aeromicrobium massilense sp. nov. Stand Genomic Sci 7, 246–
257.
Ramasamy, D., Lagier, J.-C., Gorlas, A., Raoult, D. & Fournier, P.-E.
(2013a). Non contiguous-finished genome sequence and description
of Bacillus massiliosenegalensis sp. nov. Stand Genomic Sci 8, 264–
278.
Ramasamy, D., Lagier, J. C., Nguyen, T. T., Raoult, D. & Fournier, P. E.
(2013b). Non contiguous-finished genome sequence and description
of Dielma fastidiosa gen. nov., sp. nov., a new member of the family
Erysipelotrichaceae. Stand Genomic Sci 8, 336–351.
Richter, M. & Rosselló-Móra, R. (2009). Shifting the genomic gold
standard for the prokaryotic species definition. Proc Natl Acad Sci
U S A 106, 19126–19131.
Ricker, N., Qian, H. & Fulthorpe, R. R. (2012). The limitations of draft
assemblies for understanding prokaryotic adaptation and evolution.
Genomics 100, 167–175.
Ritalahti, K. M., Justicia-Leon, S. D., Cusick, K. D., RamosHernandez, N., Rubin, M., Dornbush, J. & Löffler, F. E. (2012).
Sphaerochaeta globosa gen. nov., sp. nov. and Sphaerochaeta
pleomorpha sp. nov., free-living, spherical spirochaetes. Int J Syst
Evol Microbiol 62, 210–216.
Rosselló-Mora, R. (2005). Updating prokaryotic taxonomy. J Bac-
teriol 187, 6255–6257.
Rosselló-Mora, R. (2006). DNA-DNA reassociation methods applied
Mishra, A. K., Lagier, J. C., Nguyen, T. T., Raoult, D. & Fournier, P. E.
(2013b). Non contiguous-finished genome sequence and description
to microbial taxonomy and their critical evaluation. In Molecular
Identification, Systematics, and Population Structure of Prokaryotes, pp.
23–50. Edited by E. Stackebrandt. Berlin: Springer.
of Peptoniphilus senegalensis sp. nov. Stand Genomic Sci 7, 370–381.
Schleifer, K. H. & Stackebrandt, E. (1983). Molecular systematics of
Mishra, A. K., Hugon, P., Lagier, J. C., Nguyen, T. T., Couderc, C.,
Raoult, D. & Fournier, P. E. (2013c). Non contiguous-finished
genome sequence and description of Enorma massiliensis gen. nov.,
sp. nov., a new member of the family Coriobacteriaceae. Stand
Genomic Sci 8, 290–305.
Mishra, A. K., Lagier, J. C., Robert, C., Raoult, D. & Fournier, P. E.
(2013d). Genome sequence and description of Timonella senegalensis
gen. nov., sp. nov., a new member of the suborder Micrococcinae.
Stand Genomic Sci 8, 318–335.
390
prokaryotes. Annu Rev Microbiol 37, 143–187.
Seng, P., Drancourt, M., Gouriet, F., La Scola, B., Fournier, P. E., Rolain,
J. M. & Raoult, D. (2009). Ongoing revolution in bacteriology: routine
identification of bacteria by matrix-assisted laser desorption ionization
time-of-flight mass spectrometry. Clin Infect Dis 49, 543–551.
Smibert, R. M. & Krieg, N. R. (1994). Phenotypic characterization. In
Methods for General and Molecular Bacteriology, pp. 607–654. Edited
by P. Gerhardt, R. G. E. Murray, W. A. Wood & N. R. Krieg.
Washington, DC: American Society for Microbiology.
Downloaded from www.microbiologyresearch.org by
International Journal of Systematic and Evolutionary Microbiology 64
IP: 88.99.165.207
On: Mon, 19 Jun 2017 01:44:53
Use of genomics in taxonomic classification of bacteria
Sneath, P. H. A. & Sokal, R. R. (1973). Numerical Taxonomy, the
Principles and Practice of Numerical Classification. San Francisco:
W. H. Freeman.
Snel, B., Bork, P. & Huynen, M. A. (1999). Genome phylogeny based
on gene content. Nat Genet 21, 108–110.
Soon, W. W., Hariharan, M. & Snyder, M. P. (2013). High-throughput
sequencing for biology and medicine. Mol Syst Biol 9, 640.
Stackebrandt, E. & Ebers, J. (2006). Taxonomic parameters revisited:
tarnished gold standards. Microbiol Today 33, 152–155.
Stackebrandt, E., Frederiksen, W., Garrity, G. M., Grimont, P. A.,
Kämpfer, P., Maiden, M. C., Nesme, X., Rosselló-Mora, R., Swings, J.
& other authors (2002). Report of the ad hoc committee for the re-
evaluation of the species definition in bacteriology. Int J Syst Evol
Microbiol 52, 1043–1047.
Tatusov, R. L., Natale, D. A., Garkavtsev, I. V., Tatusova, T. A.,
Shankavaram, U. T., Rao, B. S., Kiryutin, B., Galperin, M. Y.,
Fedorova, N. D. & Koonin, E. V. (2001). The COG database: new
developments in phylogenetic classification of proteins from complete
genomes. Nucleic Acids Res 29, 22–28.
Tindall, B. J., Rosselló-Móra, R., Busse, H.-J., Ludwig, W. & Kämpfer,
P. (2010). Notes on the characterization of prokaryote strains for
taxonomic purposes. Int J Syst Evol Microbiol 60, 249–266.
http://ijs.sgmjournals.org
Vandamme, P., Pot, B., Gillis, M., de Vos, P., Kersters, K. & Swings, J.
(1996). Polyphasic taxonomy, a consensus approach to bacterial
systematics. Microbiol Rev 60, 407–438.
Wayne, L. G., Brenner, D. J., Colwell, R. R., Grimont, P. A. D.,
Kandler, O., Krichevsky, L., Moore, L. H., Moore, W. C., Murray, R. G.
E. & other authors (1987). Report of the ad hoc committee on
reconciliation of approaches to bacterial systematics. Int J Syst
Bacteriol 37, 463–464.
Welker, M. & Moore, E. R. (2011). Applications of whole-cell matrix-
assisted laser-desorption/ionization time-of-flight mass spectrometry
in systematic microbiology. Syst Appl Microbiol 34, 2–11.
Woese, C. R., Kandler, O. & Wheelis, M. L. (1990). Towards a natural
system of organisms: proposal for the domains Archaea, Bacteria, and
Eucarya. Proc Natl Acad Sci U S A 87, 4576–4579.
Yarza, P., Richter, M., Peplies, J., Euzéby, J., Amann, R., Schleifer,
K. H., Ludwig, W., Glöckner, F. O. & Rosselló-Móra, R. (2008).
The All-Species Living Tree project: a 16S rRNA-based phylogenetic tree of all sequenced type strains. Syst Appl Microbiol 31, 241–
250.
Zhi, X. Y., Zhao, W., Li, W. J. & Zhao, G. P. (2012). Prokaryotic
systematics in the genomics era. Antonie van Leeuwenhoek 101, 21–
34.
Downloaded from www.microbiologyresearch.org by
IP: 88.99.165.207
On: Mon, 19 Jun 2017 01:44:53
391