Download Taxonomy and Evolution of the Cotton Genus, Gossypium

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Introduced species wikipedia , lookup

Island restoration wikipedia , lookup

Fauna of Africa wikipedia , lookup

Transcript
Published April 23, 2015
Taxonomy and Evolution
of the Cotton Genus, Gossypium
Jonathan F. Wendel* and Corrinne E. Grover
Abstract
We present an overview of the taxonomy of Gossypium L. (the cotton genus) and its
evolutionary history. Gossypium contains more than 50 recognized species, including
several recently described, distributed in arid to semiarid regions of the tropics and
subtropics. Diversity in Gossypium has been promoted by two seemingly unlikely processes: transoceanic, long-distance dispersal and wide hybridization among lineages
that presently are widely separated geographically. Included are four species that were
independently domesticated for their seed fiber—two diploids from Africa–Asia and
two allopolyploids from the Americas. This repeated domestication of different wild
progenitors represents a remarkable case of human-driven parallel evolution. Morphological variation in Gossypium is extensive; growth forms in the genus range from
sprawling herbaceous perennials to ?15-m-tall trees, representing a notable array of
reproductive and vegetative characteristics. Equally impressive is the striking cytogenetic and genomic diversity that emerged as Gossypium diversified and spread
worldwide, ultimately spawning eight groups of closely related diploid (n = 13) species
(i.e., genome groups A through G, and K). DNA sequence data place the origin of Gossypium at about 5 to 10 million years ago (mya), which rapidly diversified into these major
genome groups shortly thereafter. Allopolyploid cottons appeared within the last 1 to 2
million years, a consequence of the improbable transoceanic dispersal of an A genome
taxon to the New World and subsequent hybridization with an indigenous D genome
diploid. Diversification of the nascent allopolyploid gave rise to three modern lineages
containing seven species, including the agronomically important G. hirsutum L. and G.
barbadense L.
B
ecause of its economic importance, the cotton genus (Gossypium L.) has been
of interest to agricultural scientists, taxonomists, and many other kinds of
biologists. Accordingly, a considerable amount is understood regarding the origin and diversification of the genus, its basic plant biology, and its properties as
a crop plant (Paterson, 2009; Stewart et al., 2010; Wendel et al., 2009, 2010, 2012).
Increasingly over the past two decades, classic taxonomic questions such as the
origin of the polyploid species, the relationships among species and species
Abbreviations: mya, million years ago.
Jonathan F. Wendel and Corrinne E. Grover ([email protected]), Dep. of Ecology,
Evolution, and Organismal Biology, Bessey Hall, Iowa State University, Ames, IA 50011.
*Corresponding author ([email protected]).
doi:10.2134/agronmonogr57.2013.0020
Copyright © 2015. ASA, CSSA, and SSSA, 5585 Guilford Road, Madison, WI 53711, USA.
Cotton. 2nd. ed. David D. Fang and Richard G. Percy, editors. Agronomy Monograph 57.
1
2
Wendel & Grover
groups, and the origins of the domesticated forms from their wild progenitors
have been revisited using modern molecular technologies (Wendel et al., 2009,
2010, 2012; Wendel and Cronn, 2003). One of the salient features of the genus
is that its history is so encompassing in scope, involving a global phylogenetic
diversification and a repeated history of transoceanic dispersal. A remarkable
consequence of this global spread and natural diversification has been that it provided the opportunity for different ancient human cultures on several continents
to independently domesticate four different cotton species—two allopolyploids
from the Americas, G. hirsutum and G. barbadense, and two diploids from Africa–
Asia, G. arboreum L. and G. herbaceum L. The canvas for this human history is
the natural evolutionary diversification of the genus, one that involves multiple,
seemingly impossible hybridization events, often between lineages that presently
are geographically disjunct or even on different continents, as well as multiple,
remarkable cases of transoceanic, long-distance dispersals. These twin themes of
cryptic intergenomic introgression or hybridization and long-distance dispersal
have repeatedly come into play as the genus diversified and colonized much of
the arid to semiarid regions of the tropics and subtropics.
A wealth of recent studies have provided new insights into Gossypium taxonomy and diversity, suggesting that it is an appropriate time to update our
understanding of the evolution, taxonomy, and biogeographic history of the
genus. Here we provide such an update, drawing on our earlier reviews (Wendel
et al., 2009, 2010, 2012; Wendel and Cronn, 2003) and with new syntheses.
Emergence of the Genus Gossypium
The cotton genus belongs to the Gossypieae, a small taxonomic tribe that includes
only nine genera, eight of which are classically recognized (Fryxell, 1968, 1979)
and one that is a new segregate from Cienfuegosia (Phuphathanaphong, 2006). Five
of these genera are small with restricted geographic distributions (Fryxell, 1968,
1979; Phuphathanaphong, 2006) including Lebronnecia (Marquesas Islands), Cephalohibiscus (New Guinea, Solomon Islands), Gossypioides (East Africa, Madagascar),
Thepparatia (known only from the type locality in Northern Thailand) and Kokia
(Hawaii). The tribe also includes four moderately sized genera with broader geographic ranges: Hampea (21 neotropical species), the relatively diverse Cienfuegosia
(25 species from the neotropics and parts of Africa), Thespesia (17 tropical species),
and Gossypium, whose 50+ species (Fryxell, 1992) make it the largest and most
widely distributed genus in the tribe. Thus, in aggregate this small taxonomic
tribe has achieved an expansive geographic range, though many of the included
taxa are quite rare and/or narrowly distributed.
Molecular phylogenetic analyses have shed light on three important aspects
of the evolutionary history of the tribe (Cronn et al., 2002; Seelanan et al., 1997).
First and foremost is that despite their extensive distribution and extraordinary
diversity, the group of species that belong to Gossypium do constitute a single
natural lineage. A second important revelation has been the identity of the closest relatives of Gossypium, that is, the African–Madagascan genus Gossypioides
and the Hawaiian endemic genus Kokia. This is unsuspected finding, given their
biogeography and lower chromosome number (n = 12), yet it is quite important
in that they may serve as outgroups in phylogenetic analyses, which provides an
essential context both for studying evolutionary patterns and processes within
Taxonomy and Evolution of the Cotton Genus, Gossypium
3
Gossypium and for understanding genome evolution. The third important insight
gained from these molecular analyses was the provision of a temporal component to the major divergences. Using sequence divergence data as a proxy for
time (i.e., a “molecular clock”), Seelanan et al. (1997) suggested that Gossypium
diverged from Kokia and Gossypioides circa 12.5 mya, an estimate bolstered by a
later, more extensive data set containing 10 different nuclear genes (Cronn et al.,
2002). Collectively, all molecular data suggest that Gossypium diverged from its
closest relatives approximately 10 to 15 mya during the Miocene and acquired its
modern, worldwide distribution by multiple transoceanic dispersals combined
with regional speciation.
Diversification of the Diploid Cotton Species
From the time of origin 5 to 10 mya, Gossypium experienced rapid speciation and
diversification, while also achieving a nearly worldwide distribution containing
several primary centers of diversity in the arid or seasonally arid tropics and subtropics (Table 1). Species-rich regions include Australia, especially the Kimberley
region in NW Australia; the Horn of Africa and southern Arabian Peninsula; and
the western part of central and southern Mexico. The taxonomy of the genus has
been well studied (Cronn et al., 2002; Fryxell, 1979, 1992; Hutchinson et al., 1947;
Saunders, 1961; Seelanan et al., 1997; Watt, 1907), with the taxonomic classification
of Fryxell (1979, 1992) being the most modern and widely followed. In this treatment, species are grouped into four subgenera and eight sections (Table 1). The
Table 1. Diversity and geographic distribution of the major lineages of Gossypium.
Genomic placements and taxonomic status of species enclosed by parentheses are
yet to be validated by cytogenetic and other means.
Genome
(no. of species) Recognized species
A (2)
G. arboreum, G. herbaceum
B (3–4)
G. anomalum, G. triphyllum, G. capitis-viridis, (G.
trifurcatum)
C (2)
G. sturtianum, G. robinsonii
D (13–14)
G. thurberi, G. armourianum, G. harknessii, G.
davidsonii, G. klotzschianum, G. aridum, G. raimondii,
G. gossypioides, G. lobatum, G. trilobum, G. laxum, G.
turneri, G. schwendimanii, (Gossypium sp. nov.)
E (5–9)
G. stocksii Mast., G. somalense (Gurke) J.B. Hutch., G.
areysianum Deflers, G. incanum (O. Schwartz) Hillc.,
G. trifurcatum, (G. benadirense), (G. bricchettii), (G.
vollesenii Fryxell), (G. trifurcatum)
F (1)
G. longicalyx
G (3)
G. bickii, G. australe, G. nelsonii
K (12)
G. anapoides, G. costulatum, G. cunninghamii, G.
enthyle Fryxell et al., G. exiguum Fryxell et al., G.
londonderriense Fryxell et al., G. marchantii, G. nobile
Fryxell et al., G. pilosum Fyxell, G. populifolium (Benth.)
F. Muell. ex Tod., G. pulchellum (C.A. Gardner) Fryxell,
G. rotundifolium Fryxell et al.
AD (7)
G. hirsutum, G. barbadense, G. tomentosum, G.
mustelinum, G. darwinii, G. ekmanianum, Gossypium
sp. nov.†
Geographic distribution
Africa, Asia
Africa, Cape Verde
Islands
Australia
primarily Mexico, with
range extensions
into Peru, Galapagos
Islands, Arizona
Arabian Peninsula,
Northeast Africa,
Southwest Asia
East Africa
Australia
Northwest Australia,
Cobourg Peninsula,
Northern Territory,
Australia
New World tropics
and subtropics,
including Hawaii, the
Wake Atoll, and the
Galapagos Islands
† J.F. Wendel, C.E. Grover, J. Jareczek, and J.P. Gallagher, unpublished data, 2014.
4
Wendel & Grover
classification represents the accumulated scientific understanding that emerged
both from basic plant exploration and taxonomic and evolutionary study. While
these species groupings are based primarily on morphological and geographical
evidence, most infrageneric alignments are congruent with subsequent cytogenetic and molecular data sets (Endrizzi et al., 1985; Wendel and Albert, 1992;
Wendel et al., 2009, 2010).
The current circumscription of Gossypium presently includes approximately
50 species (Fryxell, 1992), although new species continue to be discovered (Alvarez and Wendel, 2006; Grover et al., 2014; Krapovickas and Seijo, 2008; Stewart et
al., 2015; Ulloa et al., 2006). Morphologically, the genus is exceptionally diverse;
species morphologies range from fire-adapted, herbaceous perennials with massive underground rootstocks in NW Australia to trees in SW Mexico that escape
the dry season by dropping their leaves. Corolla colors span a rainbow of blue to
purple [G. triphyllum (Harv.) Hochr.], mauves and pinks (G. sturtianum J.H. Willis,
or “Sturt’s Desert Rose,” the official floral emblem of the Northern Territory, Australia), whites and pale yellows (NW Australia, Mexico, Africa–Arabia), and even
a deep sulfur-yellow (G. tomentosum Nutt. ex Seem., Hawaii). Seed coverings are
equally diverse; these range from nearly glabrous to the naked eye (e.g., G. klotzschianum Andersson and G. davidsonii Kellogg), to short, stiff, dense, brown hairs
that aid in wind dispersal (G. australe F. Muell and G. nelsonii Fryxell), to the long,
fine, white fibers that characterize highly improved forms of the four cultivated
species (Fig. 1). There are even seeds that produce fat bodies to facilitate ant-dispersal (Seelanan et al., 1999).
In addition to the remarkable morphological diversity, the genus also experienced extensive chromosomal evolution during diversification (Baudoin et al.,
2009; Endrizzi et al., 1985). Despite sharing the same chromosome number (n =
13), genome sizes among the diploid species vary more than threefold (Hendrix
and Stewart, 2005). Among closely related species, chromosome morphology is
quite similar, as is demonstrated by the ability of closely related species to form
hybrids that undergo normal meiotic pairing and that often experience high F1
fertility. In contrast, crosses between more distantly related species are often difficult to generate, with the few successful hybrids typically experiencing meiotic
abnormalities. Extensive observations concerning the pairing behavior, chromosome sizes, and relative fertility in interspecific hybrids ultimately led to the
clustering of closely related species into “genome groups,” designated by single-letter genome symbols (Beasley, 1941). Currently, there are eight recognized
diploid genome groups (A through G, plus K) (Endrizzi et al., 1985; Stewart, 1995),
whose species inclusions are largely congruent with taxonomic and phylogenetic
placements.
A temporal framework for the origin of Gossypium and its diversification is
provided by sequence divergence data serving as a proxy for time (Senchina et al.,
2003). These analyses indicate that Gossypium diverged from its closest relatives
during the Miocene approximately 10 to 15 mya to originate approximately 5 to
10 mya, and these estimates of divergence times have recently been supported by
an enormous amount of new sequence data derived from a global assembly of
expressed sequence tags (Flagel et al., 2012). A consideration of the near worldwide distribution of species combined with their phylogenetic history (Fig. 2),
relative to plate tectonic history, indicates that the evolution of Gossypium has
involved multiple episodes of transoceanic dispersal. Minimally, these include
Taxonomy and Evolution of the Cotton Genus, Gossypium
5
Fig. 1. Representative seed and trichome diversity in Gossypium. Seed and trichome
size and morphology are exceedingly variable in the genus. Most wild species have
relatively small seeds (<5 mm in any dimension) with equally short fibers. Long
(spinnable) fiber evolved only once, in the ancestor of modern A genome cottons,
which subsequently donated this capacity to modern allopolyploid species, including
the commercially important G. hirsutum and G. barbadense, at the time of allopolyploid
formation. Key to species: Cult. AD1, G. hirsutum TM1; Wild AD1, G. hirsutum
Tx2094 from the Yucatan Peninsula; AD3, G. tomentosum WT936 from Hawaii; C1, G.
sturtianum C1-4 from Australia; Cult. A2, G. arboreum AKA8401; Wild A1, G. herbaceum
subsp. africanum (Watt) Vollesen from Botswana; D5, G. raimondii from Peru; D3, G.
davidsonii D3d-32 from Baja California; F1, G. longicalyx F1-3 from Tanzania; B1, G.
anomalum B1-1 from Africa.
at least one dispersal between Australia and Africa; another to the Americas,
which led to the evolution of the D genome diploids; and a second, much later
colonization of the New World by the A genome ancestor of the AD genome allopolyploids (see below).
Out of Africa, the Cytogenetic Center
of Diversity in the Genus (A, B, E, and F Genomes)
An important implication of the phylogenetic data for Gossypium is that it supports the proposition that Africa is the ancestral home for the genus (Fig. 2). This
is not strictly true from a purely cladistic standpoint, in that the basal phylogenetic split is between the New World and Old World diploids; however, the
supposition that Gossypium originated in Africa is supported by the genomic
Fig. 2. Biogeographic history of Gossypium. Following its likely origin 5 to 10 mya, Gossypium split into three major diploid lineages: the New
World clade (D genome); the African–Asian clade (A , B, E, and F genomes); and the Australian clade (C, G, and K genomes). The diversity of
genome sizes and seed morphologies in the genus are shown with exemplars from each genome group. See text for additional details.
6
Wendel & Grover
Taxonomy and Evolution of the Cotton Genus, Gossypium
7
diversity found in the region (see below), versus the relative homogeneity found
in the New World. Fourteen species from Africa and Arabia are recognized in the
most recent taxonomic treatment of the genus (Fryxell, 1992) and together compose the African–Arabian subgenus Gossypium. Recognized within the subgenus
Gossypium are two taxonomic sections: section Gossypium, which contains four
subsections, and another African–Arabian section (Serrata), which contains the
single species G. trifurcatum Vollesen (found in desert area of eastern Somalia).
The presence of dentate leaves has raised the question of whether G. trifurcatum
belongs in the genus Gossypium at all, as dentate leaves are more characteristic
of Cienfugosia; however, molecular work has established that this poorly known
specimen is indeed a cotton species, if unusual in form (Rapp and Wendel, 2005).
This example underscores the provisional nature of much of the taxonomy of the
African–Arabian species of Gossypium, which are sorely in need of basic plant
exploration and systematic study. Other examples include subsection Pseudopambak, which contains several species whose recognition and definition are based
solely on limited herbarium material (e.g., G. benadirense Mattei, G. bricchettii
(Ulbr.) Vollesen, G. vollesenii). As no seeds have been collected and no living specimens are available, there exists no information on the cytogenetic characteristics
and molecular phylogenetic placement for these species. It is nearly certain that
new taxonomic diversity remains to be discovered in this region of the world.
Cytogenetically, the African–Arabian species are rather diverse and account
for four of the eight genome groups (A, B, E, and F), which neatly correspond to
their taxonomic subsections. The A genome group contains the two cultivated
cottons of subsection Gossypium, G. arboreum and G. herbaceum, while the three
African species in the B genome (G. anomalum Wawra, G. capitis-viridis Mauer, and
G. triphyllum) compose subsection Anomala. It bears noting that G. trifurcatum
(taxonomic section Serrata) may in fact belong to the B genome, but this has not
been established. The sole F genome species, G. longicalyx J.B. Hutch. & B.J.S. Lee,
represents subsection Longiloba, and, of the section Gossypium, it is considered
cytogenetically distinct (Phillips, 1966), morphologically isolated (Fryxell, 1971,
1992), and according to Fryxell may be better adapted to more mesic conditions
than any other diploid Gossypium species. Further adding to the interest in this
species is the phylogenetic placement of G. longicalyx as sister to the A genome
taxa (Fig. 2), which provides a window into the evolutionary origin of “long” fiber.
The remaining African–Asian species, those of subsection Pseudopambak, are considered to possess E genomes, although this has yet to be verified.
To Australia, with Genomic
Gigantism and New Ecological Specializations
Australia represents another center of diversity for the cotton genus. The Australian cottons (subgenus Sturtia) are represented by 3 taxonomic sections containing
17 total species, including one, G. anapoides J.M. Stewart et al., nom. inval., that is
newly described (Stewart et al., 2015). Cytogenetically, these taxa are divided into
the C, G, and K genome groups, having 2, 3, and 12 species, respectively. DNA
sequence data (Liu et al., 2001; Seelanan et al., 1997, 1999) support these groups
as natural lineages, consistent with their formal alignments into the taxonomic
sections Sturtia (C genome), Hibiscoidea (G genome), and Grandicalyx (K genome),
although the relationships among these three are yet unresolved. Some data
8
Wendel & Grover
support a west to east radiation of Gossypium in Australia by placing G. robinsonii
F. Muell. (C genome) as the basal-most branch of the Australian clade (DeJoode
and Wendel, 1992). This basal placement of the C genome has not been observed
in the most recent analyses (Liu et al., 2001; Seelanan et al., 1997, 1999), which
provide no resolution among the Australian lineages and thereby highlight the
uncertainty in the phylogenetic history of the Australian cottons.
Taxonomically, the C and G genome groups are well understood, as these are
amply represented in collections and have been thoroughly studied (DeJoode and
Wendel, 1992; Fryxell, 1979, 1992; Liu et al., 2001; Seelanan et al., 1997, 1999; Tiwari
et al., 2014; Wendel et al., 1991). Species in the G genome are interfertile and apparently hybridize commonly (Tiwari et al., 2014; Wendel et al., 1991). The taxonomy
of the K genome (section Grandicalyx), however, is less certain. Collecting expeditions to the Kimberley region have doubled the number of recognized K genome
species in the past two decades through the discovery of at least seven new species (Fryxell, 1992; Stewart et al., 2015) and have expanded our understanding
of the diversity in this section. The species of this section are geographically,
morphologically, and ecologically distinct from other cotton species, frequently
exhibiting features characteristic of fire adaptation. Specifically, these are herbaceous perennials that exhibit a biseasonal growth pattern, where vegetative
growth dies back during the dry season and the plants survive as underground
rootstocks until fire or the return of the wet season initiates a new cycle of
growth. Also unique to section Grandicalyx are the flower and fruit morphologies.
K genome species possess upright flowers (like other cotton species) that subsequently become pendant following pollination. This change in floral positioning
is thought to aid in these uniquely ant-dispersed cotton seeds. That is, at maturity the pendant capsules release sparsely haired seeds, which bear elaiosomes
(fat bodies) that attract the ants who participate in their dispersal. Nearly all of
K genome species are poorly represented in collections (Campbell et al., 2010),
leading to much of this taxonomic uncertainty. These species have enormous
genomes, relative to others in the genus (Fig. 2), reflecting a relatively recent proliferation of gypsy-like transposable elements (Hawkins et al., 2006). Phylogenetic
analyses have yielded conflicting results regarding interspecific relationships in
section Grandicalyx (Liu et al., 2001; Seelanan et al., 1999), although there is some
support for a fundamental division of the section into a group of prostrate species
and a clade of more upright plants.
One final comment about the Australian species concerns their apparent
propensity to hybridize, a somewhat surprising observation given their present
status as species whose populations are mostly small, scattered, and geographically highly disjunct from those of related species. These geographical and often
phenological barriers notwithstanding, there is abundant evidence for multiple
episodes of historical introgression in the group, involving G. bickii Prokh., G.
sturtianum, and G. cunninghamii Tod. (Cronn and Wendel, 2004; Seelanan et al.,
1999; Tiwari et al., 2014; Wendel et al., 1991).
To the New World, Part I: The D Genome Diploids
A watershed moment in the evolutionary history of Gossypium followed a seemingly impossible transoceanic (either Atlantic or Pacific) dispersal of an African
ancestor to the shores of Central America, possibly on what is now the western
Taxonomy and Evolution of the Cotton Genus, Gossypium
9
coast of Mexico. This dispersal event, dating to about 5 to 10 mya, led to the evolution and diversification of the New World D genome clade. This assemblage of
species have unusually small genomes for the genus (Fig. 2), due at least in part to
deletional clearance of much of the transposable element fraction (Hawkins et al.,
2009), and none bear spinnable fiber (Fig. 1). Taxonomically placed into subgenus
Houzingenia, the species are aligned into two sections and six subsections. These
species have been more thoroughly studied than most, and consequently their
taxonomy is reasonably well understood. Nonetheless, there remains sufficient
cryptic variability in the group that much remains to be learned about diversity
and relationships (Alvarez and Wendel, 2006; Cronn et al., 2003; Feng et al., 2011;
Ulloa et al., 2006, 2013; Wendel and Cronn, 2003; Wendel et al., 1995). This is particularly true for the “arborescent clade” comprising the group of small trees that
includes the widespread G. aridum (Rose & Standl.) Skovst. and the more narrowly distributed G. laxum L. Ll. Phillips, G. lobatum Gentry, and G. schwendimanii
Fryxell & S.D. Koch. Evidence indicates that cryptic variation exists within this
clade, possibly including new species related to G. aridum (Alvarez et al., 2005;
Feng et al., 2011) and G. laxum (Alvarez et al., 2005; Ulloa et al., 2006). There even
are populations of G. aridum from Colima that have a plastid genome much more
similar to those from G. klotzschianum (from the Galapagos Islands) and G. davidsonii (from Baja California) than to all other G. aridum populations, likely reflecting
historical interspecific gene flow (Alvarez and Wendel, 2006; Wendel and Albert,
1992). Gossypium gossypioides (Ulbr.) Standl. also has a somewhat mysterious introgressive ancestry (Cronn et al., 2003; Wendel et al., 1995), pointing out once again
the significance of ancient, natural interspecific hybridization in the generation of
diversity in the cotton genus (Cronn and Wendel, 2004).
The D genome cottons also have received considerable phylogenetic attention (Alvarez et al., 2005; Cronn et al., 2002, 2003; Feng et al., 2011; Ulloa et al., 2006,
2013; Wendel and Albert, 1992), which provides support for the naturalness of
most of the subsections. Evolutionary relationships among the apparently natural subsections are less certain, however (Alvarez et al., 2005; Ulloa et al., 2013),
although available evidence suggests that G. gossypioides is basal-most within
the subgenus (Cronn et al., 2003; Feng et al., 2011; Yu-xiang et al., 2013). All analyses support the monophyly of the arborescent clade, the grouping of the Baja
California species (G. harknessii Brandegee–G. armourianum Kearney–G. turneri
Fryxell) as a distinct clade, and the naturalness of the G. thurberi Tod.–G. trilobum
(DC.) Skovst. species pair, but the relative phylogenetic placements of these three
groups with G. raimondii Ulbr. remain uncertain.
Twelve of the 14+ D genome diploid species are endemic to western Mexico,
indicating this area is the center of diversity of the D genome. It seems likely, therefore, that this lineage became established and initially diversified in this region.
From this we can deduce later range extensions, probably relatively recently (during the Pleistocene), following long-distance dispersals leading to the evolution
of endemics in Peru (G. raimondii) and the Galapagos Islands (G. klotzschianum).
The latter species is closely related to G. davidsonii from Baja California, representing a clear and unusual case of this floristic connection (Wendel and Percival,
1990). As for G. raimondii, it is interesting that the sole South American representative of the New World diploid cottons is implicated as the closest ancestor of the
D genome of allopolyploid cotton (more on this below); the implication, perhaps
ironic, is that if allopolyploidy occurred in northern South America, as suggested
10
Wendel & Grover
(but not shown) by this observation, the possibility exists that they never would
have evolved had colonization of the New World by an A genome propagule not
followed closely on the heels of an earlier interhemispheric North American to
South American dispersal of the ancestor of what we now recognize as G. raimondii. This aspect of the origin of the allopolyploids makes an improbable story all
the more remarkable and yet again points to the recurring theme of the evolutionary significance of long-distance dispersal in cotton diversification.
To the New World, Part II: Origin and
Diversification of the Polyploid Cottons
A rich body of cytogenetic and experimental evidence now has demonstrated
convincingly that the tetraploid cotton species, which are entirely New World
in their distribution, are allopolyploids containing two coresident genomes: one
from an African or Asian A genome species and the other from a species similar
to the American D genome diploids (Endrizzi et al., 1985; Wendel, 1989; Wendel
and Cronn, 2003; Wendel et al., 2012). The hemisphere-scale allopatry of these two
diploid genome groups led to many decades of mystery surrounding the timing
and parentage of the New World allopolyploids. The question of “when” has been
convincingly addressed through gene sequence data that indicate allopolyploid
Gossypium originated approximately 1 to 2 mya in the mid-Pleistocene, before the
evolution of modern humans but relatively recently in geological terms (Flagel
et al., 2012; Senchina et al., 2003; Wendel, 1989). The question of parentage has
also been clarified through a wealth of DNA sequence data that indicate: (i) both
extant A genome species (G. arboreum, G. herbaceum) are equally distant from the
allopolyploid A genome and (ii) G. raimondii is the closest extant relative of the
actual D genome donor of the allopolyploid (Endrizzi et al., 1985; Li et al., 2014;
Wendel, 1989; Wendel and Cronn, 2003; Wendel et al., 2012; Yu-xiang et al., 2013).
Additionally, all allopolyploids contain an A genome cytoplasm, as evidenced
from analysis of both mitochondrial and plastidial genomes (Galau and Wilkins,
1989; Li et al., 2014; Small and Wendel, 1999; Wendel, 1989; Xu et al., 2012). Finally,
the studies mentioned above, and additional, multilocus DNA sequence data
(Grover et al., 2012), support a single origin for allopolyploid cotton.
As noted above, one truly remarkable feature of the origin of allopolyploid
cottons is that their genesis likely was dependent on the following series of highly
improbable events, which when considered in their totality amount to a biological marvel: (i) an ancient colonization of the New World by the ancestor of the D
genome diploids, following an initial transoceanic dispersal; (ii) a more recent
interhemispheric North American to South American dispersal of the ancestor
of modern G. raimondii; (iii) a third long-distance dispersal of an African–Asian
A genome ancestor to the New World, likely in the mid-Pleistocene; and (iv) the
chance biological encounter between the immigrant A genome taxon and the
endemic D genome entity, giving rise to hybridization and genome doubling, and
hence the genomic reunion among genomes that had been isolated on different
continents for 5 to 10 million years.
Given this now-justified scenario of a Pleistocene origin, the morphological diversification and spread of the allopolyploid cotton species must have been
relatively rapid following polyploidization. Classically, five allopolyploid species
were widely recognized, but a sixth species (G. ekmanianum Wittm.) was recently
Taxonomy and Evolution of the Cotton Genus, Gossypium
11
resurrected (Krapovickas and Seijo, 2008) and subsequently validated (Grover et
al., 2014) by molecular sequence data. In addition, ongoing studies provide justification for recognition of a seventh species that is relatively closely related
to G. hirsutum, from two islands (Wake, Peale) in the Wake Atoll in the Pacific
Ocean, Gossypium sp. nov. (J.F. Wendel, C.E. Grover, J. Jareczek, and J.P. Gallagher,
unpublished data, 2014). Included among the classically recognized species are
two island endemics, G. darwinii and G. tomentosum, that have differing population structures. Gossypium darwinii, a native of the Galapagos Islands, forms large
and continuous populations in some areas (Wendel and Percy, 1990), whereas
the Hawaiian endemic, G. tomentosum, tends to be more sparse, primarily occurring as scattered individuals and small populations on several islands. The latter
species, G. tomentosum, has recently received some attention with respect to its
reproductive biology and diversity (DeJoode and Wendel, 1992; Hawkins et al.,
2005; Pleasants and Wendel, 2010). A third classical allopolyploid, G. mustelinum
Miers ex G. Watt, is restricted to a relatively small region (Bahia state) of northeast
Brazil, where it survives in apparently relictual and highly homozygous scattered
populations (de Menezes et al., 2014; Wendel et al., 1994). In addition to these three
truly wild species, there exist two species with cultivated forms (G. barbadense
and G. hirsutum). Both species are characterized by large indigenous ranges that
in the aggregate encompass a wealth of morphological forms spanning the wildto-domesticate continuum (Brubaker et al., 1999; Brubaker and Wendel, 1993,
1994; Fryxell, 1979; Hutchinson, 1951, 1959; Hutchinson et al., 1947; Lubbers and
Chee, 2009; Percy and Wendel, 1990; Stephens, 1963). Gossypium hirsutum is widely
distributed in Central America and northern South America, the Caribbean, and
even reaches distant islands in the Pacific (Solomon Islands, Marquesas). Natural
diversity in this species is underscored by its complex nomenclatorial and taxonomic history (Fryxell, 1976, 1979). It is therefore unsurprising that a new species
from the Dominican Republic was recently discovered both in natural settings
and within the Gossypium germplasm collection (Grover et al., 2014; Krapovickas
and Seijo, 2008) and that other undescribed species likely are encompassed by the
diversity presently subsumed under the name G. hirsutum, as exemplified also
by Gossypium sp. nov., as noted above. Gossypium hirsutum has a more northerly
indigenous distribution than G. barbadense, with wild populations occurring as
far north as Tampa Bay, Florida (27°38¢ N) (J.M. Stewart, personal observation).
Gossypium barbadense has an indigenous range centered in the northern third of
South America but with a large region of range overlap with G. hirsutum in the
Caribbean (Percy, 2009).
Consideration of the ecological and geographic distribution of the allopolyploid species has led to the suggestion that polyploidy created the opportunity
to invade an ecological niche that is distinct from that occupied by most diploids. Fryxell (1965, 1979) noted that the truly wild forms of the allopolyploid
species typically occur in coastal habitats, in contrast to most of their diploid relatives. Three species (G. darwinii, Gossypium sp. nov., and G. tomentosum) are island
endemics that typically reside near coastlines, and two of the other species (G.
barbadense and G. hirsutum) possess wild forms that typically occupy littoral habitats tracing the Gulf of Mexico, northwest South America, and reaching distant
Pacific Islands. Fryxell speculated that the adaptations that allowed the newly
evolved allopolyploid to survive, and even thrive, in littoral habitats ultimately
facilitated the establishment of new polyploid species and provided a means for
12
Wendel & Grover
rapid dispersal of the saltwater tolerant seeds. These ideas gain additional support from recent ecological niche modeling studies (d’Eeckenbrugge and Lacape,
2014), which provide an elegant framework for understanding past and present
ecogeographic distributions of wild polyploid cottons.
Phylogenetic Relationships in the Genus
The genealogy of Gossypium has been evaluated by multiple molecular phylogenetic investigations (reviewed in Wendel and Cronn, 2003; Wendel et al., 2012),
each of which demonstrated that the major clades of species (i.e., genealogical
lineages) are largely consistent with geographical distributions, cytogenetic relationships (Baudoin et al., 2009; Endrizzi et al., 1985), and genome designations.
Thus, each taxonomically and cytogenetically established genome group corresponds to a single natural lineage, which also, in most cases, is geographically
cohesive. This information is summarized in a depiction of our present understanding of these relationships (Fig. 2 and 3).
Gossypium phylogenetic history has several broad aspects worth highlighting. First, four major diploid lineages exist that correspond to three continents:
Australia (C, G, and K genomes), the Americas (D genome), and Africa/Arabia
(itself containing two lineages: one comprising the A, B, and F genomes, and a
second solely containing the E genome species). Second, the earliest divergence
event separated the New World D genome lineage from that containing the ancestor of all Old World taxa, approximately 5 to 10 mya, making the New World and
Old World diploids phylogenetic sister groups. Subsequently, the Old World lineage itself diversified into three groups, namely, the Australian cottons (C, G, and
K genome species), the African–Arabian E genome species, and the African A, B,
and F genome cottons. Third, analyses identify the sole F genome species (G. longicalyx, Africa) as sister to the A genome clade (G. arboreum and G. herbaceum), where
the first emergence of fiber suitable for domestication evolved. This information
diagnoses the wild forms that best represent the ancestral lintless condition, and
provides a foundation for ultimately understanding the genetic basis of the origin of useful, long lint (cf. Fig. 2). Fourth, the major lineages of Gossypium are
characterized by rapid radiation and diversification that occurred shortly after
the genus originated and diverged from the Kokia–Gossypioides clade. Fifth, and
finally, the phylogenetic depiction summarizes our current understanding and
uncertainties regarding relationships among the seven allopolyploid species.
From Dispersal Ecology to Human Opportunity
As emphasized in this review, a consideration of the phylogeny of Fig. 2 and 3
in a temporal context leads to the realization that the genus has had an inordinate fondness for transoceanic travel. As already noted, this includes at least
one dispersal between Australia and Africa, another to the Americas (probably
Mexico) leading to the evolution of the D genome diploids, and a second, much
later, colonization of the New World by the A genome ancestor of the AD genome
allopolyploids. Long-distance dispersal played a role not only in diversification of
major evolutionary lines but also in speciation within Gossypium genome groups.
Examples include dispersals from southern Mexico to Peru (G. raimondii), from
northern Mexico to the Galapagos Islands (G. klotzschianum), from western South
Taxonomy and Evolution of the Cotton Genus, Gossypium
13
Fig. 3. Evolutionary history of Gossypium, as inferred from multiple molecular
phylogenetic data sets. The closest relative of Gossypium is a lineage containing the
African–Madagascan genus Gossypioides and the Hawaiian endemic genus Kokia.
Following its likely origin 5 to 10 mya, Gossypium split into three major diploid
lineages: the New World clade (D genome); the African–Asian clade (A, B, E, and F
genomes); and the Australian clade (C, G, and K genomes). This global diversification
involved multiple transoceanic dispersal events and was accompanied by
morphological, ecological, and chromosomal differentiation (genome sizes shown).
Interspecific hybridization has been exceptionally common during the evolution of
the genus. Allopolyploid cottons formed following transoceanic dispersal of an A
genome diploid to the Americas, where the new immigrant underwent hybridization,
as female, with a native D genome diploid similar to modern G. raimondii. Polyploid
cotton probably originated during the Pleistocene (1–2 mya), with the modern
species representing the descendants of an early and rapid colonization of the New
World tropics and subtropics.
America to the Galapagos Islands (G. darwinii), from Africa to the Cape Verde
Islands (G. capitis-viridis), and from the neotropics to the Hawaiian Islands (G.
tomentosum). What is so striking about this history is not just its biographically
fantastical nature, but also the lack of obvious adaptations for saltwater dispersal
14
Wendel & Grover
(Stephens, 1958, 1966). Not only that, but the life history bauplan for Gossypium is
of arid-zone adapted plants that one might assume at first blush do not possess
any capacity for overseas journeys. This aspect of the natural history of the cotton
genus represents a fascinating conundrum, one that lends itself more to speculation than to experimentation. Yet it also highlights the potential evolutionary
significance of the seemingly most improbable and rare events.
Given the absence of obvious adaptations for water dispersal, what can we
say about the evolutionary significance of the single-celled epidermal trichomes
for which cotton is justifiably so famous? The seeds and their coverings are
extraordinarily diverse in Gossypium, as illustrated in Fig. 1, which provides a
visual comparison of the various lineages of wild species and domesticated cotton species. Even though wild and cultivated cottons both produce fiber on the
seed coat, there are striking morphological and structural differences between
these fibers, the most obvious of which is their size. In contrast to the canonical (domesticated) cotton fiber, some D genome species (G. thurberi, G. trilobum,
G. davidsonii, and G. klotzschianum) do not possess obvious seed hairs; however,
the presence of these developmentally repressed structures is apparent under
microscopy (Applequist et al., 2001). Similarly, while the three D genome species
of subsection Caducibracteolata appear hairless to the unaided eye, these species
do in fact have seed hairs that are simply tightly appressed to the seed. Cultivated lint fiber is a model for cell wall developmental biology, as it is single cell
of almost pure cellulose that experiences remarkable elongation during development in cultivated species, achieving a final length up to 6 cm (Arpat et al., 2004;
Haigler et al., 2012; Kim and Triplett, 2001). In contrast, the wild-type fiber cell
is composed mostly of a combination of cellulose and suberin, which elongates
to <1 cm (Applequist et al., 2001; Haigler et al., 2009; Ryser and Holloway, 1985).
These and many other differences reflect both natural evolutionary processes
as well as human-mediated selection during domestication. The duration of the
elongation phase and the timing of onset of secondary wall synthesis appear to
be key determinants of the final length of the fiber in both wild and cultivated
plants (Applequis et al., 2001). Phylogenetic analysis of growth rates has shown
that the evolutionary innovation of prolonged elongation arose in the F genome/A
genome lineage, which may have facilitated the original domestication of the A
genome cottons. This trait of prolonged elongation was passed on to the allopolyploids, which in turn was a key component of their eventual domestication.
Relatively little is known about the developmental and genetic underpinnings of
the diverse morphologies illustrated in Fig. 2, but an improved understanding of
the changes that occurred during evolution and domestication may have implications for crop improvement (Hovav et al., 2007, 2008; Hu et al., 2013; Rapp et al.,
2010; Yoo and Wendel, 2014).
In nature, seed dispersal in Gossypium often follows a “shaker” model, where
erect, mature capsules dehisce along the sutures and the seeds are distributed
near the parental plant as wind shakes the branches. This likely is the ancestral
method of seed dispersal in the genus, as it occurs in all species of the B, C, E,
F, and (in part) D genome groups and in one species (G. bickii) of the G genome
group. The A, AD, (some) D, G, and K genome species have evolved other mechanisms of seed dispersal, with perhaps the most noteworthy innovation being the
fat bodies to facilitate ant dispersal on the seeds of the K genome species (Seelanan et al., 1999). The development of spinnable fibers apparently has occurred
Taxonomy and Evolution of the Cotton Genus, Gossypium
15
only once in the history of Gossypium, in the ancestor of the two A genome species
that became the progenitor of the A genome of the tetraploids.
The evolutionary “purpose” of epidermal seed hairs, or trichomes, is a matter of speculation. Fryxell (1979) suggested that elongated fibers aid dispersal by
birds, a hypothesis that gains credibility from sporadic mention of cotton seeds
in birds’ nests as well as a collection of G. darwinii from a finch’s nest in the Galapagos Islands (Wendel, unpublished data, 1990). One might also speculate that
fibers serve to inhibit germination unless there is sufficient moisture to saturate
the fibers; should germination occur following a light rain, there might not be
sufficient water for subsequent survival of the seedling. In this respect the waxy
coating of the fibers would repel water, to a point, and thus prevent premature
germination. A related possibility is that seed hairs function as “biological incubators” to facilitate germination only when ecological conditions are appropriate,
by recruiting particular microbial communities under appropriate moisture
regimes. Finally, two G genome species (G. australe, G. nelsonii) have evolved stiff
straight seed hairs that facilitate wind dispersal in that the stiffening hairs function to extrude seeds from the locules of the dehisced capsule.
It is against this natural history backdrop that ancient peoples discovered
that the unique properties of cotton fibers made them useful for ropes, textiles,
and other applications. This history has involved several cultures on different
continents and a parallel plant improvement process starting from divergent
and geographically isolated wild ancestors. Each of these crop species (G.
hirsutum, G. barbadense, G. arboreum, G. herbaceum) has its own history of domestication, diversification, and utilization (Brubaker et al., 1999; Hutchinson, 1951,
1959; Hutchinson et al., 1947; Lubbers and Chee, 2009; Percival et al., 1999; Percy,
2009; Percy and Wendel, 1990; Tyagi et al., 2014; Ulloa et al., 2013; Wendel et al.,
1992, 2009, 2010). Cotton domestication, thus, has been a replicated, and in many
respects parallel experiments for the four domesticated species. Many papers
describe various aspects of patterns of genetic diversity, the shape and severity of
genetic bottlenecks that accompanied the development of landraces and cultivars,
and the influence of recent human history on geographic patterns of cultivation
(Brubaker et al., 1999; Brubaker and Wendel, 1994, 2001; Fang et al., 2013; Hutchinson, 1951, 1954, 1959; Hutchinson et al., 1947; Percy and Wendel, 1990; Tyagi et al.,
2014; Wendel et al., 1989, 1992). It is worth reiterating that this history involved
human shaping and molding of naturally occurring diversity that originated
through the process of evolutionary diversification over a period of millions of
years, a legacy we continue to exploit today through deliberate introgression of
exotic germplasm from diverse, wild gene pools (Mehetre, 2010; Stelly, 2014; Wu
et al., 2010; Chapala et al., 2012; He et al., 2011a, 2011b; McCarty and Percy, 2001;
Nacoulima et al., 2012; Niles and Feaster, 1984; Zhang et al., 2011, 2014). Moreover,
new species of cotton and previously unrecognized taxonomic diversity continue
to be discovered today (Alvarez and Wendel, 2006; Feng et al., 2011; Grover et
al., 2014; Krapovickas and Seijo, 2008; Stewart et al., 2015; Ulloa et al., 2006), a
remarkable realization when one considers that Gossypium ranks among our most
important and hence frequently studied plant genera. The wild species of cotton,
consequently, represent an ample genetic repository for potential exploitation by
cotton breeders. The status of much of this diversity in the world’s germplasm
collections was recently reviewed (Campbell et al., 2010). Given current threats to
16
Wendel & Grover
global ecosystems, it has never been more critical than it is today to recognize this
legacy of speciation and diversification for humankind.
Acknowledgments
Research in the Wendel lab has largely been funded by the NSF Plant Genome Program,
with additional support from other NSF programs, the USDA NRI, and Cotton Incorporated.
References
Alvarez, I., R. Cronn, and J.F. Wendel. 2005. Phylogeny of the New World diploid cottons
(Gossypium L., Malvaceae) based on sequences of three low-copy nuclear genes. Plant
Syst. Evol. 252:199–214. doi:10.1007/s00606-004-0294-0
Alvarez, I., and J.F. Wendel. 2006. Cryptic interspecific introgression and genetic differentiation within Gossypium aridum (Malvaceae) and its relatives. Evolution 60:505–517.
doi:10.1111/j.0014-3820.2006.tb01132.x
Applequist, W.L., R.C. Cronn, and J.F. Wendel. 2001. Comparative development of fiber in
wild and cultivated cotton. Evol. Dev. 3:3–17. doi:10.1046/j.1525-142x.2001.00079.x
Arpat, A., M. Waugh, J.P. Sullivan, M. Gonzales, D. Frisch, D. Main, et al. 2004. Functional
genomics of cell elongation in developing cotton fibers. Plant Mol. Biol. 54:911–929.
doi:10.1007/s11103-004-0392-y
Baudoin, J.-P., A. D’Hont, and G. Mergeai. 2009. Bridging classical and molecular cytogenetics of Gossypium. In: A.H. Paterson, editor, Genetics and genomics of cotton. Springer,
New York. p. 257–281.
Beasley, J.O. 1941. Hybridization, cytology, and polyploidy of Gossypium. Chron. Bot.
6:394–395.
Brubaker, C.L., F.M. Bourland, and J.F. Wendel. 1999. The origin and domestication of cotton. In: C.W. Smith and J.T. Cothren, editors, Cotton; origin, history, technology and
production. John Wiley & Sons, New York. p. 3–31.
Brubaker, C.L., and J.F. Wendel. 1993. On the specific status of Gossypium lanceolatum Todaro. Genet. Resour. Crop Evol. 40:165–170. doi:10.1007/BF00051121
Brubaker, C.L., and J.F. Wendel. 1994. Reevaluating the origin of domesticated cotton
(Gossypium hirsutum; Malvaceae) using nuclear restriction fragment length polymorphisms (RFLPs). Am. J. Bot. 81:1309–1326. doi:10.2307/2445407
Brubaker, C.L., and J.F. Wendel. 2001. RFLP diversity in cotton. In: J.N. Jenkins and S. Saha,
editors, Genetic improvement of cotton: Emerging technologies. Science Publishers,
Inc., Enfield, NH. p. 81–102.
Campbell, B., S. Saha, R. Percy, J. Frelichowski, J. Jenkins, W. Park, et al. 2010. Status of the global cotton germplasm resources. Crop Sci. 50:1161–1179. doi:10.2135/
cropsci2009.09.0551
Chapala, M.M., D.B. Weaver, B.T. Campbell, E. van Santan, and R.R. Sharpe. 2012. Exotic
germplasm introgression effect on agronomic and fiber properties of Upland cotton.
In: Proceedings of the National Cotton Council Beltwide Cotton Conference, NCC,
Orlando, FL. 3–6 Jan. 2012. NCC, Memphis, TN. p. 803–807.
Cronn, R.C., R.L. Small, T. Haselkorn, and J.F. Wendel. 2002. Rapid diversification of the
cotton genus (Gossypium: Malvaceae) revealed by analysis of sixteen nuclear and chloroplast genes. Am. J. Bot. 89:707–725. doi:10.3732/ajb.89.4.707
Cronn, R.C., R.L. Small, T. Haselkorn, and J.F. Wendel. 2003. Cryptic repeated genomic
recombination during speciation in Gossypium gossypioides. Evolution 57:2475–2489.
doi:10.1111/j.0014-3820.2003.tb01493.x
Cronn, R., and J.F. Wendel. 2004. Cryptic trysts, genomic mergers, and plant speciation.
New Phytol. 161:133–142. doi:10.1111/j.1469-8137.2004.00947.x
d’Eeckenbrugge, G.C., and J.M. Lacape. 2014. Distribution and differentiation of wild,
feral, and cultivated populations of perennial Upland cotton (Gossypium hirsutum
L.) in mesoamerica and the Caribbean. PLoS ONE 9:e107458. doi:10.1371/journal.
pone.0107458
Taxonomy and Evolution of the Cotton Genus, Gossypium
17
DeJoode, D.R., and J.F. Wendel. 1992. Genetic diversity and origin of the Hawaiian Islands
cotton, Gossypium tomentosum. Am. J. Bot. 79:1311–1319. doi:10.2307/2445059
de Menezes, I.P.P., F.A. Gaiotto, L.V. Hoffmann, A.Y. Ciampi, and P.A.V. Barroso. 2014.
Genetic diversity and structure of natural populations of Gossypium mustelinum, a
wild relative of cotton, in the basin of the De Contas River in Bahia, Brazil. Genetica
(The Hague) 142:99–108.
Endrizzi, J.E., E.L. Turcotte, and R.J. Kohel. 1985. Genetics, cytology, and evolution of Gossypium. Adv. Genet. 23:271–375. doi:10.1016/S0065-2660(08)60515-5
Fang, D.D., L.L. Hinze, R.G. Percy, P. Li, D. Deng, and G. Thyssen. 2013. A microsatellite-based genome-wide analysis of genetic diversity and linkage disequilibrium in
Upland cotton (Gossypium hirsutum L.) cultivars from major cotton-growing countries.
Euphytica 191:391–401. doi:10.1007/s10681-013-0886-2
Feng, C., M. Ulloa, C. Perez-M, and J.M. Stewart. 2011. Distribution and molecular diversity
of arborescent Gossypium species. Botany 89:615–624. doi:10.1139/b11-042
Flagel, L.E., J.F. Wendel, and J.A. Udall. 2012. Duplicate gene evolution, homoeologous
recombination, and transcriptome characterization in allopolyploid cotton. BMC
Genomics 13:302. doi:10.1186/1471-2164-13-302
Fryxell, P.A. 1965. Stages in the evolution of Gossypium. Adv. Frontiers Plant Sci. 10:31–56.
Fryxell, P.A. 1968. A redefinition of the tribe Gossypieae. Bot. Gaz. 129:296–308.
doi:10.1086/336448
Fryxell, P.A. 1971. Phenetic analysis and the phylogeny of the diploid species of Gossypium
L. (Malvaceae). Evolution 25:554–562. doi:10.2307/2407355
Fryxell, P.A. 1976. A nomenclator of Gossypium: The botanical names of cotton. USDA Agricultural Research Service, Washington DC.
Fryxell, P.A. 1979 The natural history of the cotton tribe Texas A&M Univ. Press, College
Station, TX.
Fryxell, P.A. 1992. A revised taxonomic interpretation of Gossypium L. (Malvaceae). Rheedea
2:108–165.
Galau, G.A., and T.A. Wilkins. 1989. Alloplasmic male sterility in AD allotetraploid Gossypium hirsutum upon replacement of its resident A cytoplasm with that of D species G.
harknessii. Theor. Appl. Genet. 78:23–30. doi:10.1007/BF00299748
Grover, C.E., K.K. Grupp, R.J. Wanzek, and J.F. Wendel. 2012. Assessing the monophyly of polyploid Gossypium species. Plant Syst. Evol. 298:1177–1183. doi:10.1007/
s00606-012-0615-7
Grover, C.E., X. Zhu, K. Grupp, J. Jareczek, J.P. Gallagher, E. Szadkowski, et al. 2014. Molecular confirmation of species status for the allopolyploid cotton species, Gossypium
ekmanianum. Genet. Resour. Crop Evol. doi:10.1007/s10722-014-0138-x (in press)
Haigler, C.H., L. Betancur, M.R. Stiff, and J.R. Tuttle. 2012. Cotton fiber: A powerful singlecell model for cell wall and cellulose research. Front. Plant Sci. 3:104.
Haigler, C.H., B. Singh, G. Wang, and D. Zhang. 2009. Genomics of cotton fiber secondary wall deposition and cellulose biogenesis. In: A.H. Paterson, editor, Genetics and
genomics of cotton. Springer, New York. p. 385–417.
Hawkins, J.S., H. Kim, J.D. Nason, R.A. Wing, and J.F. Wendel. 2006. Differential lineage-specific amplification of transposable elements is responsible for genome size
variation in Gossypium. Genome Res. 16:1252–1261. doi:10.1101/gr.5282906
Hawkins, J.S., J. Pleasants, and J.F. Wendel. 2005. Identification of AFLP markers that discriminate between cultivated cotton and the Hawaiian island endemic, Gossypium
tomentosum. Genet. Resour. Crop Evol. 52:1069–1078. doi:10.1007/s10722-004-6115-z
Hawkins, J.S., S.R. Proulx, R.A. Rapp, and J.F. Wendel. 2009. Rapid DNA loss as a counterbalance to genome expansion through retrotransposon proliferation in plants. Proc.
Natl. Acad. Sci. USA 106:17811–17816. doi:10.1073/pnas.0904339106
He, S., J. Sun, and X. Du. 2011. Genetic and epigenetic status of triple exotic consanguinity
cotton introgression lines. Genet. Mol. Res. 10:4063–4072. doi:10.4238/2011.October.3.1
18
Wendel & Grover
He, S.-P., J.-L. Sun, C. Zhang, and X.-M. Du. 2011. Identification of exotic genetic components and DNA methylation pattern analysis of three cotton introgression lines from
Gossypium bickii. Mol. Biol. 45:204–210. doi:10.1134/S002689331102018X
Hendrix, B., and J.M. Stewart. 2005. Estimation of the nuclear DNA content of Gossypium
species. Ann. Bot. (Lond.) 95:789–797. doi:10.1093/aob/mci078
Hovav, R., J.A. Udall, B. Chaudhary, E. Hovav, L. Flagel, G. Hu, et al. 2008. The evolution of
spinnable cotton fiber entailed prolonged development and a novel metabolism. PLoS
Genet. 4:E25. doi:10.1371/journal.pgen.0040025
Hovav, R., J. Udall, E. Hovav, R. Rapp, L. Flagel, and J. Wendel. 2007. A majority of cotton genes are expressed in single-celled fiber. Planta 227:319–329. doi:10.1007/
s00425-007-0619-7
Hu, G., J. Koh, M.J. Yoo, K. Grupp, S. Chen, and J.F. Wendel. 2013. Proteomic profiling of
developing cotton fibers from wild and domesticated Gossypium barbadense. New Phytol. 200:570–582. doi:10.1111/nph.12381
Hutchinson, J.B. 1951. Intra-specific differentiation in Gossypium hirsutum. Heredity 5:161–
193. doi:10.1038/hdy.1951.19
Hutchinson, J.B. 1954. New evidence on the origin of the Old World cottons. Heredity
8:225–241. doi:10.1038/hdy.1954.20
Hutchinson, J.B. 1959. The application of genetics to cotton improvement. Cambridge Univ.
Press, Cambridge, UK.
Hutchinson, J.B., R.A. Silow, and S.G. Stephens. 1947. The evolution of Gossypium and the
differentiation of the cultivated cottons.Oxford Univ. Press, London.
Kim, H.J., and B.A. Triplett. 2001. Cotton fiber growth in planta and in vitro. Models for plant
cell elongation and cell wall biogenesis. Plant Physiol. 127:1361–1366. doi:10.1104/pp.010724
Krapovickas, A., and G. Seijo. 2008. Gossypium ekmanianum (Malvaceae), algodon silvestre
de la Republica Dominicana. Bonplandia 17:55–63.
Li, P., Z. Li, H. Liu, and J. Hua. 2014. Cytoplasmic diversity of the cotton genus as revealed by
chloroplast microsatellite markers. Genet. Resour. Crop Evol. 61:107–119. doi:10.1007/
s10722-013-0018-9
Liu, Q., C.L. Brubaker, A.G. Green, D.R. Marshall, P.J. Sharp, and S.P. Singh. 2001. Evolution
of the FAD2-1 fatty acid desaturase 5’ UTR intron and the molecular systematics of
Gossypium (Malvaceae). Am. J. Bot. 88:92–102. doi:10.2307/2657130
Lubbers, E.L., and P.W. Chee. 2009. The worldwide gene pool of G. hirsutum and its improvement. In: A.H. Paterson, editor, Genetics and genomics of cotton. Springer, New York.
p. 23–52.
McCarty, J.C., and R.G. Percy. 2001. Genes from exotic germplasm and their use in cultivar
improvement in Gossypium hirsutum L. and G. barbadense L. In: J.N. Jenkins and S. Saha,
editors, Genetic improvement of cotton: Emerging technologies. Science Publishers,
Inc., Enfield, NH. p. 65–80.
Mehetre, S.S. 2010. Wild Gossypium anomalum: A unique source of fibre fineness and
strength. Curr. Sci. 99:58–71.
Nacoulima, N.L., J.-P. Baudoin, and G. Mergeai. 2012. Introgression of improved fiber fineness trait in G. hirsutum L. from G. longicalyx Hutch. & Lee. Commun. Agric. Appl. Biol.
Sci. 77(1):207–211
Niles, G.A., and C.V. Feaster. 1984. Breeding. In: R.J. Kohel and C.E. Lewis, editors, Cotton.
ASA, Madison, WI. p. 201–231.
Paterson, A.H., editor. 2009. Genetics and genomics of cotton. Springer, New York.
Percival, A.E., J.M. Stewart, and J.F. Wendel. 1999. Taxonomy and germplasm resources. In:
C.W. Smith and J.T. Cothren, editors, Cotton; origin, history, technology and production. John Wiley and Sons, New York. p. 33–63.
Percy, R.G. 2009. The worldwide gene pool of Gossypium barbadense L. and its improvement.
In: A.H. Paterson, editor, Genetics and genomics of cotton. Springer, New York. p. 53–68.
Percy, R.G., and J.F. Wendel. 1990. Allozyme evidence for the origin and diversification of
Gossypium barbadense L. Theor. Appl. Genet. 79:529–542. doi:10.1007/BF00226164
Taxonomy and Evolution of the Cotton Genus, Gossypium
19
Phillips, L.L. 1966. The cytology and phylogenetics of the diploid species of Gossypium. Am.
J. Bot. 53:328–335. doi:10.2307/2439872
Phuphathanaphong, L. 2006. Thepparatia (Malvaceae), a new genus from Thailand. Thai For.
Bull. (Bot.) 34:195–200.
Pleasants, J.M., and J.F. Wendel. 2010. Reproductive and pollination biology of the
endemic Hawaiian cotton, Gossypium tomentosum (Malvaceae). Pac. Sci. 64:45–55.
doi:10.2984/64.1.045
Rapp, R., C. Haigler, L. Flagel, R. Hovav, J. Udall, and J. Wendel. 2010. Gene expression in
developing fibres of Upland cotton (Gossypium hirsutum L.) was massively altered by
domestication. BMC Biol. 8:139. doi:10.1186/1741-7007-8-139
Rapp, R.A., and J.F. Wendel. 2005. Epigenetics and plant evolution. New Phytol. 168:81–91.
doi:10.1111/j.1469-8137.2005.01491.x
Ryser, U., and P. Holloway. 1985. Ultrastructure and chemistry of soluble and polymeric
lipids in cell walls from seed coats and fibres of Gossypium species. Planta 163:151–163.
doi:10.1007/BF00393501
Saunders, J.H. 1961. The wild species of Gossypium and their evolutionary history.Oxford
Univ. Press, London.
Seelanan, T., C.L. Brubaker, J.M. Stewart, L.A. Craven, and J.F. Wendel. 1999. Molecular
systematics of Australian Gossypium section Grandicalyx (Malvaceae). Syst. Bot. 24:183–
208. doi:10.2307/2419548
Seelanan, T., A. Schnabel, and J.F. Wendel. 1997. Congruence and consensus in the cotton
tribe (Malvaceae). Syst. Bot. 22:259–290. doi:10.2307/2419457
Senchina, D.S., I. Alvarez, R.C. Cronn, B. Liu, J.K. Rong, R.D. Noyes, et al. 2003. Rate variation among nuclear genes and the age of polyploidy in Gossypium. Mol. Biol. Evol.
20:633–643. doi:10.1093/molbev/msg065
Small, R.L., and J.F. Wendel. 1999. The mitochondrial genome of allotetraploid cotton (Gossypium L.). J. Hered. 90:251–253. doi:10.1093/jhered/90.1.251
Stelly, D. 2014. Facilitating the use of wild species germplasm for cotton improvement via
CS lines. In: Proceedings of the National Cotton Council Beltwide Cotton Conference,
NCC, New Orleans, LA. 6–8 Jan. 2014. NCC, Memphis, TN.
Stephens, S.G. 1958. Salt water tolerance of seeds of Gossypium species as a possible factor
in seed dispersal. Am. Nat. 92:83–92. doi:10.1086/282014
Stephens, S.G. 1963. Polynesian cottons. Ann. Mo. Bot. Gard. 50:1–22. doi:10.2307/2394914
Stephens, S.G. 1966. The potential for long range oceanic dispersal of cotton seeds. Am. Nat.
100:199–210. doi:10.1086/282413
Stewart, J.M. 1995. Potential for crop improvement with exotic germplasm and genetic
engineering. In: G.A. Constable and N.W. Forrester, editors, Challenging the future:
Proceedings of the World Cotton Research Conference, Brisbane Australia. 14–17 Feb.
1994. CSIRO, Melbourne, Australia. p. 313–327.
Stewart, J.M., L.A. Craven, C.L. Brubaker, and J.F. Wendel. 2015. Gossypium anapoides (Malvaceae), a new species from Western Australia. Novon 23(4):447–451.
doi:10.3417/2007140
Stewart, J.M., D. Oosterhuis, J.J. Heithholt, and J.R. Mauney. 2010. Physiology of cotton.
Springer, Dordrecht, The Netherlands.
Tiwari, R.S., J. Zhang, and J.M. Stewart. 2014. Molecular diversity based on AFLP markers
and possible natural hybridization among the Australian arid zone Gossypium species.
Aust. J. Crop Sci. 8:674–679.
Tyagi, P., M.A. Gore, D.T. Bowman, B.T. Campbell, J.A. Udall, and V. Kuraparthy. 2014.
Genetic diversity and population structure in the US Upland cotton (Gossypium hirsutum L.). Theor. Appl. Genet. 127:283–295. doi:10.1007/s00122-013-2217-3
Ulloa, M., I.Y. Abdurakhmonov, C. Perez-M, R. Percy, and J.M. Stewart. 2013. Genetic diversity and population structure of cotton (Gossypium spp.) of the New World assessed by
SSR markers. Botany 91:251–259. doi:10.1139/cjb-2012-0192
Ulloa, M., J.M. Stewart, E.A. Garcia-C, S. Godoy-A, A. Gaytan-M, and N.S. Acosta. 2006.
Cotton genetic resources in the western states of Mexico: In situ conservation status
20
Wendel & Grover
and germplasm collection for ex situ preservation. Genet. Resour. Crop Evol. 53:653–
668. doi:10.1007/s10722-004-2988-0
Watt, G. 1907. The wild and cultivated cotton plants of the world. Longmans, Green and
Co., London.
Wendel, J.F. 1989. New World tetraploid cottons contain Old World cytoplasm. Proc. Natl.
Acad. Sci. USA 86:4132–4136. doi:10.1073/pnas.86.11.4132
Wendel, J.F., and V.A. Albert. 1992. Phylogenetics of the cotton genus (Gossypium L.): Character-state weighted parsimony analysis of chloroplast DNA restriction site data and its
systematic and biogeographic implications. Syst. Bot. 17:115–143. doi:10.2307/2419069
Wendel, J.F., C.L. Brubaker, J.P. Alvarez, R.C. Cronn, and J.M. Stewart. 2009. Evolution and
natural history of the cotton genus. In: A.H. Paterson, editor, Genetics and genomics
of cotton. Springer, New York. p. 3–22.
Wendel, J.F., C.L. Brubaker, and A.E. Percival. 1992. Genetic diversity in Gossypium hirsutum
and the origin of Upland cotton. Am. J. Bot. 79:1291–1310. doi:10.2307/2445058
Wendel, J.F., C.L. Brubaker, and T. Seelanan. 2010. The origin and evolution of Gossypium.
In: J.M. Stewart, D. Oosterhuis, J.J. Heithholt, and J.R. Mauney, editors, Physiology of
cotton. Springer, Dordrecht, The Netherlands. p. 1–18.
Wendel, J.F., and R.C. Cronn. 2003. Polyploidy and the evolutionary history of cotton. Adv.
Agron. 78:139–186. doi:10.1016/S0065-2113(02)78004-8
Wendel, J.F., L.E. Flagel, and K.L. Adams. 2012. Jeans, genes, and genomes: Cotton as a
model for studying polyploidy. In: P.S. Soltis and D.E. Soltis, editors, Polyploidy and
genome evolution. Springer-Verlag, Heidelberg. p. 181–207.
Wendel, J.F., P.D. Olson, and J.M. Stewart. 1989. Genetic diversity, introgression and independent domestication of Old World cultivated cottons. Am. J. Bot. 76:1795–1806.
doi:10.2307/2444478
Wendel, J.F., and A.E. Percival. 1990. Molecular divergence in the Galapagos Island–Baja
California species pair, Gossypium klotzschianum and G. davidsonii (Malvaceae). Plant
Syst. Evol. 171:99–115. doi:10.1007/BF00940598
Wendel, J.F., and R.G. Percy. 1990. Allozyme diversity and introgression in the Galapagos
Islands endemic Gossypium darwinii and its relationship to continental G. barbadense .
Biochem. Syst. Ecol. 18:517–528. doi:10.1016/0305-1978(90)90123-W
Wendel, J.F., R. Rowley, and J.M. Stewart. 1994. Genetic diversity in and phylogenetic relationships of the Brazilian endemic cotton, Gossypium mustelinum (Malvaceae). Plant
Syst. Evol. 192:49–59. doi:10.1007/BF00985907
Wendel, J.F., A. Schnabel, and T. Seelanan. 1995. An unusual ribosomal DNA sequence
from Gossypium gossypioides reveals ancient, cryptic, intergenomic introgression. Mol.
Phylogenet. Evol.. 4:298–313. doi:10.1006/mpev.1995.1027
Wendel, J.F., J.M. Stewart, and J.H. Rettig. 1991. Molecular evidence for homoploid reticulate
evolution in Australian species of Gossypium. Evolution 45:694–711. doi:10.2307/2409921
Wu, J., J. McCarty, J. Jenkins, and W. Meredith. 2010. Breeding potential of introgressions
into Upland cotton: Genetic effects and heterosis. Plant Breed. 129:526–532.
Xu, Q., G. Xiong, P. Li, F. He, Y. Huang, K. Wang, et al. 2012. Analysis of complete nucleotide
sequences of 12 Gossypium chloroplast genomes: Origin and evolution of allotetraploids. PLoS ONE 7:E37128. doi:10.1371/journal.pone.0037128
Yoo, M.-J., and J.F. Wendel. 2014. Comparative evolutionary and developmental dynamics of the cotton (Gossypium hirsutum) fiber transcriptome. PLoS Genet. 10:E1004073.
doi:10.1371/journal.pgen.1004073
Yu-xiang, W., C. Jin-hong, H. Qiu-ling, and Z. Shui-jin. 2013. Parental origin and genomic
evolution of tetraploid Gossypium species by molecular marker and GISH analyses.
Caryologia 66:368–374. doi:10.1080/00087114.2013.857830
Zhang, J., R.G. Percy, and J.C. McCarty, Jr. 2014. Introgression genetics and breeding between
Upland and Pima cotton: A review. Euphytica 198:1–12. doi:10.1007/s10681-014-1094-4
Zhang, Z., J. Rong, V.N. Waghmare, P.W. Chee, O.L. May, R.J. Wright, et al. 2011. QTL alleles
for improved fiber quality from a wild Hawaiian cotton, Gossypium tomentosum. Theor.
Appl. Genet. 123:1075–1088. doi:10.1007/s00122-011-1649-x