Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Published April 23, 2015 Taxonomy and Evolution of the Cotton Genus, Gossypium Jonathan F. Wendel* and Corrinne E. Grover Abstract We present an overview of the taxonomy of Gossypium L. (the cotton genus) and its evolutionary history. Gossypium contains more than 50 recognized species, including several recently described, distributed in arid to semiarid regions of the tropics and subtropics. Diversity in Gossypium has been promoted by two seemingly unlikely processes: transoceanic, long-distance dispersal and wide hybridization among lineages that presently are widely separated geographically. Included are four species that were independently domesticated for their seed fiber—two diploids from Africa–Asia and two allopolyploids from the Americas. This repeated domestication of different wild progenitors represents a remarkable case of human-driven parallel evolution. Morphological variation in Gossypium is extensive; growth forms in the genus range from sprawling herbaceous perennials to ?15-m-tall trees, representing a notable array of reproductive and vegetative characteristics. Equally impressive is the striking cytogenetic and genomic diversity that emerged as Gossypium diversified and spread worldwide, ultimately spawning eight groups of closely related diploid (n = 13) species (i.e., genome groups A through G, and K). DNA sequence data place the origin of Gossypium at about 5 to 10 million years ago (mya), which rapidly diversified into these major genome groups shortly thereafter. Allopolyploid cottons appeared within the last 1 to 2 million years, a consequence of the improbable transoceanic dispersal of an A genome taxon to the New World and subsequent hybridization with an indigenous D genome diploid. Diversification of the nascent allopolyploid gave rise to three modern lineages containing seven species, including the agronomically important G. hirsutum L. and G. barbadense L. B ecause of its economic importance, the cotton genus (Gossypium L.) has been of interest to agricultural scientists, taxonomists, and many other kinds of biologists. Accordingly, a considerable amount is understood regarding the origin and diversification of the genus, its basic plant biology, and its properties as a crop plant (Paterson, 2009; Stewart et al., 2010; Wendel et al., 2009, 2010, 2012). Increasingly over the past two decades, classic taxonomic questions such as the origin of the polyploid species, the relationships among species and species Abbreviations: mya, million years ago. Jonathan F. Wendel and Corrinne E. Grover ([email protected]), Dep. of Ecology, Evolution, and Organismal Biology, Bessey Hall, Iowa State University, Ames, IA 50011. *Corresponding author ([email protected]). doi:10.2134/agronmonogr57.2013.0020 Copyright © 2015. ASA, CSSA, and SSSA, 5585 Guilford Road, Madison, WI 53711, USA. Cotton. 2nd. ed. David D. Fang and Richard G. Percy, editors. Agronomy Monograph 57. 1 2 Wendel & Grover groups, and the origins of the domesticated forms from their wild progenitors have been revisited using modern molecular technologies (Wendel et al., 2009, 2010, 2012; Wendel and Cronn, 2003). One of the salient features of the genus is that its history is so encompassing in scope, involving a global phylogenetic diversification and a repeated history of transoceanic dispersal. A remarkable consequence of this global spread and natural diversification has been that it provided the opportunity for different ancient human cultures on several continents to independently domesticate four different cotton species—two allopolyploids from the Americas, G. hirsutum and G. barbadense, and two diploids from Africa– Asia, G. arboreum L. and G. herbaceum L. The canvas for this human history is the natural evolutionary diversification of the genus, one that involves multiple, seemingly impossible hybridization events, often between lineages that presently are geographically disjunct or even on different continents, as well as multiple, remarkable cases of transoceanic, long-distance dispersals. These twin themes of cryptic intergenomic introgression or hybridization and long-distance dispersal have repeatedly come into play as the genus diversified and colonized much of the arid to semiarid regions of the tropics and subtropics. A wealth of recent studies have provided new insights into Gossypium taxonomy and diversity, suggesting that it is an appropriate time to update our understanding of the evolution, taxonomy, and biogeographic history of the genus. Here we provide such an update, drawing on our earlier reviews (Wendel et al., 2009, 2010, 2012; Wendel and Cronn, 2003) and with new syntheses. Emergence of the Genus Gossypium The cotton genus belongs to the Gossypieae, a small taxonomic tribe that includes only nine genera, eight of which are classically recognized (Fryxell, 1968, 1979) and one that is a new segregate from Cienfuegosia (Phuphathanaphong, 2006). Five of these genera are small with restricted geographic distributions (Fryxell, 1968, 1979; Phuphathanaphong, 2006) including Lebronnecia (Marquesas Islands), Cephalohibiscus (New Guinea, Solomon Islands), Gossypioides (East Africa, Madagascar), Thepparatia (known only from the type locality in Northern Thailand) and Kokia (Hawaii). The tribe also includes four moderately sized genera with broader geographic ranges: Hampea (21 neotropical species), the relatively diverse Cienfuegosia (25 species from the neotropics and parts of Africa), Thespesia (17 tropical species), and Gossypium, whose 50+ species (Fryxell, 1992) make it the largest and most widely distributed genus in the tribe. Thus, in aggregate this small taxonomic tribe has achieved an expansive geographic range, though many of the included taxa are quite rare and/or narrowly distributed. Molecular phylogenetic analyses have shed light on three important aspects of the evolutionary history of the tribe (Cronn et al., 2002; Seelanan et al., 1997). First and foremost is that despite their extensive distribution and extraordinary diversity, the group of species that belong to Gossypium do constitute a single natural lineage. A second important revelation has been the identity of the closest relatives of Gossypium, that is, the African–Madagascan genus Gossypioides and the Hawaiian endemic genus Kokia. This is unsuspected finding, given their biogeography and lower chromosome number (n = 12), yet it is quite important in that they may serve as outgroups in phylogenetic analyses, which provides an essential context both for studying evolutionary patterns and processes within Taxonomy and Evolution of the Cotton Genus, Gossypium 3 Gossypium and for understanding genome evolution. The third important insight gained from these molecular analyses was the provision of a temporal component to the major divergences. Using sequence divergence data as a proxy for time (i.e., a “molecular clock”), Seelanan et al. (1997) suggested that Gossypium diverged from Kokia and Gossypioides circa 12.5 mya, an estimate bolstered by a later, more extensive data set containing 10 different nuclear genes (Cronn et al., 2002). Collectively, all molecular data suggest that Gossypium diverged from its closest relatives approximately 10 to 15 mya during the Miocene and acquired its modern, worldwide distribution by multiple transoceanic dispersals combined with regional speciation. Diversification of the Diploid Cotton Species From the time of origin 5 to 10 mya, Gossypium experienced rapid speciation and diversification, while also achieving a nearly worldwide distribution containing several primary centers of diversity in the arid or seasonally arid tropics and subtropics (Table 1). Species-rich regions include Australia, especially the Kimberley region in NW Australia; the Horn of Africa and southern Arabian Peninsula; and the western part of central and southern Mexico. The taxonomy of the genus has been well studied (Cronn et al., 2002; Fryxell, 1979, 1992; Hutchinson et al., 1947; Saunders, 1961; Seelanan et al., 1997; Watt, 1907), with the taxonomic classification of Fryxell (1979, 1992) being the most modern and widely followed. In this treatment, species are grouped into four subgenera and eight sections (Table 1). The Table 1. Diversity and geographic distribution of the major lineages of Gossypium. Genomic placements and taxonomic status of species enclosed by parentheses are yet to be validated by cytogenetic and other means. Genome (no. of species) Recognized species A (2) G. arboreum, G. herbaceum B (3–4) G. anomalum, G. triphyllum, G. capitis-viridis, (G. trifurcatum) C (2) G. sturtianum, G. robinsonii D (13–14) G. thurberi, G. armourianum, G. harknessii, G. davidsonii, G. klotzschianum, G. aridum, G. raimondii, G. gossypioides, G. lobatum, G. trilobum, G. laxum, G. turneri, G. schwendimanii, (Gossypium sp. nov.) E (5–9) G. stocksii Mast., G. somalense (Gurke) J.B. Hutch., G. areysianum Deflers, G. incanum (O. Schwartz) Hillc., G. trifurcatum, (G. benadirense), (G. bricchettii), (G. vollesenii Fryxell), (G. trifurcatum) F (1) G. longicalyx G (3) G. bickii, G. australe, G. nelsonii K (12) G. anapoides, G. costulatum, G. cunninghamii, G. enthyle Fryxell et al., G. exiguum Fryxell et al., G. londonderriense Fryxell et al., G. marchantii, G. nobile Fryxell et al., G. pilosum Fyxell, G. populifolium (Benth.) F. Muell. ex Tod., G. pulchellum (C.A. Gardner) Fryxell, G. rotundifolium Fryxell et al. AD (7) G. hirsutum, G. barbadense, G. tomentosum, G. mustelinum, G. darwinii, G. ekmanianum, Gossypium sp. nov.† Geographic distribution Africa, Asia Africa, Cape Verde Islands Australia primarily Mexico, with range extensions into Peru, Galapagos Islands, Arizona Arabian Peninsula, Northeast Africa, Southwest Asia East Africa Australia Northwest Australia, Cobourg Peninsula, Northern Territory, Australia New World tropics and subtropics, including Hawaii, the Wake Atoll, and the Galapagos Islands † J.F. Wendel, C.E. Grover, J. Jareczek, and J.P. Gallagher, unpublished data, 2014. 4 Wendel & Grover classification represents the accumulated scientific understanding that emerged both from basic plant exploration and taxonomic and evolutionary study. While these species groupings are based primarily on morphological and geographical evidence, most infrageneric alignments are congruent with subsequent cytogenetic and molecular data sets (Endrizzi et al., 1985; Wendel and Albert, 1992; Wendel et al., 2009, 2010). The current circumscription of Gossypium presently includes approximately 50 species (Fryxell, 1992), although new species continue to be discovered (Alvarez and Wendel, 2006; Grover et al., 2014; Krapovickas and Seijo, 2008; Stewart et al., 2015; Ulloa et al., 2006). Morphologically, the genus is exceptionally diverse; species morphologies range from fire-adapted, herbaceous perennials with massive underground rootstocks in NW Australia to trees in SW Mexico that escape the dry season by dropping their leaves. Corolla colors span a rainbow of blue to purple [G. triphyllum (Harv.) Hochr.], mauves and pinks (G. sturtianum J.H. Willis, or “Sturt’s Desert Rose,” the official floral emblem of the Northern Territory, Australia), whites and pale yellows (NW Australia, Mexico, Africa–Arabia), and even a deep sulfur-yellow (G. tomentosum Nutt. ex Seem., Hawaii). Seed coverings are equally diverse; these range from nearly glabrous to the naked eye (e.g., G. klotzschianum Andersson and G. davidsonii Kellogg), to short, stiff, dense, brown hairs that aid in wind dispersal (G. australe F. Muell and G. nelsonii Fryxell), to the long, fine, white fibers that characterize highly improved forms of the four cultivated species (Fig. 1). There are even seeds that produce fat bodies to facilitate ant-dispersal (Seelanan et al., 1999). In addition to the remarkable morphological diversity, the genus also experienced extensive chromosomal evolution during diversification (Baudoin et al., 2009; Endrizzi et al., 1985). Despite sharing the same chromosome number (n = 13), genome sizes among the diploid species vary more than threefold (Hendrix and Stewart, 2005). Among closely related species, chromosome morphology is quite similar, as is demonstrated by the ability of closely related species to form hybrids that undergo normal meiotic pairing and that often experience high F1 fertility. In contrast, crosses between more distantly related species are often difficult to generate, with the few successful hybrids typically experiencing meiotic abnormalities. Extensive observations concerning the pairing behavior, chromosome sizes, and relative fertility in interspecific hybrids ultimately led to the clustering of closely related species into “genome groups,” designated by single-letter genome symbols (Beasley, 1941). Currently, there are eight recognized diploid genome groups (A through G, plus K) (Endrizzi et al., 1985; Stewart, 1995), whose species inclusions are largely congruent with taxonomic and phylogenetic placements. A temporal framework for the origin of Gossypium and its diversification is provided by sequence divergence data serving as a proxy for time (Senchina et al., 2003). These analyses indicate that Gossypium diverged from its closest relatives during the Miocene approximately 10 to 15 mya to originate approximately 5 to 10 mya, and these estimates of divergence times have recently been supported by an enormous amount of new sequence data derived from a global assembly of expressed sequence tags (Flagel et al., 2012). A consideration of the near worldwide distribution of species combined with their phylogenetic history (Fig. 2), relative to plate tectonic history, indicates that the evolution of Gossypium has involved multiple episodes of transoceanic dispersal. Minimally, these include Taxonomy and Evolution of the Cotton Genus, Gossypium 5 Fig. 1. Representative seed and trichome diversity in Gossypium. Seed and trichome size and morphology are exceedingly variable in the genus. Most wild species have relatively small seeds (<5 mm in any dimension) with equally short fibers. Long (spinnable) fiber evolved only once, in the ancestor of modern A genome cottons, which subsequently donated this capacity to modern allopolyploid species, including the commercially important G. hirsutum and G. barbadense, at the time of allopolyploid formation. Key to species: Cult. AD1, G. hirsutum TM1; Wild AD1, G. hirsutum Tx2094 from the Yucatan Peninsula; AD3, G. tomentosum WT936 from Hawaii; C1, G. sturtianum C1-4 from Australia; Cult. A2, G. arboreum AKA8401; Wild A1, G. herbaceum subsp. africanum (Watt) Vollesen from Botswana; D5, G. raimondii from Peru; D3, G. davidsonii D3d-32 from Baja California; F1, G. longicalyx F1-3 from Tanzania; B1, G. anomalum B1-1 from Africa. at least one dispersal between Australia and Africa; another to the Americas, which led to the evolution of the D genome diploids; and a second, much later colonization of the New World by the A genome ancestor of the AD genome allopolyploids (see below). Out of Africa, the Cytogenetic Center of Diversity in the Genus (A, B, E, and F Genomes) An important implication of the phylogenetic data for Gossypium is that it supports the proposition that Africa is the ancestral home for the genus (Fig. 2). This is not strictly true from a purely cladistic standpoint, in that the basal phylogenetic split is between the New World and Old World diploids; however, the supposition that Gossypium originated in Africa is supported by the genomic Fig. 2. Biogeographic history of Gossypium. Following its likely origin 5 to 10 mya, Gossypium split into three major diploid lineages: the New World clade (D genome); the African–Asian clade (A , B, E, and F genomes); and the Australian clade (C, G, and K genomes). The diversity of genome sizes and seed morphologies in the genus are shown with exemplars from each genome group. See text for additional details. 6 Wendel & Grover Taxonomy and Evolution of the Cotton Genus, Gossypium 7 diversity found in the region (see below), versus the relative homogeneity found in the New World. Fourteen species from Africa and Arabia are recognized in the most recent taxonomic treatment of the genus (Fryxell, 1992) and together compose the African–Arabian subgenus Gossypium. Recognized within the subgenus Gossypium are two taxonomic sections: section Gossypium, which contains four subsections, and another African–Arabian section (Serrata), which contains the single species G. trifurcatum Vollesen (found in desert area of eastern Somalia). The presence of dentate leaves has raised the question of whether G. trifurcatum belongs in the genus Gossypium at all, as dentate leaves are more characteristic of Cienfugosia; however, molecular work has established that this poorly known specimen is indeed a cotton species, if unusual in form (Rapp and Wendel, 2005). This example underscores the provisional nature of much of the taxonomy of the African–Arabian species of Gossypium, which are sorely in need of basic plant exploration and systematic study. Other examples include subsection Pseudopambak, which contains several species whose recognition and definition are based solely on limited herbarium material (e.g., G. benadirense Mattei, G. bricchettii (Ulbr.) Vollesen, G. vollesenii). As no seeds have been collected and no living specimens are available, there exists no information on the cytogenetic characteristics and molecular phylogenetic placement for these species. It is nearly certain that new taxonomic diversity remains to be discovered in this region of the world. Cytogenetically, the African–Arabian species are rather diverse and account for four of the eight genome groups (A, B, E, and F), which neatly correspond to their taxonomic subsections. The A genome group contains the two cultivated cottons of subsection Gossypium, G. arboreum and G. herbaceum, while the three African species in the B genome (G. anomalum Wawra, G. capitis-viridis Mauer, and G. triphyllum) compose subsection Anomala. It bears noting that G. trifurcatum (taxonomic section Serrata) may in fact belong to the B genome, but this has not been established. The sole F genome species, G. longicalyx J.B. Hutch. & B.J.S. Lee, represents subsection Longiloba, and, of the section Gossypium, it is considered cytogenetically distinct (Phillips, 1966), morphologically isolated (Fryxell, 1971, 1992), and according to Fryxell may be better adapted to more mesic conditions than any other diploid Gossypium species. Further adding to the interest in this species is the phylogenetic placement of G. longicalyx as sister to the A genome taxa (Fig. 2), which provides a window into the evolutionary origin of “long” fiber. The remaining African–Asian species, those of subsection Pseudopambak, are considered to possess E genomes, although this has yet to be verified. To Australia, with Genomic Gigantism and New Ecological Specializations Australia represents another center of diversity for the cotton genus. The Australian cottons (subgenus Sturtia) are represented by 3 taxonomic sections containing 17 total species, including one, G. anapoides J.M. Stewart et al., nom. inval., that is newly described (Stewart et al., 2015). Cytogenetically, these taxa are divided into the C, G, and K genome groups, having 2, 3, and 12 species, respectively. DNA sequence data (Liu et al., 2001; Seelanan et al., 1997, 1999) support these groups as natural lineages, consistent with their formal alignments into the taxonomic sections Sturtia (C genome), Hibiscoidea (G genome), and Grandicalyx (K genome), although the relationships among these three are yet unresolved. Some data 8 Wendel & Grover support a west to east radiation of Gossypium in Australia by placing G. robinsonii F. Muell. (C genome) as the basal-most branch of the Australian clade (DeJoode and Wendel, 1992). This basal placement of the C genome has not been observed in the most recent analyses (Liu et al., 2001; Seelanan et al., 1997, 1999), which provide no resolution among the Australian lineages and thereby highlight the uncertainty in the phylogenetic history of the Australian cottons. Taxonomically, the C and G genome groups are well understood, as these are amply represented in collections and have been thoroughly studied (DeJoode and Wendel, 1992; Fryxell, 1979, 1992; Liu et al., 2001; Seelanan et al., 1997, 1999; Tiwari et al., 2014; Wendel et al., 1991). Species in the G genome are interfertile and apparently hybridize commonly (Tiwari et al., 2014; Wendel et al., 1991). The taxonomy of the K genome (section Grandicalyx), however, is less certain. Collecting expeditions to the Kimberley region have doubled the number of recognized K genome species in the past two decades through the discovery of at least seven new species (Fryxell, 1992; Stewart et al., 2015) and have expanded our understanding of the diversity in this section. The species of this section are geographically, morphologically, and ecologically distinct from other cotton species, frequently exhibiting features characteristic of fire adaptation. Specifically, these are herbaceous perennials that exhibit a biseasonal growth pattern, where vegetative growth dies back during the dry season and the plants survive as underground rootstocks until fire or the return of the wet season initiates a new cycle of growth. Also unique to section Grandicalyx are the flower and fruit morphologies. K genome species possess upright flowers (like other cotton species) that subsequently become pendant following pollination. This change in floral positioning is thought to aid in these uniquely ant-dispersed cotton seeds. That is, at maturity the pendant capsules release sparsely haired seeds, which bear elaiosomes (fat bodies) that attract the ants who participate in their dispersal. Nearly all of K genome species are poorly represented in collections (Campbell et al., 2010), leading to much of this taxonomic uncertainty. These species have enormous genomes, relative to others in the genus (Fig. 2), reflecting a relatively recent proliferation of gypsy-like transposable elements (Hawkins et al., 2006). Phylogenetic analyses have yielded conflicting results regarding interspecific relationships in section Grandicalyx (Liu et al., 2001; Seelanan et al., 1999), although there is some support for a fundamental division of the section into a group of prostrate species and a clade of more upright plants. One final comment about the Australian species concerns their apparent propensity to hybridize, a somewhat surprising observation given their present status as species whose populations are mostly small, scattered, and geographically highly disjunct from those of related species. These geographical and often phenological barriers notwithstanding, there is abundant evidence for multiple episodes of historical introgression in the group, involving G. bickii Prokh., G. sturtianum, and G. cunninghamii Tod. (Cronn and Wendel, 2004; Seelanan et al., 1999; Tiwari et al., 2014; Wendel et al., 1991). To the New World, Part I: The D Genome Diploids A watershed moment in the evolutionary history of Gossypium followed a seemingly impossible transoceanic (either Atlantic or Pacific) dispersal of an African ancestor to the shores of Central America, possibly on what is now the western Taxonomy and Evolution of the Cotton Genus, Gossypium 9 coast of Mexico. This dispersal event, dating to about 5 to 10 mya, led to the evolution and diversification of the New World D genome clade. This assemblage of species have unusually small genomes for the genus (Fig. 2), due at least in part to deletional clearance of much of the transposable element fraction (Hawkins et al., 2009), and none bear spinnable fiber (Fig. 1). Taxonomically placed into subgenus Houzingenia, the species are aligned into two sections and six subsections. These species have been more thoroughly studied than most, and consequently their taxonomy is reasonably well understood. Nonetheless, there remains sufficient cryptic variability in the group that much remains to be learned about diversity and relationships (Alvarez and Wendel, 2006; Cronn et al., 2003; Feng et al., 2011; Ulloa et al., 2006, 2013; Wendel and Cronn, 2003; Wendel et al., 1995). This is particularly true for the “arborescent clade” comprising the group of small trees that includes the widespread G. aridum (Rose & Standl.) Skovst. and the more narrowly distributed G. laxum L. Ll. Phillips, G. lobatum Gentry, and G. schwendimanii Fryxell & S.D. Koch. Evidence indicates that cryptic variation exists within this clade, possibly including new species related to G. aridum (Alvarez et al., 2005; Feng et al., 2011) and G. laxum (Alvarez et al., 2005; Ulloa et al., 2006). There even are populations of G. aridum from Colima that have a plastid genome much more similar to those from G. klotzschianum (from the Galapagos Islands) and G. davidsonii (from Baja California) than to all other G. aridum populations, likely reflecting historical interspecific gene flow (Alvarez and Wendel, 2006; Wendel and Albert, 1992). Gossypium gossypioides (Ulbr.) Standl. also has a somewhat mysterious introgressive ancestry (Cronn et al., 2003; Wendel et al., 1995), pointing out once again the significance of ancient, natural interspecific hybridization in the generation of diversity in the cotton genus (Cronn and Wendel, 2004). The D genome cottons also have received considerable phylogenetic attention (Alvarez et al., 2005; Cronn et al., 2002, 2003; Feng et al., 2011; Ulloa et al., 2006, 2013; Wendel and Albert, 1992), which provides support for the naturalness of most of the subsections. Evolutionary relationships among the apparently natural subsections are less certain, however (Alvarez et al., 2005; Ulloa et al., 2013), although available evidence suggests that G. gossypioides is basal-most within the subgenus (Cronn et al., 2003; Feng et al., 2011; Yu-xiang et al., 2013). All analyses support the monophyly of the arborescent clade, the grouping of the Baja California species (G. harknessii Brandegee–G. armourianum Kearney–G. turneri Fryxell) as a distinct clade, and the naturalness of the G. thurberi Tod.–G. trilobum (DC.) Skovst. species pair, but the relative phylogenetic placements of these three groups with G. raimondii Ulbr. remain uncertain. Twelve of the 14+ D genome diploid species are endemic to western Mexico, indicating this area is the center of diversity of the D genome. It seems likely, therefore, that this lineage became established and initially diversified in this region. From this we can deduce later range extensions, probably relatively recently (during the Pleistocene), following long-distance dispersals leading to the evolution of endemics in Peru (G. raimondii) and the Galapagos Islands (G. klotzschianum). The latter species is closely related to G. davidsonii from Baja California, representing a clear and unusual case of this floristic connection (Wendel and Percival, 1990). As for G. raimondii, it is interesting that the sole South American representative of the New World diploid cottons is implicated as the closest ancestor of the D genome of allopolyploid cotton (more on this below); the implication, perhaps ironic, is that if allopolyploidy occurred in northern South America, as suggested 10 Wendel & Grover (but not shown) by this observation, the possibility exists that they never would have evolved had colonization of the New World by an A genome propagule not followed closely on the heels of an earlier interhemispheric North American to South American dispersal of the ancestor of what we now recognize as G. raimondii. This aspect of the origin of the allopolyploids makes an improbable story all the more remarkable and yet again points to the recurring theme of the evolutionary significance of long-distance dispersal in cotton diversification. To the New World, Part II: Origin and Diversification of the Polyploid Cottons A rich body of cytogenetic and experimental evidence now has demonstrated convincingly that the tetraploid cotton species, which are entirely New World in their distribution, are allopolyploids containing two coresident genomes: one from an African or Asian A genome species and the other from a species similar to the American D genome diploids (Endrizzi et al., 1985; Wendel, 1989; Wendel and Cronn, 2003; Wendel et al., 2012). The hemisphere-scale allopatry of these two diploid genome groups led to many decades of mystery surrounding the timing and parentage of the New World allopolyploids. The question of “when” has been convincingly addressed through gene sequence data that indicate allopolyploid Gossypium originated approximately 1 to 2 mya in the mid-Pleistocene, before the evolution of modern humans but relatively recently in geological terms (Flagel et al., 2012; Senchina et al., 2003; Wendel, 1989). The question of parentage has also been clarified through a wealth of DNA sequence data that indicate: (i) both extant A genome species (G. arboreum, G. herbaceum) are equally distant from the allopolyploid A genome and (ii) G. raimondii is the closest extant relative of the actual D genome donor of the allopolyploid (Endrizzi et al., 1985; Li et al., 2014; Wendel, 1989; Wendel and Cronn, 2003; Wendel et al., 2012; Yu-xiang et al., 2013). Additionally, all allopolyploids contain an A genome cytoplasm, as evidenced from analysis of both mitochondrial and plastidial genomes (Galau and Wilkins, 1989; Li et al., 2014; Small and Wendel, 1999; Wendel, 1989; Xu et al., 2012). Finally, the studies mentioned above, and additional, multilocus DNA sequence data (Grover et al., 2012), support a single origin for allopolyploid cotton. As noted above, one truly remarkable feature of the origin of allopolyploid cottons is that their genesis likely was dependent on the following series of highly improbable events, which when considered in their totality amount to a biological marvel: (i) an ancient colonization of the New World by the ancestor of the D genome diploids, following an initial transoceanic dispersal; (ii) a more recent interhemispheric North American to South American dispersal of the ancestor of modern G. raimondii; (iii) a third long-distance dispersal of an African–Asian A genome ancestor to the New World, likely in the mid-Pleistocene; and (iv) the chance biological encounter between the immigrant A genome taxon and the endemic D genome entity, giving rise to hybridization and genome doubling, and hence the genomic reunion among genomes that had been isolated on different continents for 5 to 10 million years. Given this now-justified scenario of a Pleistocene origin, the morphological diversification and spread of the allopolyploid cotton species must have been relatively rapid following polyploidization. Classically, five allopolyploid species were widely recognized, but a sixth species (G. ekmanianum Wittm.) was recently Taxonomy and Evolution of the Cotton Genus, Gossypium 11 resurrected (Krapovickas and Seijo, 2008) and subsequently validated (Grover et al., 2014) by molecular sequence data. In addition, ongoing studies provide justification for recognition of a seventh species that is relatively closely related to G. hirsutum, from two islands (Wake, Peale) in the Wake Atoll in the Pacific Ocean, Gossypium sp. nov. (J.F. Wendel, C.E. Grover, J. Jareczek, and J.P. Gallagher, unpublished data, 2014). Included among the classically recognized species are two island endemics, G. darwinii and G. tomentosum, that have differing population structures. Gossypium darwinii, a native of the Galapagos Islands, forms large and continuous populations in some areas (Wendel and Percy, 1990), whereas the Hawaiian endemic, G. tomentosum, tends to be more sparse, primarily occurring as scattered individuals and small populations on several islands. The latter species, G. tomentosum, has recently received some attention with respect to its reproductive biology and diversity (DeJoode and Wendel, 1992; Hawkins et al., 2005; Pleasants and Wendel, 2010). A third classical allopolyploid, G. mustelinum Miers ex G. Watt, is restricted to a relatively small region (Bahia state) of northeast Brazil, where it survives in apparently relictual and highly homozygous scattered populations (de Menezes et al., 2014; Wendel et al., 1994). In addition to these three truly wild species, there exist two species with cultivated forms (G. barbadense and G. hirsutum). Both species are characterized by large indigenous ranges that in the aggregate encompass a wealth of morphological forms spanning the wildto-domesticate continuum (Brubaker et al., 1999; Brubaker and Wendel, 1993, 1994; Fryxell, 1979; Hutchinson, 1951, 1959; Hutchinson et al., 1947; Lubbers and Chee, 2009; Percy and Wendel, 1990; Stephens, 1963). Gossypium hirsutum is widely distributed in Central America and northern South America, the Caribbean, and even reaches distant islands in the Pacific (Solomon Islands, Marquesas). Natural diversity in this species is underscored by its complex nomenclatorial and taxonomic history (Fryxell, 1976, 1979). It is therefore unsurprising that a new species from the Dominican Republic was recently discovered both in natural settings and within the Gossypium germplasm collection (Grover et al., 2014; Krapovickas and Seijo, 2008) and that other undescribed species likely are encompassed by the diversity presently subsumed under the name G. hirsutum, as exemplified also by Gossypium sp. nov., as noted above. Gossypium hirsutum has a more northerly indigenous distribution than G. barbadense, with wild populations occurring as far north as Tampa Bay, Florida (27°38¢ N) (J.M. Stewart, personal observation). Gossypium barbadense has an indigenous range centered in the northern third of South America but with a large region of range overlap with G. hirsutum in the Caribbean (Percy, 2009). Consideration of the ecological and geographic distribution of the allopolyploid species has led to the suggestion that polyploidy created the opportunity to invade an ecological niche that is distinct from that occupied by most diploids. Fryxell (1965, 1979) noted that the truly wild forms of the allopolyploid species typically occur in coastal habitats, in contrast to most of their diploid relatives. Three species (G. darwinii, Gossypium sp. nov., and G. tomentosum) are island endemics that typically reside near coastlines, and two of the other species (G. barbadense and G. hirsutum) possess wild forms that typically occupy littoral habitats tracing the Gulf of Mexico, northwest South America, and reaching distant Pacific Islands. Fryxell speculated that the adaptations that allowed the newly evolved allopolyploid to survive, and even thrive, in littoral habitats ultimately facilitated the establishment of new polyploid species and provided a means for 12 Wendel & Grover rapid dispersal of the saltwater tolerant seeds. These ideas gain additional support from recent ecological niche modeling studies (d’Eeckenbrugge and Lacape, 2014), which provide an elegant framework for understanding past and present ecogeographic distributions of wild polyploid cottons. Phylogenetic Relationships in the Genus The genealogy of Gossypium has been evaluated by multiple molecular phylogenetic investigations (reviewed in Wendel and Cronn, 2003; Wendel et al., 2012), each of which demonstrated that the major clades of species (i.e., genealogical lineages) are largely consistent with geographical distributions, cytogenetic relationships (Baudoin et al., 2009; Endrizzi et al., 1985), and genome designations. Thus, each taxonomically and cytogenetically established genome group corresponds to a single natural lineage, which also, in most cases, is geographically cohesive. This information is summarized in a depiction of our present understanding of these relationships (Fig. 2 and 3). Gossypium phylogenetic history has several broad aspects worth highlighting. First, four major diploid lineages exist that correspond to three continents: Australia (C, G, and K genomes), the Americas (D genome), and Africa/Arabia (itself containing two lineages: one comprising the A, B, and F genomes, and a second solely containing the E genome species). Second, the earliest divergence event separated the New World D genome lineage from that containing the ancestor of all Old World taxa, approximately 5 to 10 mya, making the New World and Old World diploids phylogenetic sister groups. Subsequently, the Old World lineage itself diversified into three groups, namely, the Australian cottons (C, G, and K genome species), the African–Arabian E genome species, and the African A, B, and F genome cottons. Third, analyses identify the sole F genome species (G. longicalyx, Africa) as sister to the A genome clade (G. arboreum and G. herbaceum), where the first emergence of fiber suitable for domestication evolved. This information diagnoses the wild forms that best represent the ancestral lintless condition, and provides a foundation for ultimately understanding the genetic basis of the origin of useful, long lint (cf. Fig. 2). Fourth, the major lineages of Gossypium are characterized by rapid radiation and diversification that occurred shortly after the genus originated and diverged from the Kokia–Gossypioides clade. Fifth, and finally, the phylogenetic depiction summarizes our current understanding and uncertainties regarding relationships among the seven allopolyploid species. From Dispersal Ecology to Human Opportunity As emphasized in this review, a consideration of the phylogeny of Fig. 2 and 3 in a temporal context leads to the realization that the genus has had an inordinate fondness for transoceanic travel. As already noted, this includes at least one dispersal between Australia and Africa, another to the Americas (probably Mexico) leading to the evolution of the D genome diploids, and a second, much later, colonization of the New World by the A genome ancestor of the AD genome allopolyploids. Long-distance dispersal played a role not only in diversification of major evolutionary lines but also in speciation within Gossypium genome groups. Examples include dispersals from southern Mexico to Peru (G. raimondii), from northern Mexico to the Galapagos Islands (G. klotzschianum), from western South Taxonomy and Evolution of the Cotton Genus, Gossypium 13 Fig. 3. Evolutionary history of Gossypium, as inferred from multiple molecular phylogenetic data sets. The closest relative of Gossypium is a lineage containing the African–Madagascan genus Gossypioides and the Hawaiian endemic genus Kokia. Following its likely origin 5 to 10 mya, Gossypium split into three major diploid lineages: the New World clade (D genome); the African–Asian clade (A, B, E, and F genomes); and the Australian clade (C, G, and K genomes). This global diversification involved multiple transoceanic dispersal events and was accompanied by morphological, ecological, and chromosomal differentiation (genome sizes shown). Interspecific hybridization has been exceptionally common during the evolution of the genus. Allopolyploid cottons formed following transoceanic dispersal of an A genome diploid to the Americas, where the new immigrant underwent hybridization, as female, with a native D genome diploid similar to modern G. raimondii. Polyploid cotton probably originated during the Pleistocene (1–2 mya), with the modern species representing the descendants of an early and rapid colonization of the New World tropics and subtropics. America to the Galapagos Islands (G. darwinii), from Africa to the Cape Verde Islands (G. capitis-viridis), and from the neotropics to the Hawaiian Islands (G. tomentosum). What is so striking about this history is not just its biographically fantastical nature, but also the lack of obvious adaptations for saltwater dispersal 14 Wendel & Grover (Stephens, 1958, 1966). Not only that, but the life history bauplan for Gossypium is of arid-zone adapted plants that one might assume at first blush do not possess any capacity for overseas journeys. This aspect of the natural history of the cotton genus represents a fascinating conundrum, one that lends itself more to speculation than to experimentation. Yet it also highlights the potential evolutionary significance of the seemingly most improbable and rare events. Given the absence of obvious adaptations for water dispersal, what can we say about the evolutionary significance of the single-celled epidermal trichomes for which cotton is justifiably so famous? The seeds and their coverings are extraordinarily diverse in Gossypium, as illustrated in Fig. 1, which provides a visual comparison of the various lineages of wild species and domesticated cotton species. Even though wild and cultivated cottons both produce fiber on the seed coat, there are striking morphological and structural differences between these fibers, the most obvious of which is their size. In contrast to the canonical (domesticated) cotton fiber, some D genome species (G. thurberi, G. trilobum, G. davidsonii, and G. klotzschianum) do not possess obvious seed hairs; however, the presence of these developmentally repressed structures is apparent under microscopy (Applequist et al., 2001). Similarly, while the three D genome species of subsection Caducibracteolata appear hairless to the unaided eye, these species do in fact have seed hairs that are simply tightly appressed to the seed. Cultivated lint fiber is a model for cell wall developmental biology, as it is single cell of almost pure cellulose that experiences remarkable elongation during development in cultivated species, achieving a final length up to 6 cm (Arpat et al., 2004; Haigler et al., 2012; Kim and Triplett, 2001). In contrast, the wild-type fiber cell is composed mostly of a combination of cellulose and suberin, which elongates to <1 cm (Applequist et al., 2001; Haigler et al., 2009; Ryser and Holloway, 1985). These and many other differences reflect both natural evolutionary processes as well as human-mediated selection during domestication. The duration of the elongation phase and the timing of onset of secondary wall synthesis appear to be key determinants of the final length of the fiber in both wild and cultivated plants (Applequis et al., 2001). Phylogenetic analysis of growth rates has shown that the evolutionary innovation of prolonged elongation arose in the F genome/A genome lineage, which may have facilitated the original domestication of the A genome cottons. This trait of prolonged elongation was passed on to the allopolyploids, which in turn was a key component of their eventual domestication. Relatively little is known about the developmental and genetic underpinnings of the diverse morphologies illustrated in Fig. 2, but an improved understanding of the changes that occurred during evolution and domestication may have implications for crop improvement (Hovav et al., 2007, 2008; Hu et al., 2013; Rapp et al., 2010; Yoo and Wendel, 2014). In nature, seed dispersal in Gossypium often follows a “shaker” model, where erect, mature capsules dehisce along the sutures and the seeds are distributed near the parental plant as wind shakes the branches. This likely is the ancestral method of seed dispersal in the genus, as it occurs in all species of the B, C, E, F, and (in part) D genome groups and in one species (G. bickii) of the G genome group. The A, AD, (some) D, G, and K genome species have evolved other mechanisms of seed dispersal, with perhaps the most noteworthy innovation being the fat bodies to facilitate ant dispersal on the seeds of the K genome species (Seelanan et al., 1999). The development of spinnable fibers apparently has occurred Taxonomy and Evolution of the Cotton Genus, Gossypium 15 only once in the history of Gossypium, in the ancestor of the two A genome species that became the progenitor of the A genome of the tetraploids. The evolutionary “purpose” of epidermal seed hairs, or trichomes, is a matter of speculation. Fryxell (1979) suggested that elongated fibers aid dispersal by birds, a hypothesis that gains credibility from sporadic mention of cotton seeds in birds’ nests as well as a collection of G. darwinii from a finch’s nest in the Galapagos Islands (Wendel, unpublished data, 1990). One might also speculate that fibers serve to inhibit germination unless there is sufficient moisture to saturate the fibers; should germination occur following a light rain, there might not be sufficient water for subsequent survival of the seedling. In this respect the waxy coating of the fibers would repel water, to a point, and thus prevent premature germination. A related possibility is that seed hairs function as “biological incubators” to facilitate germination only when ecological conditions are appropriate, by recruiting particular microbial communities under appropriate moisture regimes. Finally, two G genome species (G. australe, G. nelsonii) have evolved stiff straight seed hairs that facilitate wind dispersal in that the stiffening hairs function to extrude seeds from the locules of the dehisced capsule. It is against this natural history backdrop that ancient peoples discovered that the unique properties of cotton fibers made them useful for ropes, textiles, and other applications. This history has involved several cultures on different continents and a parallel plant improvement process starting from divergent and geographically isolated wild ancestors. Each of these crop species (G. hirsutum, G. barbadense, G. arboreum, G. herbaceum) has its own history of domestication, diversification, and utilization (Brubaker et al., 1999; Hutchinson, 1951, 1959; Hutchinson et al., 1947; Lubbers and Chee, 2009; Percival et al., 1999; Percy, 2009; Percy and Wendel, 1990; Tyagi et al., 2014; Ulloa et al., 2013; Wendel et al., 1992, 2009, 2010). Cotton domestication, thus, has been a replicated, and in many respects parallel experiments for the four domesticated species. Many papers describe various aspects of patterns of genetic diversity, the shape and severity of genetic bottlenecks that accompanied the development of landraces and cultivars, and the influence of recent human history on geographic patterns of cultivation (Brubaker et al., 1999; Brubaker and Wendel, 1994, 2001; Fang et al., 2013; Hutchinson, 1951, 1954, 1959; Hutchinson et al., 1947; Percy and Wendel, 1990; Tyagi et al., 2014; Wendel et al., 1989, 1992). It is worth reiterating that this history involved human shaping and molding of naturally occurring diversity that originated through the process of evolutionary diversification over a period of millions of years, a legacy we continue to exploit today through deliberate introgression of exotic germplasm from diverse, wild gene pools (Mehetre, 2010; Stelly, 2014; Wu et al., 2010; Chapala et al., 2012; He et al., 2011a, 2011b; McCarty and Percy, 2001; Nacoulima et al., 2012; Niles and Feaster, 1984; Zhang et al., 2011, 2014). Moreover, new species of cotton and previously unrecognized taxonomic diversity continue to be discovered today (Alvarez and Wendel, 2006; Feng et al., 2011; Grover et al., 2014; Krapovickas and Seijo, 2008; Stewart et al., 2015; Ulloa et al., 2006), a remarkable realization when one considers that Gossypium ranks among our most important and hence frequently studied plant genera. The wild species of cotton, consequently, represent an ample genetic repository for potential exploitation by cotton breeders. The status of much of this diversity in the world’s germplasm collections was recently reviewed (Campbell et al., 2010). Given current threats to 16 Wendel & Grover global ecosystems, it has never been more critical than it is today to recognize this legacy of speciation and diversification for humankind. Acknowledgments Research in the Wendel lab has largely been funded by the NSF Plant Genome Program, with additional support from other NSF programs, the USDA NRI, and Cotton Incorporated. References Alvarez, I., R. Cronn, and J.F. Wendel. 2005. Phylogeny of the New World diploid cottons (Gossypium L., Malvaceae) based on sequences of three low-copy nuclear genes. Plant Syst. Evol. 252:199–214. doi:10.1007/s00606-004-0294-0 Alvarez, I., and J.F. Wendel. 2006. Cryptic interspecific introgression and genetic differentiation within Gossypium aridum (Malvaceae) and its relatives. Evolution 60:505–517. doi:10.1111/j.0014-3820.2006.tb01132.x Applequist, W.L., R.C. Cronn, and J.F. Wendel. 2001. Comparative development of fiber in wild and cultivated cotton. Evol. Dev. 3:3–17. doi:10.1046/j.1525-142x.2001.00079.x Arpat, A., M. Waugh, J.P. Sullivan, M. Gonzales, D. Frisch, D. Main, et al. 2004. Functional genomics of cell elongation in developing cotton fibers. Plant Mol. Biol. 54:911–929. doi:10.1007/s11103-004-0392-y Baudoin, J.-P., A. D’Hont, and G. Mergeai. 2009. Bridging classical and molecular cytogenetics of Gossypium. In: A.H. Paterson, editor, Genetics and genomics of cotton. Springer, New York. p. 257–281. Beasley, J.O. 1941. Hybridization, cytology, and polyploidy of Gossypium. Chron. Bot. 6:394–395. Brubaker, C.L., F.M. Bourland, and J.F. Wendel. 1999. The origin and domestication of cotton. In: C.W. Smith and J.T. Cothren, editors, Cotton; origin, history, technology and production. John Wiley & Sons, New York. p. 3–31. Brubaker, C.L., and J.F. Wendel. 1993. On the specific status of Gossypium lanceolatum Todaro. Genet. Resour. Crop Evol. 40:165–170. doi:10.1007/BF00051121 Brubaker, C.L., and J.F. Wendel. 1994. Reevaluating the origin of domesticated cotton (Gossypium hirsutum; Malvaceae) using nuclear restriction fragment length polymorphisms (RFLPs). Am. J. Bot. 81:1309–1326. doi:10.2307/2445407 Brubaker, C.L., and J.F. Wendel. 2001. RFLP diversity in cotton. In: J.N. Jenkins and S. Saha, editors, Genetic improvement of cotton: Emerging technologies. Science Publishers, Inc., Enfield, NH. p. 81–102. Campbell, B., S. Saha, R. Percy, J. Frelichowski, J. Jenkins, W. Park, et al. 2010. Status of the global cotton germplasm resources. Crop Sci. 50:1161–1179. doi:10.2135/ cropsci2009.09.0551 Chapala, M.M., D.B. Weaver, B.T. Campbell, E. van Santan, and R.R. Sharpe. 2012. Exotic germplasm introgression effect on agronomic and fiber properties of Upland cotton. In: Proceedings of the National Cotton Council Beltwide Cotton Conference, NCC, Orlando, FL. 3–6 Jan. 2012. NCC, Memphis, TN. p. 803–807. Cronn, R.C., R.L. Small, T. Haselkorn, and J.F. Wendel. 2002. Rapid diversification of the cotton genus (Gossypium: Malvaceae) revealed by analysis of sixteen nuclear and chloroplast genes. Am. J. Bot. 89:707–725. doi:10.3732/ajb.89.4.707 Cronn, R.C., R.L. Small, T. Haselkorn, and J.F. Wendel. 2003. Cryptic repeated genomic recombination during speciation in Gossypium gossypioides. Evolution 57:2475–2489. doi:10.1111/j.0014-3820.2003.tb01493.x Cronn, R., and J.F. Wendel. 2004. Cryptic trysts, genomic mergers, and plant speciation. New Phytol. 161:133–142. doi:10.1111/j.1469-8137.2004.00947.x d’Eeckenbrugge, G.C., and J.M. Lacape. 2014. Distribution and differentiation of wild, feral, and cultivated populations of perennial Upland cotton (Gossypium hirsutum L.) in mesoamerica and the Caribbean. PLoS ONE 9:e107458. doi:10.1371/journal. pone.0107458 Taxonomy and Evolution of the Cotton Genus, Gossypium 17 DeJoode, D.R., and J.F. Wendel. 1992. Genetic diversity and origin of the Hawaiian Islands cotton, Gossypium tomentosum. Am. J. Bot. 79:1311–1319. doi:10.2307/2445059 de Menezes, I.P.P., F.A. Gaiotto, L.V. Hoffmann, A.Y. Ciampi, and P.A.V. Barroso. 2014. Genetic diversity and structure of natural populations of Gossypium mustelinum, a wild relative of cotton, in the basin of the De Contas River in Bahia, Brazil. Genetica (The Hague) 142:99–108. Endrizzi, J.E., E.L. Turcotte, and R.J. Kohel. 1985. Genetics, cytology, and evolution of Gossypium. Adv. Genet. 23:271–375. doi:10.1016/S0065-2660(08)60515-5 Fang, D.D., L.L. Hinze, R.G. Percy, P. Li, D. Deng, and G. Thyssen. 2013. A microsatellite-based genome-wide analysis of genetic diversity and linkage disequilibrium in Upland cotton (Gossypium hirsutum L.) cultivars from major cotton-growing countries. Euphytica 191:391–401. doi:10.1007/s10681-013-0886-2 Feng, C., M. Ulloa, C. Perez-M, and J.M. Stewart. 2011. Distribution and molecular diversity of arborescent Gossypium species. Botany 89:615–624. doi:10.1139/b11-042 Flagel, L.E., J.F. Wendel, and J.A. Udall. 2012. Duplicate gene evolution, homoeologous recombination, and transcriptome characterization in allopolyploid cotton. BMC Genomics 13:302. doi:10.1186/1471-2164-13-302 Fryxell, P.A. 1965. Stages in the evolution of Gossypium. Adv. Frontiers Plant Sci. 10:31–56. Fryxell, P.A. 1968. A redefinition of the tribe Gossypieae. Bot. Gaz. 129:296–308. doi:10.1086/336448 Fryxell, P.A. 1971. Phenetic analysis and the phylogeny of the diploid species of Gossypium L. (Malvaceae). Evolution 25:554–562. doi:10.2307/2407355 Fryxell, P.A. 1976. A nomenclator of Gossypium: The botanical names of cotton. USDA Agricultural Research Service, Washington DC. Fryxell, P.A. 1979 The natural history of the cotton tribe Texas A&M Univ. Press, College Station, TX. Fryxell, P.A. 1992. A revised taxonomic interpretation of Gossypium L. (Malvaceae). Rheedea 2:108–165. Galau, G.A., and T.A. Wilkins. 1989. Alloplasmic male sterility in AD allotetraploid Gossypium hirsutum upon replacement of its resident A cytoplasm with that of D species G. harknessii. Theor. Appl. Genet. 78:23–30. doi:10.1007/BF00299748 Grover, C.E., K.K. Grupp, R.J. Wanzek, and J.F. Wendel. 2012. Assessing the monophyly of polyploid Gossypium species. Plant Syst. Evol. 298:1177–1183. doi:10.1007/ s00606-012-0615-7 Grover, C.E., X. Zhu, K. Grupp, J. Jareczek, J.P. Gallagher, E. Szadkowski, et al. 2014. Molecular confirmation of species status for the allopolyploid cotton species, Gossypium ekmanianum. Genet. Resour. Crop Evol. doi:10.1007/s10722-014-0138-x (in press) Haigler, C.H., L. Betancur, M.R. Stiff, and J.R. Tuttle. 2012. Cotton fiber: A powerful singlecell model for cell wall and cellulose research. Front. Plant Sci. 3:104. Haigler, C.H., B. Singh, G. Wang, and D. Zhang. 2009. Genomics of cotton fiber secondary wall deposition and cellulose biogenesis. In: A.H. Paterson, editor, Genetics and genomics of cotton. Springer, New York. p. 385–417. Hawkins, J.S., H. Kim, J.D. Nason, R.A. Wing, and J.F. Wendel. 2006. Differential lineage-specific amplification of transposable elements is responsible for genome size variation in Gossypium. Genome Res. 16:1252–1261. doi:10.1101/gr.5282906 Hawkins, J.S., J. Pleasants, and J.F. Wendel. 2005. Identification of AFLP markers that discriminate between cultivated cotton and the Hawaiian island endemic, Gossypium tomentosum. Genet. Resour. Crop Evol. 52:1069–1078. doi:10.1007/s10722-004-6115-z Hawkins, J.S., S.R. Proulx, R.A. Rapp, and J.F. Wendel. 2009. Rapid DNA loss as a counterbalance to genome expansion through retrotransposon proliferation in plants. Proc. Natl. Acad. Sci. USA 106:17811–17816. doi:10.1073/pnas.0904339106 He, S., J. Sun, and X. Du. 2011. Genetic and epigenetic status of triple exotic consanguinity cotton introgression lines. Genet. Mol. Res. 10:4063–4072. doi:10.4238/2011.October.3.1 18 Wendel & Grover He, S.-P., J.-L. Sun, C. Zhang, and X.-M. Du. 2011. Identification of exotic genetic components and DNA methylation pattern analysis of three cotton introgression lines from Gossypium bickii. Mol. Biol. 45:204–210. doi:10.1134/S002689331102018X Hendrix, B., and J.M. Stewart. 2005. Estimation of the nuclear DNA content of Gossypium species. Ann. Bot. (Lond.) 95:789–797. doi:10.1093/aob/mci078 Hovav, R., J.A. Udall, B. Chaudhary, E. Hovav, L. Flagel, G. Hu, et al. 2008. The evolution of spinnable cotton fiber entailed prolonged development and a novel metabolism. PLoS Genet. 4:E25. doi:10.1371/journal.pgen.0040025 Hovav, R., J. Udall, E. Hovav, R. Rapp, L. Flagel, and J. Wendel. 2007. A majority of cotton genes are expressed in single-celled fiber. Planta 227:319–329. doi:10.1007/ s00425-007-0619-7 Hu, G., J. Koh, M.J. Yoo, K. Grupp, S. Chen, and J.F. Wendel. 2013. Proteomic profiling of developing cotton fibers from wild and domesticated Gossypium barbadense. New Phytol. 200:570–582. doi:10.1111/nph.12381 Hutchinson, J.B. 1951. Intra-specific differentiation in Gossypium hirsutum. Heredity 5:161– 193. doi:10.1038/hdy.1951.19 Hutchinson, J.B. 1954. New evidence on the origin of the Old World cottons. Heredity 8:225–241. doi:10.1038/hdy.1954.20 Hutchinson, J.B. 1959. The application of genetics to cotton improvement. Cambridge Univ. Press, Cambridge, UK. Hutchinson, J.B., R.A. Silow, and S.G. Stephens. 1947. The evolution of Gossypium and the differentiation of the cultivated cottons.Oxford Univ. Press, London. Kim, H.J., and B.A. Triplett. 2001. Cotton fiber growth in planta and in vitro. Models for plant cell elongation and cell wall biogenesis. Plant Physiol. 127:1361–1366. doi:10.1104/pp.010724 Krapovickas, A., and G. Seijo. 2008. Gossypium ekmanianum (Malvaceae), algodon silvestre de la Republica Dominicana. Bonplandia 17:55–63. Li, P., Z. Li, H. Liu, and J. Hua. 2014. Cytoplasmic diversity of the cotton genus as revealed by chloroplast microsatellite markers. Genet. Resour. Crop Evol. 61:107–119. doi:10.1007/ s10722-013-0018-9 Liu, Q., C.L. Brubaker, A.G. Green, D.R. Marshall, P.J. Sharp, and S.P. Singh. 2001. Evolution of the FAD2-1 fatty acid desaturase 5’ UTR intron and the molecular systematics of Gossypium (Malvaceae). Am. J. Bot. 88:92–102. doi:10.2307/2657130 Lubbers, E.L., and P.W. Chee. 2009. The worldwide gene pool of G. hirsutum and its improvement. In: A.H. Paterson, editor, Genetics and genomics of cotton. Springer, New York. p. 23–52. McCarty, J.C., and R.G. Percy. 2001. Genes from exotic germplasm and their use in cultivar improvement in Gossypium hirsutum L. and G. barbadense L. In: J.N. Jenkins and S. Saha, editors, Genetic improvement of cotton: Emerging technologies. Science Publishers, Inc., Enfield, NH. p. 65–80. Mehetre, S.S. 2010. Wild Gossypium anomalum: A unique source of fibre fineness and strength. Curr. Sci. 99:58–71. Nacoulima, N.L., J.-P. Baudoin, and G. Mergeai. 2012. Introgression of improved fiber fineness trait in G. hirsutum L. from G. longicalyx Hutch. & Lee. Commun. Agric. Appl. Biol. Sci. 77(1):207–211 Niles, G.A., and C.V. Feaster. 1984. Breeding. In: R.J. Kohel and C.E. Lewis, editors, Cotton. ASA, Madison, WI. p. 201–231. Paterson, A.H., editor. 2009. Genetics and genomics of cotton. Springer, New York. Percival, A.E., J.M. Stewart, and J.F. Wendel. 1999. Taxonomy and germplasm resources. In: C.W. Smith and J.T. Cothren, editors, Cotton; origin, history, technology and production. John Wiley and Sons, New York. p. 33–63. Percy, R.G. 2009. The worldwide gene pool of Gossypium barbadense L. and its improvement. In: A.H. Paterson, editor, Genetics and genomics of cotton. Springer, New York. p. 53–68. Percy, R.G., and J.F. Wendel. 1990. Allozyme evidence for the origin and diversification of Gossypium barbadense L. Theor. Appl. Genet. 79:529–542. doi:10.1007/BF00226164 Taxonomy and Evolution of the Cotton Genus, Gossypium 19 Phillips, L.L. 1966. The cytology and phylogenetics of the diploid species of Gossypium. Am. J. Bot. 53:328–335. doi:10.2307/2439872 Phuphathanaphong, L. 2006. Thepparatia (Malvaceae), a new genus from Thailand. Thai For. Bull. (Bot.) 34:195–200. Pleasants, J.M., and J.F. Wendel. 2010. Reproductive and pollination biology of the endemic Hawaiian cotton, Gossypium tomentosum (Malvaceae). Pac. Sci. 64:45–55. doi:10.2984/64.1.045 Rapp, R., C. Haigler, L. Flagel, R. Hovav, J. Udall, and J. Wendel. 2010. Gene expression in developing fibres of Upland cotton (Gossypium hirsutum L.) was massively altered by domestication. BMC Biol. 8:139. doi:10.1186/1741-7007-8-139 Rapp, R.A., and J.F. Wendel. 2005. Epigenetics and plant evolution. New Phytol. 168:81–91. doi:10.1111/j.1469-8137.2005.01491.x Ryser, U., and P. Holloway. 1985. Ultrastructure and chemistry of soluble and polymeric lipids in cell walls from seed coats and fibres of Gossypium species. Planta 163:151–163. doi:10.1007/BF00393501 Saunders, J.H. 1961. The wild species of Gossypium and their evolutionary history.Oxford Univ. Press, London. Seelanan, T., C.L. Brubaker, J.M. Stewart, L.A. Craven, and J.F. Wendel. 1999. Molecular systematics of Australian Gossypium section Grandicalyx (Malvaceae). Syst. Bot. 24:183– 208. doi:10.2307/2419548 Seelanan, T., A. Schnabel, and J.F. Wendel. 1997. Congruence and consensus in the cotton tribe (Malvaceae). Syst. Bot. 22:259–290. doi:10.2307/2419457 Senchina, D.S., I. Alvarez, R.C. Cronn, B. Liu, J.K. Rong, R.D. Noyes, et al. 2003. Rate variation among nuclear genes and the age of polyploidy in Gossypium. Mol. Biol. Evol. 20:633–643. doi:10.1093/molbev/msg065 Small, R.L., and J.F. Wendel. 1999. The mitochondrial genome of allotetraploid cotton (Gossypium L.). J. Hered. 90:251–253. doi:10.1093/jhered/90.1.251 Stelly, D. 2014. Facilitating the use of wild species germplasm for cotton improvement via CS lines. In: Proceedings of the National Cotton Council Beltwide Cotton Conference, NCC, New Orleans, LA. 6–8 Jan. 2014. NCC, Memphis, TN. Stephens, S.G. 1958. Salt water tolerance of seeds of Gossypium species as a possible factor in seed dispersal. Am. Nat. 92:83–92. doi:10.1086/282014 Stephens, S.G. 1963. Polynesian cottons. Ann. Mo. Bot. Gard. 50:1–22. doi:10.2307/2394914 Stephens, S.G. 1966. The potential for long range oceanic dispersal of cotton seeds. Am. Nat. 100:199–210. doi:10.1086/282413 Stewart, J.M. 1995. Potential for crop improvement with exotic germplasm and genetic engineering. In: G.A. Constable and N.W. Forrester, editors, Challenging the future: Proceedings of the World Cotton Research Conference, Brisbane Australia. 14–17 Feb. 1994. CSIRO, Melbourne, Australia. p. 313–327. Stewart, J.M., L.A. Craven, C.L. Brubaker, and J.F. Wendel. 2015. Gossypium anapoides (Malvaceae), a new species from Western Australia. Novon 23(4):447–451. doi:10.3417/2007140 Stewart, J.M., D. Oosterhuis, J.J. Heithholt, and J.R. Mauney. 2010. Physiology of cotton. Springer, Dordrecht, The Netherlands. Tiwari, R.S., J. Zhang, and J.M. Stewart. 2014. Molecular diversity based on AFLP markers and possible natural hybridization among the Australian arid zone Gossypium species. Aust. J. Crop Sci. 8:674–679. Tyagi, P., M.A. Gore, D.T. Bowman, B.T. Campbell, J.A. Udall, and V. Kuraparthy. 2014. Genetic diversity and population structure in the US Upland cotton (Gossypium hirsutum L.). Theor. Appl. Genet. 127:283–295. doi:10.1007/s00122-013-2217-3 Ulloa, M., I.Y. Abdurakhmonov, C. Perez-M, R. Percy, and J.M. Stewart. 2013. Genetic diversity and population structure of cotton (Gossypium spp.) of the New World assessed by SSR markers. Botany 91:251–259. doi:10.1139/cjb-2012-0192 Ulloa, M., J.M. Stewart, E.A. Garcia-C, S. Godoy-A, A. Gaytan-M, and N.S. Acosta. 2006. Cotton genetic resources in the western states of Mexico: In situ conservation status 20 Wendel & Grover and germplasm collection for ex situ preservation. Genet. Resour. Crop Evol. 53:653– 668. doi:10.1007/s10722-004-2988-0 Watt, G. 1907. The wild and cultivated cotton plants of the world. Longmans, Green and Co., London. Wendel, J.F. 1989. New World tetraploid cottons contain Old World cytoplasm. Proc. Natl. Acad. Sci. USA 86:4132–4136. doi:10.1073/pnas.86.11.4132 Wendel, J.F., and V.A. Albert. 1992. Phylogenetics of the cotton genus (Gossypium L.): Character-state weighted parsimony analysis of chloroplast DNA restriction site data and its systematic and biogeographic implications. Syst. Bot. 17:115–143. doi:10.2307/2419069 Wendel, J.F., C.L. Brubaker, J.P. Alvarez, R.C. Cronn, and J.M. Stewart. 2009. Evolution and natural history of the cotton genus. In: A.H. Paterson, editor, Genetics and genomics of cotton. Springer, New York. p. 3–22. Wendel, J.F., C.L. Brubaker, and A.E. Percival. 1992. Genetic diversity in Gossypium hirsutum and the origin of Upland cotton. Am. J. Bot. 79:1291–1310. doi:10.2307/2445058 Wendel, J.F., C.L. Brubaker, and T. Seelanan. 2010. The origin and evolution of Gossypium. In: J.M. Stewart, D. Oosterhuis, J.J. Heithholt, and J.R. Mauney, editors, Physiology of cotton. Springer, Dordrecht, The Netherlands. p. 1–18. Wendel, J.F., and R.C. Cronn. 2003. Polyploidy and the evolutionary history of cotton. Adv. Agron. 78:139–186. doi:10.1016/S0065-2113(02)78004-8 Wendel, J.F., L.E. Flagel, and K.L. Adams. 2012. Jeans, genes, and genomes: Cotton as a model for studying polyploidy. In: P.S. Soltis and D.E. Soltis, editors, Polyploidy and genome evolution. Springer-Verlag, Heidelberg. p. 181–207. Wendel, J.F., P.D. Olson, and J.M. Stewart. 1989. Genetic diversity, introgression and independent domestication of Old World cultivated cottons. Am. J. Bot. 76:1795–1806. doi:10.2307/2444478 Wendel, J.F., and A.E. Percival. 1990. Molecular divergence in the Galapagos Island–Baja California species pair, Gossypium klotzschianum and G. davidsonii (Malvaceae). Plant Syst. Evol. 171:99–115. doi:10.1007/BF00940598 Wendel, J.F., and R.G. Percy. 1990. Allozyme diversity and introgression in the Galapagos Islands endemic Gossypium darwinii and its relationship to continental G. barbadense . Biochem. Syst. Ecol. 18:517–528. doi:10.1016/0305-1978(90)90123-W Wendel, J.F., R. Rowley, and J.M. Stewart. 1994. Genetic diversity in and phylogenetic relationships of the Brazilian endemic cotton, Gossypium mustelinum (Malvaceae). Plant Syst. Evol. 192:49–59. doi:10.1007/BF00985907 Wendel, J.F., A. Schnabel, and T. Seelanan. 1995. An unusual ribosomal DNA sequence from Gossypium gossypioides reveals ancient, cryptic, intergenomic introgression. Mol. Phylogenet. Evol.. 4:298–313. doi:10.1006/mpev.1995.1027 Wendel, J.F., J.M. Stewart, and J.H. Rettig. 1991. Molecular evidence for homoploid reticulate evolution in Australian species of Gossypium. Evolution 45:694–711. doi:10.2307/2409921 Wu, J., J. McCarty, J. Jenkins, and W. Meredith. 2010. Breeding potential of introgressions into Upland cotton: Genetic effects and heterosis. Plant Breed. 129:526–532. Xu, Q., G. Xiong, P. Li, F. He, Y. Huang, K. Wang, et al. 2012. Analysis of complete nucleotide sequences of 12 Gossypium chloroplast genomes: Origin and evolution of allotetraploids. PLoS ONE 7:E37128. doi:10.1371/journal.pone.0037128 Yoo, M.-J., and J.F. Wendel. 2014. Comparative evolutionary and developmental dynamics of the cotton (Gossypium hirsutum) fiber transcriptome. PLoS Genet. 10:E1004073. doi:10.1371/journal.pgen.1004073 Yu-xiang, W., C. Jin-hong, H. Qiu-ling, and Z. Shui-jin. 2013. Parental origin and genomic evolution of tetraploid Gossypium species by molecular marker and GISH analyses. Caryologia 66:368–374. doi:10.1080/00087114.2013.857830 Zhang, J., R.G. Percy, and J.C. McCarty, Jr. 2014. Introgression genetics and breeding between Upland and Pima cotton: A review. Euphytica 198:1–12. doi:10.1007/s10681-014-1094-4 Zhang, Z., J. Rong, V.N. Waghmare, P.W. Chee, O.L. May, R.J. Wright, et al. 2011. QTL alleles for improved fiber quality from a wild Hawaiian cotton, Gossypium tomentosum. Theor. Appl. Genet. 123:1075–1088. doi:10.1007/s00122-011-1649-x