* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Gene duplication and divergence in the early evolution of
Long non-coding RNA wikipedia , lookup
Essential gene wikipedia , lookup
Population genetics wikipedia , lookup
Polycomb Group Proteins and Cancer wikipedia , lookup
Genetic engineering wikipedia , lookup
Quantitative trait locus wikipedia , lookup
Epigenetics of neurodegenerative diseases wikipedia , lookup
Segmental Duplication on the Human Y Chromosome wikipedia , lookup
Vectors in gene therapy wikipedia , lookup
Gene therapy of the human retina wikipedia , lookup
Adaptive evolution in the human genome wikipedia , lookup
Copy-number variation wikipedia , lookup
Transposable element wikipedia , lookup
Gene therapy wikipedia , lookup
Pathogenomics wikipedia , lookup
Public health genomics wikipedia , lookup
Epigenetics of diabetes Type 2 wikipedia , lookup
Biology and consumer behaviour wikipedia , lookup
History of genetic engineering wikipedia , lookup
Gene nomenclature wikipedia , lookup
Genomic imprinting wikipedia , lookup
Ridge (biology) wikipedia , lookup
Gene desert wikipedia , lookup
Therapeutic gene modulation wikipedia , lookup
Minimal genome wikipedia , lookup
Nutriepigenomics wikipedia , lookup
Epigenetics of human development wikipedia , lookup
The Selfish Gene wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
Genome (book) wikipedia , lookup
Helitron (biology) wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Gene expression programming wikipedia , lookup
Gene expression profiling wikipedia , lookup
Designer baby wikipedia , lookup
393 Gene duplication and divergence in the early evolution of vertebrates Francoise Mazet* and Sebastian M Shimeld† The duplication–degeneration–complementation model of duplicate gene preservation by subfunctionalisation is currently the best explanation for the high level of retention of duplicate genes in early vertebrate evolution. But a direct test of the applicability of this model to such ancient evolutionary events may be difficult. More likely, recent duplications in other lineages will allow us to establish general principles concerning the fate of genes of different types that are duplicated in different ways. These principles may be then extrapolated to understanding the early evolution of the vertebrates. Addresses School of Animal and Microbial Sciences, University of Reading, Whiteknights, Reading RG6 6AJ, UK *e-mail: [email protected] † e-mail: [email protected] Current Opinion in Genetics & Development 2002, 12:393–396 0959-437X/02/$ — see front matter © 2002 Elsevier Science Ltd. All rights reserved. Abbreviation DDC duplication–degeneration–complementation Introduction Few authors doubt the prevalence of gene duplication in the evolutionary history of the genomes of organisms of all levels of biological organisation; the evidence, in the form of multigene families in the genomes of individual organisms, is overwhelming. But the importance of gene duplication in evolutionary innovation, the degree and method of duplicate gene preservation, and even the method of gene duplication are all the subject of hotly contentious theories. In this review we summarise some of these theories. In particular, we focus on the evolution of the vertebrates and on the expansion during early vertebrate evolution of many of the gene families that were involved in vertebrate development. Maintenance of duplicated genes: why and how? In some special cases it is easy to identify an adaptive advantage for organisms that possess multiple copies of a gene, for example, the increased dosage of a protein that multiple copies provide can protect individuals against pesticides (e.g. see [1]). The importance of such short-term selective advantage should not be underestimated; however, a fundamental assumption in most instances has been that for both daughter genes of a duplication to be maintained by selection they must in some way diverge. In particular it has been proposed that duplication frees one copy from a requirement to maintain key functions, allowing it to evolve new functions [2]; such ‘neofunctionalisation’ could provide the genetic material for the evolution of new morphology — a process that could be summarised as gene duplication → increased genetic complexity → increased morphological complexity (Figure 1a). But as gene duplications take place within individuals within populations, first we must assume that the duplicated allele is neutral or immediately advantageous or it will be purged from the population; second, for the above model of neofunctionalisation to occur, a gene duplication must evolve new (i.e. divergent) functions and move to fixation in a population before one gene becomes a pseudogene. Selection does not recognise ‘future potential’, and consequently the classical model predicts a race between neofunctionalisation and pseudogene formation that should result in the elimination of most duplicated genes. Empirical evidence, however, does not support this scheme but suggests that there is a much higher retention of duplicate copies [3]. This seems to be a major paradox that was turned upside-down by the duplication–degeneration–complementation (DDC) model, which takes into account the multiple regulatory regions and the protein structure domains of multiple genes [4••,5,6••]. This model, in which functions are partitioned between daughter genes (a process known as subfunctionalisation), has the advantage of a firm foundation in population genetics, and predicts in such cases that selection will act to maintain both copies (Figure 1b). Consequently, the need for a mutation to occur in the gene and confer a selectively advantageous new phenotype is negated. At its extreme, the DCC model suggests that a multifunctional state, and therefore presumably some degree of complexity, must precede duplication rather than follow it. It should be noted, however, that although subfunctionalisation might be required for initial duplicate maintenance, it is not incompatible with subsequent neofunctionalisation (Figure 1c). Gene duplication during early vertebrate evolution Do vertebrates have more genes than other animal taxa? The number of predicted genes in the human genome, when compared to the number in the sequenced genomes of Drosophila and Caenorhabditis elegans, suggests that they do [7,8]. But such comparisons should be treated with caution as they assume that humans and Drosophila/C. elegans are representative of all vertebrates and invertebrates, respectively. More informative are comparisons made with lineages that diverged closer to the origin of vertebrates, such as ascidians, which do support this interpretation as do surveys of numerous individual gene families from amphioxus — the closest living invertebrate to the vertebrates [9•,10]. Genes that are involved in regulating development have been especially well studied in this context, and most show evidence of duplication that is specific to vertebrates (Table 1). This degree of duplication has been proposed to reflect genome duplication on the vertebrate stem — a theory that is supported by the widespread occurrence 394 Pattern formation and developmental mechanisms Figure 1 A (a) Degeneration A A A (b) Neofunctionalisation (c) Subfunctionalisation A AC A AB AB A AC AB B BD A AB Fates of duplicated genes. (a) The classical model of degeneration of one copy after duplication. (b) Neofunctionalisation, in which initially identical duplicates with function A diverge by acquiring new functions B and C. (c) Subfunctionalisation, in which duplicate genes with multiple functions A and B diverge by reciprocal loss. The extension indicates the possibility of subsequent neofunctionalisation by acquisition of further functions C and D. It is important to recognise that these models are not mutually exclusive. It is likely that all are correct to some degree and all occur to some extent, but there may be variations according to taxon, type of gene and type of duplication. The important question is does one model account for most of the preserved duplications seen generally in nature or in specific taxa? Current Opinion in Genetics & Development of paralogy in the human genome, but which has its detractors [11,12]. Gene duplication, divergence and vertebrate evolution Irrespective of the debate on the mode of duplication, it is clear that vertebrates have more members of most ‘developmental’ gene families than do their closest relatives. It is also clear that there is a correlation between the expansion Table 1 Expansion during early vertebrate evolution for a selection of gene families, and evidence for sub- or neofunctionalisation. Gene family Number of Number of Neofunctionalisation amphioxus vertebrate and/or genes genes subfunctionalisation References Wnt8 1 2 Sub [13] Wnt1 1 1 NA [26] Wnt5 1 1 NA [27] Wnt3 1 2 ? [27] Wnt7 1 2 ? [27] Otx 1 2/3 Neo [28] Hh 1 3 Neo [29] HNF-3 1 3 ? [30] Snail 1 3 Neo [31] Pax1/9 1 2 Neo [32] Hox 1 4 Neo [33,34] Pax3/7 1 2 Neo [35] Pitx 1 3 Sub + Neo [36] Dll 1 or 2 6 Neo [37] Sox1/2/3 1 3 Neo [14] Mnx 1 2 ? [38] En 1 2 Neo [39] Islet 1 2 Neo [40] A question mark indicates that the evidence for either is ambiguous. Please note this is our interpretation of the data and does not necessarily reflect the views of the authors of each publication. NA, not applicable. of gene families and the complexity of the central nervous system and other organ systems in vertebrates, at least with respect to early vertebrate evolution. Development of the vertebrate body plan requires the combinatorial action of many genes and, notably, many vertebrate-specific structures are marked by the expression of gene duplicates (Table 1). Is there evidence here for the direct linking of gene duplication with increasingly complex morphology? If so, does it support one of the above theories in preference to the others? Evidence for sub- and neofunctionalisation To establish sub- and neofunctionalisation that is specific to vertebrates, comparisons must be made between vertebrate genes and those in their closest relatives, most informatively in amphioxus. Many such studies have been carried out in recent years on several gene families. Some of these are listed in Table 1, together with our interpretation of the pattern of evolution that they display. Such comparisons reveal some indication of subfunctionalisation. A good example is the Wnt8 family, in which a single gene in amphioxus possesses several domains of expression that seem to have been subdivided between the two vertebrate Wnt8 paralogues [13•]. But clear-cut examples of subfunctionalisation are rare, with most gene families instead showing evidence of neofunctionalisation. A good example of this is the Sox1/Sox2/Sox3 family, which in amphioxus is expressed only in the nervous system — a site that we can deduce to be primitive by comparison with expression in Drosophila [14]. In vertebrates, all three Sox genes are expressed in the central nervous system and in some tissues that are presumed to be vertebrate-specific, such as the epibranchial placodes [15]. Other examples of such divergent multigene families Gene duplication and divergence in the early evolution of vertebrates Mazet and Shimeld in vertebrates are the Hedgehog, Engrailed, Pax1/Pax9 and Distalless families (Table 1). Can we take the predominance of new functions in these amphioxus/vertebrate gene family comparisons as evidence in favour of neofunctionalisation over subfunctionalisation in the early evolution of vertebrates, and consequently as falsification of the DDC model? Our answer is we cannot. First, with the available data we can only detect divergence on the basis of expression, and not on the basis of evolutionary change of protein function. The latter is hard to test experimentally, because it requires the ability to manipulate gene function in several taxa, although it may be predicted perhaps by comparing rates of evolution in different parts of the proteins of paralagous genes [16,17]. We also cannot rule out the possibility that neofunctionalisation followed subfunctionalisation. The evolutionary separation between extant vertebrate and invertebrate lineages is such that subtle subfunctionalisation may have become obscured by subsequent change. Although examining the relative rates of evolution of paralogues has been informative in the analysis of more recent duplications [6••,18•,19•], it is unlikely that we will be able to use this and more sophisticated molecular evolutionary analysis, such as tests for adaptive evolution, to examine informatively early vertebrate evolution, principally owing to the length of time of divergence. With this in mind, it may be impossible to test definitively which process was most significant in early vertebrate evolution. An alternative approach is to examine similar, more recent evolutionary events and extrapolate these findings to the origin of vertebrates. For example, some lineages of teleost fish have been found to possess duplicate copies of genes that are present in single copy in other vertebrate lineages and mapping data from zebrafish imply that this may have resulted from a genome duplication, thereby presenting an analogous situation to what occurred at the origin of vertebrates (though note this theory has its detractors) [20–22]. Comparison on a wide scale of the fates of these more recently created paralagous genes will help to establish general principles that can then be applied to vertebrate origins. It is notable that some recent studies provide strong evidence for subfunctionalisation of duplicate genes in fish [23,24], whereas others suggest the evolution of new functions [25]. It will be important to see which predominates but this will have to wait until many paralagous genes with a variety of functions have been studied in detail — a considerable undertaking. Conclusions Although many genes show evidence of duplication in early vertebrate evolution, experimental analyses of individual families suggests that there is a predominance of neofunctionalisation, with little evidence of subfunctionalisation. This may, however, be an artefact of the considerable length of time that vertebrates and their nearest relatives have been separated. Direct tests of 395 hypotheses concerning the action of selection on such ancient evolutionary events may never be possible, and consequently we must look to more recent duplications, such as those in teleost fish and amphibia, to establish general principles that can be extrapolated to the base of vertebrate evolution. Acknowledgements The authors would like to acknowledge the support of the BBSRC. References and recommended reading Papers of particular interest, published within the annual period of review, have been highlighted as: • of special interest •• of outstanding interest 1. Guillemaud T, Raymond M, Tsagkarakou A, Bernard C, Rochard P, Pasteur N: Quantitative variation and selection of esterase gene amplification in Culex pipiens. Heredity 1999, 83:87-99. 2. Ohno S: Evolution by Gene Duplication. New York: Springer-Verlag; 1970. 3. Hughes MK, Hughes AL: Evolution of duplicate genes in a tetraploid animal, Xenopus laevis. Mol Biol Evol 1993, 10:1360-1369. 4. •• Lynch M, Force A: The probability of duplicate gene preservation by subfunctionalisation. Genetics 2000, 154:459-473. This paper unites quantitative population genetics with predictions of duplicate gene preservation, and then correlates this with the partitioning of tissue-specific patterns of expression between the duplicate genes. 5. Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J: Preservation of duplicate genes by complementary, degenerative mutations. Genetics 1999, 151:1531-1545. 6. Lynch M, Conery JS: The evolutionary fate and consequences of •• duplicate genes. Science 2000, 290:1151-1155. The authors survey genomic datasets to identify duplicate genes and determine their rate of creation, and then use these data to examine rate variation and evidence for selection in paralagous pairs. They propose that, after duplication, differential subfunctionalisation in isolated populations will form a powerful selective barrier to interbreeding, leading to the origin of new species. 7. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W et al.: Initial sequencing and analysis of the human genome. Nature 2001, 409:860-921. 8. Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA et al.: The sequence of the human genome. Science 2001, 291:1304-1351. 9. Furlong RF, Holland PWH: Were vertebrates octaploid? Phil Trans R • Soc Lond B 2002, 357:531-544. Robust phylogenetic analysis of numerous gene families identified in both amphioxus and vertebrates leads to a model of vertebrate genome evolution in which homeologous crossover occurred between chromosomes in an octaploid ancestor early in vertebrate evolution. 10. Simmen MW, Leitgeb S, Clark VH, Jones SJM, Bird A: Gene number in an invertebrate chordate, Ciona intestinalis. Proc Natl Acad Sci USA 1998, 95:4437-4440. 11. Friedman R, Hughes AL: Pattern and timing of gene duplication in animal genomes. Genome Res 2001, 11:1842-1847. 12. Holland PW, Garcia-Fernàndez J, Williams NA, Sidow A: Gene duplications and the origins of vertebrate development. Development 1994, Suppl:125-133. 13. Schubert M, Holland LZ, Panopoulou G, Lehrach H, Holland ND: • Characterization of amphioxus AmphiWnt8: insights into the evolution of patterning of the embryonic dorsoventral axis. Evol Dev 2000, 2:85-92. Phylogenetic analysis shows that the amphioxus Wnt8 orthologue is positioned at the basis of the vertebrate Wnt8 family. The pattern of AmphiWnt8 expression in the brain, gut and the mesoderm is the sum of the expression patterns of the vertebrate Wnt8 genes. This is a clear example of 396 Pattern formation and developmental mechanisms subfunctionalisation after duplication when comparing vertebrate and amphioxus genes. 27. Schubert M, Holland LZ, Stokes MD, Holland ND: Three amphioxus Wnt genes (AmphiWnt3, AmphiWnt5, and AmphiWnt6) associated with the tail bud: the evolution of somitogenesis in chordates. Dev Biol 2001, 240:262-273. 14. Holland LZ, Schubert M, Holland ND, Neuman T: Evolutionary conservation of the presumptive neural plate markers AmphiSox1/2/3 and AmphiNeurogenin in the invertebrate chordate amphioxus. Dev Biol 2000, 226:18-33. 28. Williams NA, Holland PWH: Old head on young shoulders. Nature 1996, 383:490. 15. Ishii Y, Abu-Elmagd M, Scotting PJ: Sox3 expression defines a common primordium for the epibranchial placodes in chick. Dev Biol 2001, 236:344-353. 29. Shimeld SM: The evolution of the hedgehog gene family in chordates: Insights from amphioxus hedgehog. Dev Genes Evol 1999, 209:40-47. 16. Dermitzakis ET, Clark AG: Differential selection after duplication in mammalian developmental genes. Mol Biol Evol 2001, 18:557-562. 30. Shimeld SM: Characterisation of amphioxus HNF-3 genes: conserved expression in the notochord and floor plate. Dev Biol 1997, 183:74-85. 17. Massingham T, Davies LJ, Lio P: Analysing gene function after duplication. BioEssays 2001, 23:873-876. 18. Kondrashov FA, Rogozin IB, Wolf YI, Koonin EV: Selection in the • evolution of gene duplication. Genome Biol 2002, 3:0008.1-0008.9. The authors describe a two-stage evolutionary model of gene duplication that is based on surveys of paralagous gene pairs from completely sequenced genomes. The first stage is characterised by purifying selection and predicts that only immediately advantageous duplications will be retained. This is followed by the creation of new functions only later in evolution. 19. Van de Peer Y, Taylor JS, Braasch I, Meyer A: The ghost of selection • past: rates of evolution and functional divergence of anciently duplicated genes. J Mol Evol 2001, 53:436-446. The authors analyse paralagous gene pairs that resulted from duplications specific to the zebrafish lineage. Similar to Kondrashov et al. [18•], they find a significant degree of purifying selection in both paralogues and also identify some rate heterogeneity that suggests an increased rate of evolution for paralogues and possible positive selection. 20. Amores A, Force A, Yan Y, Joly L, Amemiya C, Fritz A, Ho R, Langeland J, Prince V, Wang YL et al.: Zebrafish hox clusters and vertebrate genome evolution. Science 1998, 282:1711-1714. 21. Taylor JS, Van de Peer Y, Braasch I, Meyer A: Comparative genomics provides evidence for an ancient genome duplication event in fish. Phil Trans R Soc Lond B 2001, 356:1661-1679. 22. Robinson-Rechavi M, Marchand O, Escriva H, Laudet V: An ancestral whole-genome duplication may not have been responsible for the abundance of duplicated fish genes. Curr Biol 2001, 11:R458-R459. 23. Bruce AE, Oates AC, Prince VE, Ho RK: Additional hox clusters in the zebrafish: divergent expression patterns belie equivalent activities of duplicate hoxB5 genes. Evol Dev 2001, 3:127-144 24. De Martino S, Yan Y-L, Jowett T, Postlethwaite JH, Varga ZM, Ashworth A, Austin CA: Expression of sox11 gene duplicates in zebrafish suggests the reciprocal loss of ancestral gene expression patterns in development. Dev Dyn 2000, 217:279-292. 25. McClintock JM, Carlson R, Mann DM, Prince VE: Consequences of Hox gene duplication in vertebrates: an investigation of zebrafish Hox paralogue group 1 genes. Development 2001, 128:2471-2484. 26. Schubert M, Holland LZ, Holland ND: Characterization of an amphioxus wnt gene, AmphiWnt11, with possible roles in myogenesis and tail outgrowth. Genesis 2000, 27:1-5. 31. Langeland JA, Tomsa JM, Jackman WR, Kimmel CB: An amphioxus snail gene: expression in paraxial mesoderm and neural plate sggests a conserved role in patterning the vertebrate embryo. Dev Genes Evol 1998, 208:569-577. 32. Holland N, Holland L, Kozmik Z: An amphioxus Pax gene, AmphiPax-1, expressed in embryonic endoderm, but not in mesoderm: implications for the evolution of class I paired box genes. Mol Mar Biol Biotechnol 1995, 4:206-214. 33. Holland PWH, Holland LZ, Williams NA, Holland ND: An amphioxus homeobox gene: sequence conservation, spatial expression during development and insights into vertebrate evolution. Development 1992, 116:653-661. 34. Wada H, Garcia-Fernàndez J, Holland PWH: Colinear and segmental expression of amphioxus Hox genes. Dev Biol 1999, 213:131-141. 35. Holland LZ, Schibert M, Kozmik Z, Holland ND: AmphiPax3/7, an amphioxus paired box gene: insights into chordate myogenesis, neurogenesis and the possible evolutionary precursor of definitive vertebrate neural crest. Evol Dev 1999, 1:153-165. 36. Yasui K, Zhang S, Uemura M, Saiga H: Left–right asymmetric expression of BbPtx, a Ptx-related gene, in a lancelet species and the developmental left-sidedness in deuterostomes. Development 2000, 127:187-195. 37. Holland ND, Panganiban G, Henyey EL, Holland LZ: Sequence and developmental expression of AmphiDll, an amphioxus Distalless gene transcribed in the ectoderm, epidermis and nervous system: insights into the evolution of craniate forebrain and neural crest. Development 1996, 122:2911-2920. 38. Ferrier DEK, Brooke NM, Panopoulou G, Holland PWH: The Mnx homeobox gene class defined by HB9, MNR2 and amphioxus AmphiMnx. Dev Genes Evol 2001, 211:103-107. 39. Holland LZ, Kene M, Williams NA, Holland ND: Sequence and embryonic expression of the amphioxus engrailed gene (AmphiEn): the metameric pattern of transcription resembles that of its segment-polarity homolog in Drosophila. Development 1997, 124:1723-1732. 40. Jackman WR, Langeland JA, Kimmel CB: islet reveals segmentation in the amphioxus hindbrain homolog. Dev Biol 2000, 220:16-26.