Download Gene duplication and divergence in the early evolution of

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Long non-coding RNA wikipedia , lookup

NEDD9 wikipedia , lookup

Essential gene wikipedia , lookup

Population genetics wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Genetic engineering wikipedia , lookup

Epistasis wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Segmental Duplication on the Human Y Chromosome wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

Adaptive evolution in the human genome wikipedia , lookup

Copy-number variation wikipedia , lookup

Transposable element wikipedia , lookup

Gene therapy wikipedia , lookup

Pathogenomics wikipedia , lookup

Public health genomics wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

History of genetic engineering wikipedia , lookup

Gene nomenclature wikipedia , lookup

Genomic imprinting wikipedia , lookup

Ridge (biology) wikipedia , lookup

Gene desert wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Minimal genome wikipedia , lookup

Nutriepigenomics wikipedia , lookup

RNA-Seq wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Gene wikipedia , lookup

The Selfish Gene wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Genome (book) wikipedia , lookup

Helitron (biology) wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Gene expression programming wikipedia , lookup

Gene expression profiling wikipedia , lookup

Designer baby wikipedia , lookup

Microevolution wikipedia , lookup

Genome evolution wikipedia , lookup

Transcript
393
Gene duplication and divergence in the early evolution of vertebrates
Francoise Mazet* and Sebastian M Shimeld†
The duplication–degeneration–complementation model of
duplicate gene preservation by subfunctionalisation is currently
the best explanation for the high level of retention of duplicate
genes in early vertebrate evolution. But a direct test of the
applicability of this model to such ancient evolutionary events
may be difficult. More likely, recent duplications in other
lineages will allow us to establish general principles concerning
the fate of genes of different types that are duplicated in
different ways. These principles may be then extrapolated to
understanding the early evolution of the vertebrates.
Addresses
School of Animal and Microbial Sciences, University of Reading,
Whiteknights, Reading RG6 6AJ, UK
*e-mail: [email protected]
† e-mail: [email protected]
Current Opinion in Genetics & Development 2002, 12:393–396
0959-437X/02/$ — see front matter
© 2002 Elsevier Science Ltd. All rights reserved.
Abbreviation
DDC
duplication–degeneration–complementation
Introduction
Few authors doubt the prevalence of gene duplication in
the evolutionary history of the genomes of organisms of all
levels of biological organisation; the evidence, in the form of
multigene families in the genomes of individual organisms,
is overwhelming. But the importance of gene duplication
in evolutionary innovation, the degree and method of
duplicate gene preservation, and even the method of gene
duplication are all the subject of hotly contentious theories.
In this review we summarise some of these theories. In particular, we focus on the evolution of the vertebrates and on the
expansion during early vertebrate evolution of many of the
gene families that were involved in vertebrate development.
Maintenance of duplicated genes: why and how?
In some special cases it is easy to identify an adaptive advantage for organisms that possess multiple copies of a gene, for
example, the increased dosage of a protein that multiple
copies provide can protect individuals against pesticides (e.g.
see [1]). The importance of such short-term selective advantage should not be underestimated; however, a fundamental
assumption in most instances has been that for both daughter
genes of a duplication to be maintained by selection they
must in some way diverge. In particular it has been proposed
that duplication frees one copy from a requirement to maintain key functions, allowing it to evolve new functions [2];
such ‘neofunctionalisation’ could provide the genetic material for the evolution of new morphology — a process that could
be summarised as gene duplication → increased genetic complexity → increased morphological complexity (Figure 1a).
But as gene duplications take place within individuals within
populations, first we must assume that the duplicated allele
is neutral or immediately advantageous or it will be purged
from the population; second, for the above model of neofunctionalisation to occur, a gene duplication must evolve
new (i.e. divergent) functions and move to fixation in a population before one gene becomes a pseudogene. Selection
does not recognise ‘future potential’, and consequently the
classical model predicts a race between neofunctionalisation and pseudogene formation that should result in the
elimination of most duplicated genes.
Empirical evidence, however, does not support this
scheme but suggests that there is a much higher retention
of duplicate copies [3]. This seems to be a major paradox
that was turned upside-down by the duplication–degeneration–complementation (DDC) model, which takes into
account the multiple regulatory regions and the protein
structure domains of multiple genes [4••,5,6••]. This
model, in which functions are partitioned between daughter
genes (a process known as subfunctionalisation), has the
advantage of a firm foundation in population genetics, and
predicts in such cases that selection will act to maintain
both copies (Figure 1b). Consequently, the need for a
mutation to occur in the gene and confer a selectively
advantageous new phenotype is negated. At its extreme, the
DCC model suggests that a multifunctional state, and therefore
presumably some degree of complexity, must precede
duplication rather than follow it. It should be noted, however,
that although subfunctionalisation might be required for
initial duplicate maintenance, it is not incompatible with
subsequent neofunctionalisation (Figure 1c).
Gene duplication during early vertebrate
evolution
Do vertebrates have more genes than other animal taxa? The
number of predicted genes in the human genome, when
compared to the number in the sequenced genomes of
Drosophila and Caenorhabditis elegans, suggests that they do
[7,8]. But such comparisons should be treated with caution as
they assume that humans and Drosophila/C. elegans are representative of all vertebrates and invertebrates, respectively.
More informative are comparisons made with lineages that
diverged closer to the origin of vertebrates, such as ascidians,
which do support this interpretation as do surveys of
numerous individual gene families from amphioxus — the
closest living invertebrate to the vertebrates [9•,10]. Genes
that are involved in regulating development have been
especially well studied in this context, and most show
evidence of duplication that is specific to vertebrates
(Table 1). This degree of duplication has been proposed to
reflect genome duplication on the vertebrate stem — a
theory that is supported by the widespread occurrence
394
Pattern formation and developmental mechanisms
Figure 1
A
(a) Degeneration
A
A
A
(b) Neofunctionalisation
(c) Subfunctionalisation
A
AC
A
AB
AB
A
AC
AB
B
BD
A
AB
Fates of duplicated genes. (a) The classical
model of degeneration of one copy after
duplication. (b) Neofunctionalisation, in which
initially identical duplicates with function A
diverge by acquiring new functions B and C.
(c) Subfunctionalisation, in which duplicate
genes with multiple functions A and B diverge
by reciprocal loss. The extension indicates the
possibility of subsequent neofunctionalisation
by acquisition of further functions C and D. It
is important to recognise that these models
are not mutually exclusive. It is likely that all
are correct to some degree and all occur to
some extent, but there may be variations
according to taxon, type of gene and type of
duplication. The important question is does
one model account for most of the preserved
duplications seen generally in nature or in
specific taxa?
Current Opinion in Genetics & Development
of paralogy in the human genome, but which has its
detractors [11,12].
Gene duplication, divergence and vertebrate evolution
Irrespective of the debate on the mode of duplication, it is
clear that vertebrates have more members of most ‘developmental’ gene families than do their closest relatives. It is
also clear that there is a correlation between the expansion
Table 1
Expansion during early vertebrate evolution for a selection
of gene families, and evidence for sub- or
neofunctionalisation.
Gene
family
Number of Number of Neofunctionalisation
amphioxus vertebrate and/or
genes
genes
subfunctionalisation References
Wnt8
1
2
Sub
[13]
Wnt1
1
1
NA
[26]
Wnt5
1
1
NA
[27]
Wnt3
1
2
?
[27]
Wnt7
1
2
?
[27]
Otx
1
2/3
Neo
[28]
Hh
1
3
Neo
[29]
HNF-3
1
3
?
[30]
Snail
1
3
Neo
[31]
Pax1/9 1
2
Neo
[32]
Hox
1
4
Neo
[33,34]
Pax3/7 1
2
Neo
[35]
Pitx
1
3
Sub + Neo
[36]
Dll
1 or 2
6
Neo
[37]
Sox1/2/3 1
3
Neo
[14]
Mnx
1
2
?
[38]
En
1
2
Neo
[39]
Islet
1
2
Neo
[40]
A question mark indicates that the evidence for either is
ambiguous. Please note this is our interpretation of the data and
does not necessarily reflect the views of the authors of each
publication. NA, not applicable.
of gene families and the complexity of the central nervous
system and other organ systems in vertebrates, at least with
respect to early vertebrate evolution. Development of the
vertebrate body plan requires the combinatorial action
of many genes and, notably, many vertebrate-specific
structures are marked by the expression of gene duplicates
(Table 1). Is there evidence here for the direct linking of
gene duplication with increasingly complex morphology?
If so, does it support one of the above theories in preference to the others?
Evidence for sub- and neofunctionalisation
To establish sub- and neofunctionalisation that is specific
to vertebrates, comparisons must be made between
vertebrate genes and those in their closest relatives, most
informatively in amphioxus. Many such studies have been
carried out in recent years on several gene families. Some
of these are listed in Table 1, together with our interpretation
of the pattern of evolution that they display.
Such comparisons reveal some indication of subfunctionalisation. A good example is the Wnt8 family, in which a
single gene in amphioxus possesses several domains of
expression that seem to have been subdivided between
the two vertebrate Wnt8 paralogues [13•]. But clear-cut
examples of subfunctionalisation are rare, with most gene
families instead showing evidence of neofunctionalisation.
A good example of this is the Sox1/Sox2/Sox3 family,
which in amphioxus is expressed only in the nervous
system — a site that we can deduce to be primitive by
comparison with expression in Drosophila [14]. In vertebrates, all three Sox genes are expressed in the central
nervous system and in some tissues that are presumed to
be vertebrate-specific, such as the epibranchial placodes
[15]. Other examples of such divergent multigene families
Gene duplication and divergence in the early evolution of vertebrates Mazet and Shimeld
in vertebrates are the Hedgehog, Engrailed, Pax1/Pax9
and Distalless families (Table 1).
Can we take the predominance of new functions in these
amphioxus/vertebrate gene family comparisons as evidence
in favour of neofunctionalisation over subfunctionalisation
in the early evolution of vertebrates, and consequently as
falsification of the DDC model? Our answer is we cannot.
First, with the available data we can only detect divergence
on the basis of expression, and not on the basis of evolutionary change of protein function. The latter is hard to test
experimentally, because it requires the ability to manipulate gene function in several taxa, although it may be
predicted perhaps by comparing rates of evolution in
different parts of the proteins of paralagous genes [16,17].
We also cannot rule out the possibility that neofunctionalisation followed subfunctionalisation. The evolutionary
separation between extant vertebrate and invertebrate
lineages is such that subtle subfunctionalisation may have
become obscured by subsequent change. Although examining the relative rates of evolution of paralogues has been
informative in the analysis of more recent duplications
[6••,18•,19•], it is unlikely that we will be able to use this
and more sophisticated molecular evolutionary analysis,
such as tests for adaptive evolution, to examine informatively early vertebrate evolution, principally owing to the
length of time of divergence. With this in mind, it may be
impossible to test definitively which process was most
significant in early vertebrate evolution.
An alternative approach is to examine similar, more recent
evolutionary events and extrapolate these findings to the
origin of vertebrates. For example, some lineages of teleost
fish have been found to possess duplicate copies of genes
that are present in single copy in other vertebrate lineages
and mapping data from zebrafish imply that this may have
resulted from a genome duplication, thereby presenting an
analogous situation to what occurred at the origin of vertebrates (though note this theory has its detractors) [20–22].
Comparison on a wide scale of the fates of these more
recently created paralagous genes will help to establish
general principles that can then be applied to vertebrate
origins. It is notable that some recent studies provide
strong evidence for subfunctionalisation of duplicate genes
in fish [23,24], whereas others suggest the evolution of new
functions [25]. It will be important to see which predominates but this will have to wait until many paralagous
genes with a variety of functions have been studied in
detail — a considerable undertaking.
Conclusions
Although many genes show evidence of duplication in
early vertebrate evolution, experimental analyses of
individual families suggests that there is a predominance
of neofunctionalisation, with little evidence of subfunctionalisation. This may, however, be an artefact of the
considerable length of time that vertebrates and their
nearest relatives have been separated. Direct tests of
395
hypotheses concerning the action of selection on such
ancient evolutionary events may never be possible, and
consequently we must look to more recent duplications,
such as those in teleost fish and amphibia, to establish
general principles that can be extrapolated to the base of
vertebrate evolution.
Acknowledgements
The authors would like to acknowledge the support of the BBSRC.
References and recommended reading
Papers of particular interest, published within the annual period of review,
have been highlighted as:
• of special interest
•• of outstanding interest
1.
Guillemaud T, Raymond M, Tsagkarakou A, Bernard C, Rochard P,
Pasteur N: Quantitative variation and selection of esterase
gene amplification in Culex pipiens. Heredity 1999,
83:87-99.
2.
Ohno S: Evolution by Gene Duplication. New York: Springer-Verlag;
1970.
3.
Hughes MK, Hughes AL: Evolution of duplicate genes in a
tetraploid animal, Xenopus laevis. Mol Biol Evol 1993,
10:1360-1369.
4.
••
Lynch M, Force A: The probability of duplicate gene
preservation by subfunctionalisation. Genetics 2000,
154:459-473.
This paper unites quantitative population genetics with predictions of
duplicate gene preservation, and then correlates this with the partitioning
of tissue-specific patterns of expression between the duplicate genes.
5.
Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J:
Preservation of duplicate genes by complementary, degenerative
mutations. Genetics 1999, 151:1531-1545.
6. Lynch M, Conery JS: The evolutionary fate and consequences of
•• duplicate genes. Science 2000, 290:1151-1155.
The authors survey genomic datasets to identify duplicate genes and
determine their rate of creation, and then use these data to examine rate
variation and evidence for selection in paralagous pairs. They propose that,
after duplication, differential subfunctionalisation in isolated populations will
form a powerful selective barrier to interbreeding, leading to the origin of
new species.
7.
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J,
Devon K, Dewar K, Doyle M, FitzHugh W et al.: Initial sequencing
and analysis of the human genome. Nature 2001, 409:860-921.
8.
Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG,
Smith HO, Yandell M, Evans CA, Holt RA et al.: The sequence of the
human genome. Science 2001, 291:1304-1351.
9. Furlong RF, Holland PWH: Were vertebrates octaploid? Phil Trans R
•
Soc Lond B 2002, 357:531-544.
Robust phylogenetic analysis of numerous gene families identified in both
amphioxus and vertebrates leads to a model of vertebrate genome evolution
in which homeologous crossover occurred between chromosomes in an
octaploid ancestor early in vertebrate evolution.
10. Simmen MW, Leitgeb S, Clark VH, Jones SJM, Bird A: Gene number
in an invertebrate chordate, Ciona intestinalis. Proc Natl Acad Sci
USA 1998, 95:4437-4440.
11. Friedman R, Hughes AL: Pattern and timing of gene duplication in
animal genomes. Genome Res 2001, 11:1842-1847.
12. Holland PW, Garcia-Fernàndez J, Williams NA, Sidow A: Gene
duplications and the origins of vertebrate development.
Development 1994, Suppl:125-133.
13. Schubert M, Holland LZ, Panopoulou G, Lehrach H, Holland ND:
•
Characterization of amphioxus AmphiWnt8: insights into the
evolution of patterning of the embryonic dorsoventral axis. Evol
Dev 2000, 2:85-92.
Phylogenetic analysis shows that the amphioxus Wnt8 orthologue is
positioned at the basis of the vertebrate Wnt8 family. The pattern of
AmphiWnt8 expression in the brain, gut and the mesoderm is the sum of the
expression patterns of the vertebrate Wnt8 genes. This is a clear example of
396
Pattern formation and developmental mechanisms
subfunctionalisation after duplication when comparing vertebrate and
amphioxus genes.
27.
Schubert M, Holland LZ, Stokes MD, Holland ND: Three amphioxus
Wnt genes (AmphiWnt3, AmphiWnt5, and AmphiWnt6) associated
with the tail bud: the evolution of somitogenesis in chordates.
Dev Biol 2001, 240:262-273.
14. Holland LZ, Schubert M, Holland ND, Neuman T: Evolutionary
conservation of the presumptive neural plate markers
AmphiSox1/2/3 and AmphiNeurogenin in the invertebrate
chordate amphioxus. Dev Biol 2000, 226:18-33.
28. Williams NA, Holland PWH: Old head on young shoulders. Nature
1996, 383:490.
15. Ishii Y, Abu-Elmagd M, Scotting PJ: Sox3 expression defines a
common primordium for the epibranchial placodes in chick.
Dev Biol 2001, 236:344-353.
29. Shimeld SM: The evolution of the hedgehog gene family in
chordates: Insights from amphioxus hedgehog. Dev Genes Evol
1999, 209:40-47.
16. Dermitzakis ET, Clark AG: Differential selection after duplication in
mammalian developmental genes. Mol Biol Evol 2001, 18:557-562.
30. Shimeld SM: Characterisation of amphioxus HNF-3 genes:
conserved expression in the notochord and floor plate. Dev Biol
1997, 183:74-85.
17.
Massingham T, Davies LJ, Lio P: Analysing gene function after
duplication. BioEssays 2001, 23:873-876.
18. Kondrashov FA, Rogozin IB, Wolf YI, Koonin EV: Selection in the
•
evolution of gene duplication. Genome Biol 2002,
3:0008.1-0008.9.
The authors describe a two-stage evolutionary model of gene duplication that
is based on surveys of paralagous gene pairs from completely sequenced
genomes. The first stage is characterised by purifying selection and predicts
that only immediately advantageous duplications will be retained. This is
followed by the creation of new functions only later in evolution.
19. Van de Peer Y, Taylor JS, Braasch I, Meyer A: The ghost of selection
•
past: rates of evolution and functional divergence of anciently
duplicated genes. J Mol Evol 2001, 53:436-446.
The authors analyse paralagous gene pairs that resulted from duplications
specific to the zebrafish lineage. Similar to Kondrashov et al. [18•], they find
a significant degree of purifying selection in both paralogues and also
identify some rate heterogeneity that suggests an increased rate of evolution
for paralogues and possible positive selection.
20. Amores A, Force A, Yan Y, Joly L, Amemiya C, Fritz A, Ho R,
Langeland J, Prince V, Wang YL et al.: Zebrafish hox clusters and
vertebrate genome evolution. Science 1998, 282:1711-1714.
21. Taylor JS, Van de Peer Y, Braasch I, Meyer A: Comparative genomics
provides evidence for an ancient genome duplication event in
fish. Phil Trans R Soc Lond B 2001, 356:1661-1679.
22. Robinson-Rechavi M, Marchand O, Escriva H, Laudet V: An ancestral
whole-genome duplication may not have been responsible for the
abundance of duplicated fish genes. Curr Biol 2001,
11:R458-R459.
23. Bruce AE, Oates AC, Prince VE, Ho RK: Additional hox clusters in
the zebrafish: divergent expression patterns belie equivalent
activities of duplicate hoxB5 genes. Evol Dev 2001, 3:127-144
24. De Martino S, Yan Y-L, Jowett T, Postlethwaite JH, Varga ZM,
Ashworth A, Austin CA: Expression of sox11 gene duplicates in
zebrafish suggests the reciprocal loss of ancestral gene
expression patterns in development. Dev Dyn 2000, 217:279-292.
25. McClintock JM, Carlson R, Mann DM, Prince VE: Consequences of
Hox gene duplication in vertebrates: an investigation of zebrafish
Hox paralogue group 1 genes. Development 2001, 128:2471-2484.
26. Schubert M, Holland LZ, Holland ND: Characterization of an
amphioxus wnt gene, AmphiWnt11, with possible roles in
myogenesis and tail outgrowth. Genesis 2000, 27:1-5.
31. Langeland JA, Tomsa JM, Jackman WR, Kimmel CB: An amphioxus
snail gene: expression in paraxial mesoderm and neural plate
sggests a conserved role in patterning the vertebrate embryo.
Dev Genes Evol 1998, 208:569-577.
32. Holland N, Holland L, Kozmik Z: An amphioxus Pax gene,
AmphiPax-1, expressed in embryonic endoderm, but not in
mesoderm: implications for the evolution of class I paired box
genes. Mol Mar Biol Biotechnol 1995, 4:206-214.
33. Holland PWH, Holland LZ, Williams NA, Holland ND: An amphioxus
homeobox gene: sequence conservation, spatial expression
during development and insights into vertebrate evolution.
Development 1992, 116:653-661.
34. Wada H, Garcia-Fernàndez J, Holland PWH: Colinear and
segmental expression of amphioxus Hox genes. Dev Biol 1999,
213:131-141.
35. Holland LZ, Schibert M, Kozmik Z, Holland ND: AmphiPax3/7, an
amphioxus paired box gene: insights into chordate myogenesis,
neurogenesis and the possible evolutionary precursor of
definitive vertebrate neural crest. Evol Dev 1999, 1:153-165.
36. Yasui K, Zhang S, Uemura M, Saiga H: Left–right asymmetric
expression of BbPtx, a Ptx-related gene, in a lancelet species and
the developmental left-sidedness in deuterostomes. Development
2000, 127:187-195.
37.
Holland ND, Panganiban G, Henyey EL, Holland LZ: Sequence and
developmental expression of AmphiDll, an amphioxus Distalless
gene transcribed in the ectoderm, epidermis and nervous system:
insights into the evolution of craniate forebrain and neural crest.
Development 1996, 122:2911-2920.
38. Ferrier DEK, Brooke NM, Panopoulou G, Holland PWH: The Mnx
homeobox gene class defined by HB9, MNR2 and amphioxus
AmphiMnx. Dev Genes Evol 2001, 211:103-107.
39. Holland LZ, Kene M, Williams NA, Holland ND: Sequence and
embryonic expression of the amphioxus engrailed gene
(AmphiEn): the metameric pattern of transcription resembles that
of its segment-polarity homolog in Drosophila. Development 1997,
124:1723-1732.
40. Jackman WR, Langeland JA, Kimmel CB: islet reveals segmentation
in the amphioxus hindbrain homolog. Dev Biol 2000, 220:16-26.