Download The Limits of Natural Selection in a

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Viral phylodynamics wikipedia , lookup

Genome evolution wikipedia , lookup

Genetics and archaeogenetics of South Asia wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Adaptive evolution in the human genome wikipedia , lookup

Inbreeding wikipedia , lookup

Oncogenomics wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Frameshift mutation wikipedia , lookup

Genetic drift wikipedia , lookup

Polymorphism (biology) wikipedia , lookup

Epistasis wikipedia , lookup

Point mutation wikipedia , lookup

Koinophilia wikipedia , lookup

Human genetic variation wikipedia , lookup

Group selection wikipedia , lookup

Mutation wikipedia , lookup

Population genetics wikipedia , lookup

Microevolution wikipedia , lookup

Transcript
TIGS 1258 No. of Pages 10
Opinion
The Limits of Natural Selection
in a Nonequilibrium World
Yaniv Brandvain1 and Stephen I. Wright2,*
Evolutionary theory predicts that factors such as a small population size or low
recombination rate can limit the action of natural selection. The emerging field of
comparative population genomics offers an opportunity to evaluate these
hypotheses. However, classical theoretical predictions assume that populations
are at demographic equilibrium. This assumption is likely to be violated in the
very populations researchers use to evaluate selection's limits: populations that
have experienced a recent shift in population size and/or effective recombination rates. Here we highlight theory and data analyses concerning limitations on
the action of natural selection in nonequilibrial populations and argue that
substantial care is needed to appropriately test whether species and populations show meaningful differences in selection efficacy. A move toward modelbased inferences that explicitly incorporate nonequilibrium dynamics provides
a promising approach to more accurately contrast selection efficacy across
populations and interpret its significance.
Characterizing the Influence of Nonequilibrium Demographic History on the
Efficacy of Natural Selection
Despite the relentless action of natural selection, no organism is perfect. Understanding the
factors that limit natural selection is a fundamental concern of population genetics [1]. These
limits can inform the basis of human genetic diseases [2,3], the extinction of small [4,5], asexual
[6], or selfing populations [7–10], the degradation of sex chromosomes [11], the evolution of
gene [12] and genome structure [13–15], and the consequences of domestication [16–19]. For
example, the history of population bottlenecks associated with range expansion might be
responsible for the excess of potentially damaging and, in some cases, disease-causing
[20,21] variants in historically bottlenecked human populations [21–26]. Similarly, an excess
of putatively deleterious mutations in predominantly self-fertilizing plants suggests that purifying
selection is limited in these taxa [27–30].
Because of its wide-ranging implications for micro- and macroevolution, conservation, plant and
animal breeding, and human health, there has been much research on understanding the limits
of selection. Classic theory predicts that smaller effective population sizes (see Glossary),
higher ploidy levels, and lower effective recombination rates, among other factors, can limit the
action of natural selection, driving populations toward lower mean fitness [1]. Population
genomic datasets allow researchers to precisely identify and quantify putatively deleterious
mutations, providing the opportunity for powerful tests of these hypotheses by investigating
whether populations that differ in one or more of these features result in different patterns of
divergence and variation for putatively deleterious mutations (many are reviewed in Table 1).
However, there is growing recognition that inferring and interpreting differences in the efficacy of
selection based on population genomic data can be more difficult than initially thought. Here we
indicate the challenges in connecting differences in mutational patterns between taxa to differences in the action of natural selection (see Box 1 for a discussion of how researchers attempt to
Trends in Genetics, Month Year, Vol. xx, No. yy
Trends
Next-generation DNA and RNA
sequencing datasets are enabling
powerful genomic tests of the factors
influencing the efficacy of natural selection against deleterious mutations.
Theoretical advances have provided
important insights into the extent to
which demographic history influences
selection strengths and genetic load.
There is growing recognition that
nonequilibrium demographic history
can lead to spurious signals of
increased genetic load.
Model-based methods investigating
the efficacy of selection, incorporating
demographic history and uncertainty in
the dominance of deleterious mutations, are being developed to provide
further insights into the factors limiting
selection in natural populations.
1
Department of Plant Biology,
University of Minnesota, St[1_TD$IF] Paul, MN
55108, USA
2
Department of Ecology and
Evolutionary Biology, University of
Toronto, Toronto, ON M5S 3B2,
Canada
*Correspondence:
[email protected]
(S.I. Wright).
http://dx.doi.org/10.1016/j.tig.2016.01.004
© 2016 Elsevier Ltd. All rights reserved.
1
TIGS 1258 No. of Pages 10
measure selection's limits). In particular, we highlight the complications that arise in attempts to
infer limited efficacy of selection in nonequilibrial populations.
There is growing recognition, particularly in the human population genomics literature, that
nonequilibrium demography can complicate empirical tests for differences in the efficacy of
selection from polymorphism data. This complication has led to considerable debate on the
interpretation of genomic patterns [24,31–33]. For example, following a strong population
bottleneck, sites under purifying selection are expected to recover equilibrium faster than neutral
sites, which can lead to spurious signals of reduced efficacy of selection in bottlenecked
populations [31,34,35]. Therefore, finding strong evidence for limited selection in such cases
is difficult, particularly because theory predicts that population bottlenecks and expansions can
have subtle and complex influences on selection. Although the confounding effects of nonequilibrium demography on our inference of relaxed selection is well appreciated in the human
literature, we argue that such consideration must be extended to studies of other populations
that have experienced a recent bottleneck, including domesticated species, those that have
experienced a shift in mating system, and other candidates for relaxed selection.
We argue that the field has yet to fully address these challenges, meaning that many examples
purporting to provide evidence for selection limits remain ambiguous. Therefore, clarifying what
is robust, genome-wide evidence for differences in selection, and improved understanding of
between-species differences in fitness, will be a major task in the coming years. Below we outline
both the promise and the challenges of using genomic data to test theory about the limits of
natural selection and to investigate what forces are most important in imposing limits, especially
as populations grow, shrink, and spread in our changing world.
Limits to Selection
Theoretical Predictions of the Limits of Natural Selection
The number and frequency of damaging mutations is determined by their introduction into a
population by mutation, their stochastic change by drift, and their removal by selection. For
strongly deleterious mutations in modestly sized populations, selection, mutation, and the mode
of inheritance can describe the expected number and frequency of deleterious mutations, as well
as the mutation load. For large-effect mutations, the predicted genetic load is simply a function
of the mutation rate [36]; however, additional biological reality complicates this prediction [37].
For mutations with more subtle influences on fitness, stochasticity can overwhelm the action of
selection as the number of independent chromosomes in a population decreases. Therefore,
deleterious mutations are predicted to be more common in smaller populations (stronger effects of
drift) and/or populations or genomic regions with lower recombination rates (less independence of
selection on these chromosomes). Another constraint on selection's action is its ability to ‘see’
mutations. In diploids, deleterious variation can be hidden in the recessive state. In species or
genomic regions with higher ploidy, a similar effect can occur and therefore selection may be unable
to eliminate deleterious loss-of-function mutations that are hidden by functional gene copies.
Together, population size, recombination rates, mutation rates, ploidy, and the mode of gene action
interact to set the limits of selection, which we can hope to uncover from genomic data (Table 1).
In this brief treatment, we neglect constraints in organismal design and the mutational landscape
that can also impose limits on selection [1]. Uncovering these limits from genomic data presents
an additional set of challenges orthogonal to the points below.
Estimating Demographic History and Recombination Rates
The basic predictors of the limits of natural selection – recombination rates, mutation rates, and
historical population sizes – can, in principle, be precisely estimated from modern genomic data.
2
Trends in Genetics, Month Year, Vol. xx, No. yy
Glossary
p: the expected number of sequence
differences between two
chromosomes.
pn: the expected number of
sequence differences between two
chromosomes at nonsynonymous
sites within a species or population.
ps: the expected number of
sequence differences between two
chromosomes at synonymous sites
within a species or population.
Distribution of fitness effects
(DFE): the distribution of the
proportion of new mutations in each
class of selection coefficient.
dN: the expected number of
sequence differences between two
chromosomes at nonsynonymous
sites between species.
dS: the expected number of
sequence differences between two
chromosomes at synonymous sites
between species
Effective population size (Ne): the
effective number of breeding
individuals.
Gene surfing: the spread of new
and standing mutations that are on
the wave front of a range expansion;
surfing by neutral or deleterious
mutations mimics positive selection
because they spread and fix over
large areas.
Genetic load: the difference
between the average population
fitness and the fitness of a mutationfree genotype.
TIGS 1258 No. of Pages 10
Table 1. Selected Examples of Genome-Wide Tests of Selection's Limits
Factor
Theoretical
Prediction
Comparison
Metric of Relaxed Selection
Refs
Significant
Effect?
Effective
population
size
Small population
sizes increase
the effects of drift
relative to
selection [68]
Domesticated vs
wild tomato
dN/dS
[69]
+
Domesticated vs
wild sunflowers
pn/ps
[17]
+
Domesticated vs
wild rice
[7_TD$IF]dN/[8_TD$IF]dS
dN/[9_TD$IF]dS
[70]
[16]
+
+
Domesticated vs
wild horses
# Deleterious mutations/ind
[19]
+
Arabidopsis thaliana
populations of
different sizes
pn/ps
[71]
+
Yeast species of
different sizes
Various
[72]
+
Human populations
experiencing
historical differences
in population size
pn/ps (+ related summaries)
pn/ps (+ related summaries)
# Deleterious mutations/ind
# Deleterious mutations/ind
# Deleterious mutations/ind
Predicted individual fitness
[25]
[22]
[31]
[33]
[52]
[51,52]
+
+
-
Low- vs highrecombination
genomic regions
pn/ps, GERP scores
dN/dS
DFE
pn/ps, dN/dS, DFE
[74]
[75]
[76]
[77]
+
Selfing vs
outcrossing species
and populations
pn/ps, dN/dS, DFE
# Stop codons, pn/ps
dN/dS
pn/ps
dN/dS
pn/ps
DFE
DFE
[78]
[28]
[10_TD$IF][28]
[79]
[79]
[30]
[27]
[80]
+
+
dN/dS, pn/ps
pn/ps
[29]
[81]
+
Recombination
rates
Reduced
recombination
will cause higher
selective
interference,
weakening
selection [73]
Asexual vs sexual
populations
+
+
+
+
+
+
+
+
Tests of whether selection is weaker in populations with reduced effective population size or effective recombination rates.
Response variables include the ratio of nonsynonymous to synonymous substitutions (dN/dS), the ratio of nonsynonymous
to synonymous polymorphism pn/ps, and the inferred DFE.
For example, one diploid genome can provide a sketch of an entire population's history [38] and
many sequences allow a fine-grained view of recent population sizes and migration rates
[39–41]. Dense genotype data from many individuals provides precise characterization of
the meiotic recombination landscape and whole-genome sequencing of parent–offspring trios
can reveal the rate of spontaneous mutation [42,43], providing access to additional predictors of
selection's limits. With such information, researchers can explore the relationship between the
action of selection and population parameters of interest [31–33,44].
Quantifying the Limits to Selection
A key measure of interest in evaluating the efficacy of selection is the genetic load – the difference
between achieved fitness and maximum potential fitness. In theory one could test whether two
populations differ in their load by comparing the fitness of individuals from these populations, and
although the genetic load is not an exact measure of the historical action of selection, the efficacy
of selection was likely to be more limited in the population with the larger load. In practice,
however, comparing fitness of individuals adapted to different biotic and abiotic challenges is often
Trends in Genetics, Month Year, Vol. xx, No. yy
3
TIGS 1258 No. of Pages 10
Box 1. How Can We Quantify and Compare the Efficacy of Selection?
Various metrics are applied to compare the efficacy of negative selection, but such comparisons have a number of
challenges in their power and interpretation.
The ratio of nonsynonymous to synonymous substitutions between species (dN/dS) is a common (and crude) measure of
the strength of selection. Elevated dN/dS values are often interpreted as reflecting reduced efficacy of negative selection,
but elevated dN/dS can also indicate elevated positive selection [48,82]. In addition, because new mutations accumulate
slowly, we expect only modest differences in the number of deleterious mutations between recently diverged populations. Because most comparative population genomic studies focus on pairs of closely related populations (Table 1)
between-species contrasts are often underpowered, limiting our ability to contrast contemporary differences in selection.
Therefore, comparisons of intraspecific diversity at selected and neutral sites (e.g., pn/ps) fell into favor as a way to
overcome this limitation. However, nonequilibrium demography can mislead this measure (Box 3).
The use of evolutionary models to infer the DFE of deleterious mutations [62,63] is thus becoming increasingly
widespread (Table 1). By combining information on the reduction in nonsynonymous diversity with the frequency
distribution of nonsynonymous and synonymous mutations, these methods explicitly quantify the strength of selection
against deleterious mutations rather than simply reporting a ratio of diversity levels at functional and neutral sites.
However, cross-species comparisons of the DFE do not always generate straightforward interpretations of differences in
the efficacy of selection. For example, one species can show increased proportions of both effectively neutral and
strongly deleterious mutations compared with another [27].
Additionally, genomic data allow improved assessment of which mutations are likely to have deleterious effects [57].
Researchers can use algorithms (e.g., SIFT [59], MAPP [60]) that combine cross-species sequence conservation and
functional predictions. Since tests of the limits of selection are focused on mutations with fitness effects, an improved
search image for damaging mutations may increase the power of such tests. However, these methods inherently rely on
sites subject to purifying selection over long macroevolutionary history and weakly selected sites that are more likely to be
subject to shifts in the efficacy of selection could remain undetected, and beneficial changes in function can be confused
for deleterious mutations.
unfeasible. As we describe below and in Box 1, annotated population genomic data may provide
an alternative glimpse into the historical action of natural selection, and genome-informed inferences can, in principle, provide information about the genetic load in populations [37].
Such data allow precise estimation of the number and frequency of putatively deleterious
mutations genome wide, potentially facilitating strong hypothesis testing regarding what factors
influence the number and frequency of deleterious variants. The genome contains enough sites
variable within or divergent among populations to investigate whether individuals from a given
population carry more deleterious mutations than individuals in a different population. To test
whether selection's limits differ between populations, species, or genomic regions, researchers
often compare the number or frequency of putatively deleterious variants, perhaps weighted by
their inferred effect on fitness, relative to putatively neutral ones and/or fit a model of demography
and selection to such data (Box 1). The challenge, which we discuss in more detail below, is
relating observed mutational patterns to the goal of quantifying and comparing historical
selection. As we highlight in Box 1, this challenge is not trivial: there is currently no single
agreed on measure of the efficacy of selection.
Evidence of Selection's Limits
Recent analyses of whole-genome data have attempted to test hypotheses of selection's limits
(Table 1). Such tests take species, populations, or genomic regions that differ in some biological
feature such as effective recombination rate, effective population size, and/or ploidy and ask
whether there is a corresponding difference in the action of purifying selection as inferred from
patterns of polymorphism and/or divergence.
The results are somewhat mixed, with some studies purporting to find evidence for reduced
efficacy of selection while others do not. In addition to variable results, the studies differ in how
they test for relaxed selection. Moreover, tests of specific biological hypotheses can be quite
different even when the explanatory variables are superficially similar. For example, both a severe
4
Trends in Genetics, Month Year, Vol. xx, No. yy
TIGS 1258 No. of Pages 10
recent bottleneck and a long-term small census size decrease the effective population size, but
their predicted influence on selection is quite different (see below).
Limits to Selection in Nonequilibrial Populations
Theory in Nonequilibrial Populations
Most results concerning the limits to selection assume demographic equilibrium. However, most
populations are rarely at equilibrium. Many taxa that have been studied to test specific predictions of natural selection's limits have experienced a recent bottleneck (e.g., out-of-African
human populations, domesticated plants and animals, plants that recently evolved a propensity
to self-pollinate; Table 1). In these nonequilibrial populations, the effective population size
estimated by sequence diversity (e.g., pairwise sequence diversity at synonymous sites) does
not adequately summarize the complex predictions of population genetic theory. We summarize
recent developments in population genetic theory addressing the limits of selection in nonequilibrium populations in Box 2. We also highlight the extension of these models to spatially
expanding populations [26,45].
Overall, the theory summarized in Box 2 demonstrates that both a rapid decrease in population
size followed by a fast recovery and a spatial population expansion will have subtle influences on
the action of selection. If mutations have an additive effect on fitness, a recent bottleneck or
population expansion will have a negligible effect on population fitness [31,33,46]. However, the
case becomes more complex for recessive variants [24,45–47], where a short-term bottleneck
can change a population's fitness. Thus, while longer-term reductions in effective population size
are expected to have important effects on population mean fitness, the effects of shorter-term
demographic bottlenecks followed by recovery depend in important ways on the dominance of
deleterious mutations.
Inferring Selection's Limits in Nonequilibrial Populations
A reconsideration of what constitutes empirical evidence of reductions in the efficacy of selection
has accompanied the development of theory described in Box 2. Importantly, one common
measure of weakened selection – the relative ratio of putatively deleterious to putatively neutral
variation within populations (e.g., pn/ps) can be particularly misled by recent bottlenecks. That is,
because neutral variation approaches equilibrium more slowly than deleterious variation (Box 3),
ratios of deleterious to neutral variation will be elevated in bottlenecked populations. Therefore,
Box 2. Theory: Selection's Limits in Nonequilibrial Populations
Most rare mutations are stochastically lost in the contraction phase of a bottleneck. Therefore, a recently bottlenecked
population will retain few segregating deleterious mutations. However, those that survive the contraction are pushed to a
higher frequency, and the average number of deleterious mutations per individual remains roughly constant after a
population contraction [33,46,47]. Providing a bottlenecked population remains small, mildly deleterious mutations can
accumulate; however, this effect is largely minor and transient, especially if the period of small population size is limited.
The impact of a bottleneck on population fitness depends on the mode of gene action. Because the average number of
deleterious mutations per individual is insensitive to recent bottlenecks, the genetic load induced by alleles with additive
effects on fitness is also insensitive to brief bottlenecks [33,46]. By contrast, the higher frequency of alleles surviving a
bottleneck means that the deleterious effects recessive mutations will be expressed, increasing the genetic load [46,47].
However, this burden is temporary: recessive variants return to equilibrium frequencies well before novel deleterious
mutations drift to considerable frequencies [46]. Therefore, theory predicts a deficit in the per-individual number of
deleterious recessive mutations in bottlenecked populations [47].
The impact of population expansions on the action of selection has also received much theoretical interest. Mutations
arising on the geographical edge of an expanding population experience an evolutionary advantage with little regard to
their fitness consequence, a phenomenon known as gene surfing [83–90]. The decline in fitness associated with
deleterious mutations riding the wave front is called the expansion load [26,45]. Standing variants with additive effects on
fitness do not contribute to the expansion load because drift does not influence expected allele frequencies [26,45].
However, new deleterious mutations with additive effects gradually accumulate at the wave front, generating an
expansion load [26]. Additionally, standing recessive variants that ride the wave front increase the expansion load
by exposing their fitness costs [45].
Trends in Genetics, Month Year, Vol. xx, No. yy
5
TIGS 1258 No. of Pages 10
Box 3. Population Contractions Generate a Relative Excess of Deleterious Variation, Regardless of the
Efficacy of Selection
Over time, mutation, drift, and selection generate an equilibrium distribution of allele frequencies across classes of
functional elements in the genome. Major changes in population size shift genetic variation from this equilibrium by
changing the number of chromosomes (in the population) that can be hit by mutation and in the face of population
contractions, removing a large number of segregating mutations.
The influence of demography on genome-wide patterns of variation is well known. However, the fact that this equilibrium
is reached more quickly for deleterious mutations than neutral variants (Figure I) (see also [34,35,49]) has not been
appreciated in our interpretation of the relative ratios of synonymous and nonsynonymous variation. In other words,
segregating deleterious mutations are at a lower frequency at equilibrium than are weakly selected mutations and so
deleterious mutations recover to their pre-bottleneck equilibrium levels faster than weakly selected mutations simply
because their frequency was not high before the bottleneck.
Genec diversity X me, by strength of purifying selecon,
starng with one chromsome
Key:
Neutral (s=0)
s = 0.001
s = 0.01
s = 0.1
0.000
0.002
0.004
π%
0.006
0.008
0.010
The implications of this result for inferring reductions in the efficacy of selection are serious. We often evaluate evidence for
differences in selection in recently bottlenecked populations using polymorphism patterns. The more rapid return of
equilibrium variation at nonsynonymous than synonymous sites means that we could be misled to infer our hypothesized
prediction; that is, we will infer that selection is relaxed following a bottleneck (elevated pn/ps) regardless of the actual
process. Thus, a direct estimate of the number of putatively deleterious mutations can provide a stronger test of
predictions of relaxed selection [31,33]. Alternatively, methods that estimate the distribution of fitness effects while
explicitly controlling for nonequilibrium population history can be implemented [62,63,91–94].
0
1000
2000
3000
4000
5000
Gen
Figure I. pn Reaches Equilibrium Faster Than ps after a Population Bottleneck. Pairwise sequence
diversity for a population of a constant size that begins with no variation (e.g., a population that has
undergone[2_TD$IF] a contraction [3_TD$IF]that [4_TD$IF]wiped [5_TD$IF]out [6_TD$IF]all genetic diversity and then experiences an instantaneous
expansion) with differing strengths of purifying selection. The blue or green lines are proxies for pn
and the red and black lines are proxies for ps. At less than 2000 generations into the expansion, for
example, pn/ps calculated using the green line (pn) to the red line (ps) must be interpreted with the
caveat that the red line has not reached equilibrium. Our cartoon corresponds directly to the finding
of Pennings et al. [35]: after a genome-wide sweep associated with drug resistance in HIV, the ratio
of nonsynonymous to synonymous diversity (pn/ps) is high and decreases with time. This result
reflects the more rapid approach to equilibrium under negative selection (pn) than with drift (ps) [34],
presumably because deleterious mutations rise to a lower frequency.
6
Trends in Genetics, Month Year, Vol. xx, No. yy
TIGS 1258 No. of Pages 10
pn/ps measures in bottlenecked populations can generate statistical artifacts that mimic expectations of relaxed selection [31,33–35,48,49].
Simons et al. [33] and Do et al. [31] have therefore suggested that an increased number of
deleterious mutations per individual, rather than the relative frequency of segregating variants,
should be used to estimate selection's limits. However, while the number of putatively deleterious mutations is robust to artifacts induced by nonequilibrium demography, this measure
neglects the frequency of mutations. Therefore, simply counting the number of deleterious
mutations per individual does not reliably estimate the genetic load when deleterious mutations
are partially recessive [24,33]. Because deleterious mutations are likely to be often recessive [50],
and because theoretical results concerning selection in nonequilibrial populations depend on the
mode of gene action, some authors have found the per-individual number of deleterious
mutations measure to be an inadequate summary of the genetic load [24,45,51,52].
The challenges in estimating the genetic load has intensified debate on the extent to which
selection has been limited in populations experiencing recent demographic shifts and particularly
in out-of-African human populations [24,26,31–33,53]. Out-of-African human populations contain proportionately more putatively deleterious variation relative to neutral genetic diversity than
African populations [25,26,54,55]. Counting the per-individual number of putatively deleterious
derived mutations reveals subtle [52,56] or no [31,33] difference among populations. However,
on average out-of-African populations are more homozygous for the derived, putatively deleterious mutations they do carry, and if these mutations are on average recessive, out-of-African
populations may have an elevated mutation load [24,32,52]. More generally, most populations
with recent dips in effective population size carry proportionally more putatively deleterious
variation than closely related populations with large effective population sizes (Table 1). The
observed increase in the proportion of putatively deleterious variation is consistent with expectations that selection is limited in populations with lower effective population sizes but can
plausibly be induced by the artifacts discussed above. Accounting for nonequilibrium while
considering the effects of dominance is therefore critical to our interpretation of the extent to
which selection has been relaxed in bottlenecked populations.
Promising Directions
As a field, we have not yet identified how to best infer selection's limits. One creative approach
weights genotypes by an estimate of their fitness costs under alternative dominance coefficients
[52]. However, such attempts must admit the considerable uncertainty in estimates of both the
mode of gene action and the fitness effect of mutations, which are usually inferred using crossspecies sequence conservation and/or functional predictions [57–60] and are not an actual
fitness estimate.
Similar challenges lie in the interpretation of other comparisons of selection efficacy in Table 1.
Selfing populations, for example, rarely show evidence for higher rates of substitution at selected
sites between species (using dN/dS), but more often show higher selected relative to neutral
variation segregating within populations (see [61] and examples in Table 1). Domesticated
organisms have similarly shown elevated proportions of selected variation compared with wild
relatives. The extent to which these polymorphism patterns are a simple consequence of
nonequilibrium demography without a corresponding increase in genetic load remains uncertain; however, recent studies in dogs [18] and horses [19] suggest that an increase in the genetic
load has accompanied domestication. This supports the hypothesis that a higher genetic load is
a ‘cost of domestication’ [16].
Model-based inference provides an alternative approach to comparing the efficacy of selection
across populations and genomic regions. Of particular note is a popular set of models that
Trends in Genetics, Month Year, Vol. xx, No. yy
7
TIGS 1258 No. of Pages 10
estimate the distribution of fitness effects (DFE) of deleterious mutations while explicitly
incorporating simple models of nonequilibrium demography [62,63]. The DFE approach has been
recently harnessed to infer both an accumulation of modestly deleterious mutations and the purging
of severely damaging mutations following the transition to self-fertilization [27]. However, DFE is
improperly calibrated under complex demographic histories [62] and with recurrent selection [64].
Moreover, current DFE models assume an additive model of gene action and random mating,
which can complicate interpretations of fitness differences across species in the presence of
recessive mutations, especially across species that differ in ploidy and selfing rate [27,65].
Outstanding Questions
More explicit modeling of the joint effects of dominance, demography, and nonequilibrium
transitions in population size and mating system will enable a more informed interpretation of
selection from population genomic data. With realistic forward simulations incorporating nonequilibrium, mating system, and our uncertainty in selection and dominance parameters, we
should be able to gain better insights into the scale and significance of selection differences.
These comparisons in combination with model fitting may also in turn inform us about the
distribution of dominance coefficients.
How rapidly can changes in population
size and recombination drive major
shifts in mean fitness and mutation load?
We caution, however, that the interpretation of results from model-based inference should be
treated with care, as model misspecification can mislead inference. For example, some increase
in the genetic load of out-of-African human populations is to likely reflect the introgression of
deleterious alleles from archaic human populations [66,67] in which selection was likely to have
been relaxed [31] and not accounting for this history of archaic introgression would seriously
mislead models that aim to infer the extent to which population expansion has limited natural
selection as some members of our species left Africa. Another possibility is that colonization
bottlenecks and transitions in mating system are associated with reduced selection coefficients
themselves, as shifts in environment and selective pressures free populations from historical
selection pressures [30]. Distinguishing the role of effective population size change and differences in selection strengths adds a further challenge in interpretation.
Concluding Remarks
As large-scale genomic datasets accumulate, there are exciting possibilities for investigating the
extent to which populations and genomic regions differ in the strength of selection and how this
affects the evolutionary process. Such analyses, however, highlight the difficulties in quantifying
selection's limits, especially in nonequilibrial populations. In parallel with increasingly sophisticated model-based approaches to investigate population history using neutral diversity, it is
becoming clear that simple comparisons of summaries of selected variation can give a limited or
inaccurate picture of differences in selection. Future research addressing selection's limits will
improve with explicit consideration of nonequilibrium conditions and mode of gene action; this
will inform better understanding of the extent to which species differ in deleterious mutation
accumulation (see Outstanding Questions). Meanwhile, we encourage researchers evaluating
evidence for relaxed selection to present multiple analyses of their data and explicit modeling of
demographic and mutational effects on fitness and to jointly consider the benefits and limitations
of these approaches when interpreting their results.
Acknowledgments
The authors thank Suzanne McGaugh and Josh Schraiber for their comments and suggestions on the manuscript and Brian
Charlesworth for helpful discussion.
References
1. Barton, N. and Partridge, L. (2000) Limits to natural selection.
Bioessays 22, 1075–1084
3. Pritchard, J.K. (2001) Are rare variants responsible for susceptibility to complex diseases? Am. J. Hum. Genet. 69, 124–137
2. Eyre-Walker, Y.C. and Eyre-Walker, A. (2014) The role of mutation
rate variation and genetic diversity in the architecture of human
disease. PLoS ONE 9, e90166
4. Lande, R. (1998) Risk of population extinction from fixation
of deleterious and reverse mutations. Genetica 102–103,
21–27
8
Trends in Genetics, Month Year, Vol. xx, No. yy
How can we develop models that
inform the historical action of selection
in nonequilibrial populations?
What is the distribution of dominance
coefficients of deleterious mutations
and how does this influence differences
among populations and species in the
efficacy of selection?
Did domestication bottlenecks, transitions to self-fertilization, out-of-African
bottlenecks, and northward expansion
following the retreat of the glaciers drive
an increase in mutation load and concomitant reduction in mean fitness?
TIGS 1258 No. of Pages 10
5. Lynch, M. et al. (1995) Mutation accumulation and the extinction of
small populations. Am. Nat. 146, 489–518
6. Lynch, M. et al. (1993) The mutational meltdown in asexual populations. J. Hered. 84, 339–344
31. Do, R. et al. (2015) No evidence that selection has been less
effective at removing deleterious mutations in Europeans than in
Africans. Nat. Genet. 47, 126–131
7. Goldberg, E.E. et al. (2010) Species selection maintains selfincompatibility. Science 330, 493–495
32. Lohmueller, K.E. (2014) The impact of population demography
and selection on the genetic architecture of complex traits. PLoS
Genet. 10, e1004379
8. Igic, B. and Busch, J.W. (2013) Is self-fertilization an evolutionary
dead end? New Phytol. 198, 386–397
33. Simons, Y.B. et al. (2014) The deleterious mutation load is insensitive to recent population history. Nat. Genet. 46, 220–224
9. Takebayashi, N. and Morrell, P.L. (2001) Is self-fertilization an
evolutionary dead end? Revisiting an old hypothesis with genetic
theories and a macroevolutionary approach. Am. J. Bot. 88,
1143–1150
34. Gordo, I. and Dionisio, F. (2005) Nonequilibrium model for estimating parameters of deleterious mutations. Phys. Rev. E: Stat.
Nonlin. Soft Matter Phys. 71, 031907
10. Wright, S.I. et al. (2013) Evolutionary consequences of self-fertilization in plants. Proc. Biol. Sci. 280, 20130133
11. Charlesworth, B. and Charlesworth, D. (2000) The degeneration of
Y chromosomes. Philos. Trans. R. Soc. Lond. B: Biol. Sci. 355,
1563–1572
12. Lynch, M. (2006) The origins of eukaryotic gene structure. Mol.
Biol. Evol. 23, 450–468
13. Lynch, M. (2007) The Origins of Genome Architecture, Sinauer
Associates
14. Lynch, M. and Conery, J.S. (2003) The origins of genome complexity. Science 302, 1401–1404
35. Pennings, P.S. et al. (2014) Loss and recovery of genetic diversity
in adapting populations of HIV. PLoS Genet. 10, e1004000
36. Haldane, J.B.S. (1937) The effect of variation on fitness. Am. Nat.
71, 337–349
37. Agrawal, A.F. and Whitlock, M.C. (2012) Mutation load: the fitness
of individuals in populations where deleterious alleles are abundant. Annu. Rev. Ecol. Evol. 43, 115–135
38. Li, H. and Durbin, R. (2011) Inference of human population history
from individual whole-genome sequences. Nature 475, 493–496
39. Rasmussen, M.D. et al. (2014) Genome-wide inference of ancestral recombination graphs. PLoS Genet. 10, e1004342
15. Lynch, M. et al. (2006) Mutation pressure and the evolution of
organelle genomic architecture. Science 311, 1727–1730
40. Schiffels, S. and Durbin, R. (2014) Inferring human population size
and separation history from multiple genome sequences. Nat.
Genet. 46, 919–925
16. Lu, J. et al. (2006) The accumulation of deleterious mutations in
rice genomes: a hypothesis on the cost of domestication. Trends
Genet. 22, 126–131
41. Sheehan, S. et al. (2013) Estimating variable effective population
sizes from multiple genomes: a sequentially Markov conditional
sampling distribution approach. Genetics 194, 647–662
17. Renaut, S. and Rieseberg, L.H. (2015) The accumulation of deleterious mutations as a consequence of domestication and
improvement in sunflowers and other Compositae crops. Mol.
Biol. Evol. 32, 2273–2283
42. Conrad, D.F. et al. (2011) Variation in genome-wide mutation rates
within and between human families. Nat. Genet. 43, 712–714
18. Marsden, C.D. et al. (2016) Bottlenecks and selective sweeps
during domestication have increased deleterious genetic variation
in dogs. Proc. Natl. Acad. Sci. U.S.A. 113, 152–157
44. Gravel, S. (2014) When is selection effective? bioRxiv Published
online October 30, 2014. http://dx.doi.org/10.1101/010934
19. Schubert, M. et al. (2014) Prehistoric genomes reveal the genetic
foundation and cost of horse domestication. Proc. Natl. Acad. Sci.
U.S.A. 111, E5661–E5669
20. Laberge, A.M. et al. (2005) Population history and its impact on
medical genetics in Quebec. Clin. Genet. 68, 287–301
21. Polvi, A. et al. (2013) The Finnish Disease Heritage Database
(FinDis) update – a database for the genes mutated in the Finnish
disease heritage brought to the next-generation sequencing era.
Hum. Mutat. 34, 1458–1466
22. Casals, F. et al. (2013) Whole-exome sequencing reveals a rapid
change in the frequency of rare functional variants in a founding
population of humans. PLoS Genet. 9, e1003845
23. Lim, E.T. et al. (2014) Distribution and medical impact of loss-offunction variants in the Finnish founder population. PLoS Genet.
10, e1004494
24. Lohmueller, K.E. (2014) The distribution of deleterious genetic
variation in human populations. Curr. Opin. Genet. Dev. 29,
139–146
25. Lohmueller, K.E. et al. (2008) Proportionally more deleterious
genetic variation in European than in African populations. Nature
451, 994–995
26. Peischl, S. et al. (2013) On the accumulation of deleterious mutations during range expansions. Mol. Ecol. 22, 5972–5982
27. Arunkumar, R. et al. (2015) The evolution of selfing is accompanied
by reduced efficacy of selection and purging of deleterious mutations. Genetics 199, 817–829
28. Brandvain, Y. et al. (2014) Speciation and introgression between
Mimulus nasutus and Mimulus guttatus. PLoS Genet. 10,
e1004410
29. Hollister, J.D. et al. (2015) Recurrent loss of sex is associated with
accumulation of deleterious mutations in Oenothera. Mol. Biol.
Evol. 32, 896–905
30. Slotte, T. et al. (2013) The Capsella rubella genome and the
genomic consequences of rapid mating system evolution. Nat.
Genet. 45, 831–835
43. Kong, A. et al. (2012) Rate of de novo mutations and the importance of father's age to disease risk. Nature 488, 471–475
45. Peischl, S. and Excoffier, L. (2015) Expansion load: recessive
mutations and the role of standing genetic variation. Mol. Ecol.
24, 2084–2094
46. Kirkpatrick, M. and Jarne, P. (2000) The effects of a bottleneck
on inbreeding depression and the genetic load. Am. Nat. 155,
154–167
47. Balick, D.J. et al. (2015) Dominance of deleterious alleles controls
the response to a population bottleneck. PLoS Genet. 11,
e1005436
48. Jensen, J.D. and Bachtrog, D. (2011) Characterizing the influence
of effective population size on the rate of adaptation: Gillespie's
Darwin domain. Genome Biol. Evol. 3, 687–701
49. Song, Y.S. and Steinrucken, M. (2012) A simple method for finding
explicit analytic transition densities of diffusion processes with
general diploid selection. Genetics 190, 1117–1129
50. Agrawal, A.F. and Whitlock, M.C. (2011) Inferences about the
distribution of dominance drawn from yeast gene knockout data.
Genetics 187, 553–566
51. Henn, B.M. et al. (2015) Estimating the mutation load in human
genomes. Nat. Rev. Genet. 16, 333–343
52. Henn, B.M. et al. (2016) Distance from sub-Saharan Africa predicts mutational load in diverse human genomes. Proc. Natl. Acad.
Sci. U.S.A. 113, E440–E449
53. Sousa, V. et al. (2014) Impact of range expansions on current
human genomic diversity. Curr. Opin. Genet. Dev. 29, 22–30
54. Fu, W.Q. et al. (2013) Analysis of 6,515 exomes reveals the
recent origin of most human protein-coding variants. Nature
493, 216–220
55. Tennessen, J.A. et al. (2012) Evolution and functional impact of
rare coding variation from deep sequencing of human exomes.
Science 337, 64–69
56. Fu, W. et al. (2014) Characteristics of neutral and deleterious
protein-coding variation among individuals and populations.
Am. J. Hum. Genet. 95, 421–436
57. Chun, S. and Fay, J.C. (2009) Identification of deleterious mutations within three human genomes. Genome Res. 19, 1553–1561
Trends in Genetics, Month Year, Vol. xx, No. yy
9
TIGS 1258 No. of Pages 10
58. Davydov, E.V. et al. (2010) Identifying a high fraction of the human
genome to be under selective constraint using GERP++. PLoS
Comput. Biol. 6, e1001025
76. Gossmann, T. et al. (2014) Highly variable recombinational landscape modulates efficacy of natural selection in birds. Genome
Biol. Evol. 6, 2061–2075
59. Ng, P.C. and Henikoff, S. (2003) SIFT: predicting amino acid
changes that affect protein function. Nucleic Acids Res. 31,
3812–3814
77. Campos, J.L. et al. (2014) The relation between recombination
rate and patterns of molecular evolution and variation in Drosophila
melanogaster. Mol. Biol. Evol. 31, 1010–1028
60. Stone, E.A. and Sidow, A. (2005) Physicochemical constraint
violation by missense substitutions mediates impairment of protein
function and disease severity. Genome Res. 15, 978–986
78. Burgarella, C. et al. (2015) Molecular evolution of freshwater snails
with contrasting mating systems. Mol. Biol. Evol. 32, 2403–2416
61. Glemin, S. and Galtier, N. (2012) Genome evolution in outcrossing
versus selfing versus asexual species. Methods Mol. Biol. 855,
311–335
62. Keightley, P.D. and Eyre-Walker, A. (2007) Joint inference of the
distribution of fitness effects of deleterious mutations and population demography based on nucleotide polymorphism frequencies.
Genetics 177, 2251–2261
63. Boyko, A.R. et al. (2008) Assessing the evolutionary impact of
amino acid mutations in the human genome. PLoS Genet. 4,
e1000083
64. Messer, P.W. and Petrov, D.A. (2013) Frequent adaptation and the
McDonald–Kreitman test. Proc. Natl. Acad. Sci. U.S.A. 110,
8615–8620
65. Douglas, G.M. et al. (2015) Hybrid origins and the earliest
stages of diploidization in the highly successful recent polyploid
Capsella bursa-pastoris. Proc. Natl. Acad. Sci. U.S.A. 112,
2806–2811
66. Harris, K. and Nielsen, R. (2015) The genetic cost of Neanderthal
introgression. bioRxiv Published online October 30, 2015. http://
dx.doi.org/10.1101/030387
67. Juric, I. et al. (2015) The strength of selection against Neanderthal
introgression. bioRxiv Published online November 21, 2015.
http://dx.doi.org/10.1101/030148
68. Akashi, H. et al. (2012) Weak selection and protein evolution.
Genetics 192, 15–31
69. Koenig, D. et al. (2013) Comparative transcriptomics reveals patterns of selection in domesticated and wild tomato. Proc. Natl.
Acad. Sci. U.S.A. 110, E2655–E2662
70. Nabholz, B. et al. (2014) Transcriptome population genomics
reveals severe bottleneck and domestication cost in the African
rice (Oryza glaberrima). Mol. Ecol. 23, 2210–2227
71. Cao, J. et al. (2011) Whole-genome sequencing of multiple Arabidopsis thaliana populations. Nat. Genet. 43, 956–963
72. Elyashiv, E. et al. (2010) Shifts in the intensity of purifying selection:
an analysis of genome-wide polymorphism data from two closely
related yeast species. Genome Res. 20, 1558–1573
79. Brandvain, Y. et al. (2013) Genomic identification of founding
haplotypes reveals the history of the selfing species, Capsella
rubella. PLoS Genet. 9, e1003754
80. Qiu, S. et al. (2011) Reduced efficacy of natural selection on codon
usage bias in selfing Arabidopsis and Capsella species. Genome
Biol. Evol. 3, 868–880
81. Pellino, M. et al. (2013) Asexual genome evolution in the apomictic
Ranunculus auricomus complex: examining the effects of hybridization and mutation accumulation. Mol. Ecol. 22, 5908–5921
82. Mank, J.E. et al. (2010) Effective population size and the Faster-X
effect: empirical results and their interpretation. Evolution 64,
663–674
83. Edmonds, C.A. et al. (2004) Mutations arising in the wave front
of an expanding population. Proc. Natl. Acad. Sci. U.S.A. 101,
975–979
84. Excoffier, L. and Ray, N. (2008) Surfing during population expansions promotes genetic revolutions and structuration. Trends Ecol.
Evol. 23, 347–351
85. Hallatschek, O. and Nelson, D.R. (2010) Life at the front of an
expanding population. Evolution 64, 193–206
86. Klopfstein, S. et al. (2006) The fate of mutations surfing on the
wave of a range expansion. Mol. Biol. Evol. 23, 482–490
87. Lehe, R. et al. (2012) The rate of beneficial mutations surfing on the
wave of a range expansion. PLoS Comput. Biol. 8, e1002447
88. Moreau, C. et al. (2011) Deep human genealogies reveal a selective advantage to be on an expanding wave front. Science 334,
1148–1150
89. Slatkin, M. and Excoffier, L. (2012) Serial founder effects during
range expansion: a spatial analog of genetic drift. Genetics 191,
171–181
90. Travis, J.M.J. et al. (2007) Deleterious mutations can surf to high
densities on the wave front of an expanding population. Mol. Biol.
Evol. 24, 2334–2343
91. Eyre-Walker, A. and Keightley, P.D. (2007) The distribution of
fitness effects of new mutations. Nat. Rev. Genet. 8, 610–618
73. Hill, W.G. and Robertson, A. (1966) The effect of linkage on limits
to artificial selection. Genet. Res. 8, 269–294
92. Eyre-Walker, A. and Keightley, P.D. (2009) Estimating the rate of
adaptive molecular evolution in the presence of slightly deleterious mutations and population size change. Mol. Biol. Evol. 26,
2097–2108
74. Hussin, J.G. et al. (2015) Recombination affects accumulation of
damaging and disease-associated mutations in human populations. Nat. Genet. 47, 400–404
93. Eyre-Walker, A. et al. (2006) The distribution of fitness effects of
new deleterious amino acid mutations in humans. Genetics 173,
891–900
75. Bullaughey, K. et al. (2008) No effect of recombination on
the efficacy of natural selection in primates. Genome Res. 18,
544–554
94. Kousathanas, A. and Keightley, P.D. (2013) A comparison of
models to infer the distribution of fitness effects of new mutations.
Genetics 193, 1197–1208
10
Trends in Genetics, Month Year, Vol. xx, No. yy