Download lecture 01 - sources of variation - Cal State LA

Document related concepts

Hardy–Weinberg principle wikipedia , lookup

Polymorphism (biology) wikipedia , lookup

Genetic testing wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Cell-free fetal DNA wikipedia , lookup

Public health genomics wikipedia , lookup

Nutriepigenomics wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

RNA-Seq wikipedia , lookup

Mutagen wikipedia , lookup

Adaptive evolution in the human genome wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Non-coding DNA wikipedia , lookup

Gene expression programming wikipedia , lookup

Oncogenomics wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Genome evolution wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Gene wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Genetic engineering wikipedia , lookup

Microsatellite wikipedia , lookup

Dominance (genetics) wikipedia , lookup

Genetic code wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Designer baby wikipedia , lookup

Genome (book) wikipedia , lookup

History of genetic engineering wikipedia , lookup

Human genetic variation wikipedia , lookup

Genetic drift wikipedia , lookup

Koinophilia wikipedia , lookup

Epistasis wikipedia , lookup

Frameshift mutation wikipedia , lookup

Mutation wikipedia , lookup

Population genetics wikipedia , lookup

Point mutation wikipedia , lookup

Microevolution wikipedia , lookup

Transcript
Biol 418: Advanced Evolution
Course website:
instructional1.calstatela.edu/pkrug
NO WWW at beginning!
click on “Bio 418” to get powerpoint files, study guides, syllabus
Pre-requisite: Evolution (Biol 350)
- undergraduate students without this pre-req will be
dropped without notice
Course Overview – BIOL 418
Co-taught by Drs. Krug and Ellingson
[email protected]
[email protected]
one midterm exam
one final exam
four in-class exercises done (a) at home, then
(b) in small groups during class, based on
reading the primary scientific literature
Evolution: genetic change over time
Things to review/remember:
- populations evolve; individuals do not
- all evolution does not produce adaptation
- much evolution results from genetic drift (random change)
- change can be mal-adaptive (bad)
- the raw material of evolution is genetic variation present
in a population
- without underlying variation, a population may not be
capable of evolving
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
1) What alleles are present (variants of a given gene)?
2) How are those alleles arranged into genotypes?
3) How do (i) allele combinations, and (ii) rearing environment
together result in the phenotype?
 The distribution of alleles, genotypes and phenotypes in
a population are then acted on by (i) genetic drift, and
(ii) various forms of selection, resulting in evolution
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
AA
Aa
aa
aa Aa AA
- this population is
polymorphic at this
genetic locus
(= has more than one allele)
allele frequencies can change
AA
AA
AA
AA
AA
AA
- this population is fixed for
the “A” allele at this locus
no evolution is possible
at this locus until new
alleles enter population
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
- how are those alleles arranged into genotypes?
AA
Aa
Aa
aa Aa aa
Aa AA
Aa
Aa
Aa
aa Aa AA
Aa Aa
AA AA
AA
aa
aa
aa aa
AA
3 different populations with same allele frequencies:
A = a = 0.5
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
- how are those alleles arranged into genotypes?
AA
Aa
Aa
aa Aa aa
Aa AA
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
- how are those alleles arranged into genotypes?
AA
Aa
Aa
aa Aa aa
Aa AA
At HW equilibrium, genotype frequencies
can be calculated from allele frequencies
p = frequency of A allele = 8/16 = 0.5
q = frequency of a allele = 8/16 = 0.5
Hardy-Weinberg
equilibrium
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
- how are those alleles arranged into genotypes?
AA
Aa
Aa
aa Aa aa
Aa AA
At HW equilibrium, genotype frequencies
can be calculated from allele frequencies
p2 + 2pq + q2 = 1
(0.5)2 + 2(0.5)(0.5) + (0.5)2 = 1
Hardy-Weinberg
equilibrium
0.25 + 0.5 + 0.25 = 1
AA
Aa
aa
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
- how are those alleles arranged into genotypes?
AA
Aa
Aa
aa Aa aa
Aa AA
Aa
Aa
Aa
aa Aa AA
Aa Aa
What is unusual in this population?
What could have caused it?
AA AA
AA
aa
aa
aa aa
AA
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
- how are those alleles arranged into genotypes?
AA
Aa
Aa
aa Aa aa
Aa AA
Aa
Aa
Aa
aa Aa AA
Aa Aa
AA AA
AA
aa
aa
aa aa
AA
What is unusual in this population? - too many heterozygotes
What could have caused it?
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
- how are those alleles arranged into genotypes?
AA
Aa
Aa
aa Aa aa
Aa AA
Aa
Aa
Aa
aa Aa AA
Aa Aa
AA AA
AA
aa
aa
aa aa
AA
What is unusual in this population? - too many heterozygotes
What could have caused it? - overdominance (selection)
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
- how are those alleles arranged into genotypes?
AA
Aa
Aa
aa Aa aa
Aa AA
Aa
Aa
Aa
aa Aa AA
Aa Aa
What is unusual in this population?
What could have caused it?
AA AA
AA
aa
aa
aa aa
AA
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
- how are those alleles arranged into genotypes?
AA
Aa
Aa
aa Aa aa
Aa AA
Aa
Aa
Aa
aa Aa AA
Aa Aa
AA AA
AA
aa
aa
aa aa
AA
What is unusual in this population? - too many homozygotes
What could have caused it?
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
- what alleles are present (variants of a given gene)?
- how are those alleles arranged into genotypes?
AA
Aa
Aa
aa Aa aa
Aa AA
Aa
Aa
Aa
aa Aa AA
Aa Aa
AA AA
AA
aa
aa
aa aa
AA
What is unusual in this population? - too many homozygotes
What could have caused it? - inbreeding, underdominance,
migration
Evolution: genetic change over time
The raw material of evolution is genetic variation, including:
1) What alleles are present (variants of a given gene)?
2) How are those alleles arranged into genotypes?
3) How do (i) allele combinations, and (ii) the environment,
together result in the phenotype?
- Is a trait highly heritable, or mostly due to environment?
- How many loci contribute to a given trait – 3, 300, 3000?
- Is a trait phenotypically plastic? (adjustable on the fly)
Sources of genetic variation
What are the sources of genetic variation in natural populations?
1) mutation
2) gene flow (dispersal between populations)
3) sexual reproduction
4) hybridization with related species
Sources of genetic variation 1: mutation
Mutation is the ultimate source of new genetic variation
Mutation rate = rate at which changes in the DNA actually occur
Substitution rate = rate at which changes appear and persist
long enough for us to measure them, by sequencing DNA
from multiple individuals in a population
Subtle but important distinction: Many, perhaps most, mutations
are bad for your fitness (or even lethal). These mutations tend
to get you killed, or to kill you. We won’t “see” these mutations
when we sequence DNA from a bunch of individuals, because
those really bad mutants are already dead.
Substitution rate is what we can easily measure; it is much lower
than the actual mutation rate.
Two kinds of DNA substitutions
Transition mutations are observed more frequently, because
they are not fixed as often by DNA repair enzymes
 that is, they escape correction so we see them more often
Sources of genetic variation 1: mutation
Some mutations are harder for DNA repair enzymes to detect,
so escape correction and thus occur more often
Transitions: A
G, C
T
- replace a purine w/ a purine
or pyrimidine w/ pyrimidine
This class of mutations is observed more frequently, because
they are not fixed as often by DNA repair enzymes
Transversion: everything else; more rare
Sources of genetic variation 1: mutation
Transversions are usually caught & corrected by DNA repair
enzymes, so get detected less often in sequence data
inserted
guanine
Transversions stick two
non-complimentary bases
up against each other
 disrupts DNA helix,
making bulge
 more likely to get noticed
& fixed by repair enzymes
Sources of variation 1: mutation
1) transitions are more common than transversions
2) substitutions that (i) occur in non-coding regions of DNA, or
(ii) result in synonymous amino acid substitutions, are
more likely to persist in natural populations
 because most changes to the amino acid sequence of any
protein are likely to make the protein worse, not better, they
are typically removed by selection before we can see them
 thus, the substitution rate is lower than the actual mutation
rate; many mutations disappear before we have a chance
to measure them
Sources of variation 1: mutation
1) transitions are more common than transversions
2) substitutions that (i) occur in non-coding regions of DNA, or
(ii) result in synonymous amino acid substitutions, are
more likely to persist in natural populations
i) mutations in non-coding DNA do not usually affect the
phenotype, so are termed ‘silent’
ii) synonymous substitutions change one codon to another
for the same amino acid, thus do not change the protein
- usually occur at the 3rd codon position, sometimes 1st
Sources of variation 1: mutation
Sources of variation 1: mutation
1) transitions are more common than transversions
2) substitutions that (i) occur in non-coding regions of DNA, or
(ii) result in synonymous amino acid substitutions, are
more likely to persist in natural populations
i) mutations in non-coding DNA do not usually affect the
phenotype, so are termed ‘silent’
ii) synonymous substitutions change one codon to another
for the same amino acid, thus do not change the protein
iii) non-synonymous substitutions change the amino acid at
a given position, thus changing the protein and potentially
the phenotype
Sources of variation 1: mutation
non-synonymous substitutions tend to be less commonly
observed than synonymous substitutions, but that doesn’t
mean they occur less often
 in fact, they must occur more often; why?
 if they occur more often, why are non-synonymous mutations
only rarely observed in DNA sequences?
DNA sequence alignment
E_sp19
Edio_2
Edio_4
Edio_3
Edio_1
04Pan01
04Pan04
04Pan05
04Pan07
04Pan09
04Pan10
04Pan16
06Jam01
06Jam02
06Jam09
06Jam12
06Dom02
06Dom03
06Dom04
06Dom05
06Dom10
07Gei01
07Gei02
07Gei03
07Gei05
07Gei08
07Swe01
07Swe02
07Swe04
07Swe09
07Swe10
07LSS01
07LSS02
07LSS04
10NEx01
10NEx02
10NEx03
10NEx05
TGGTCTAGTCGGAACTGGTTTAAGATTATTAATTCGATTTGAATTAGGAACTTCTGGTGCTTTCCTGGGTGATGATCATTTCTACAATGTTATTGTTAC
...CT....A..G..C..A.....G..................C....T..............TT.A..............T..T.....C........
....T....A..G..C..A.....G...C..............C.T..T..............TT.A..............T..T..............
....T....A..G..C..A.....G...C..............C.T..T..............TT.A..............T..T..............
....T....A..G..C..A.....G...C..............C.T..T..............TT.A..............T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.T...........C........G...........C..TT.A........C..C.....T..............
....T....T..G............C.C....................G...........C..TT.A.....C..C..C..T..T..............
....T....T..G............C.C....................G...........C..TT.A.....C..C..C..T..T..............
....T....T..G............C.C....................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.T...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.T..............GC....G...........C..TT.A........C.....T..T..............
....T.......T............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T....T..G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......T............C.CC.......G...........G........C.....TT.A........C..C..T..T..............
...CT.......T............C.CC................G..G...........C..TT.A........C..C.....T..............
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......G............C.C...............C....G...........C..TT.A........C..C..T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C..C..T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C..C..T..T..............
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
DNA sequence alignment
E_sp19
Edio_2
Edio_4
Edio_3
Edio_1
04Pan01
04Pan04
04Pan05
04Pan07
04Pan09
04Pan10
04Pan16
06Jam01
06Jam02
06Jam09
06Jam12
06Dom02
06Dom03
06Dom04
06Dom05
06Dom10
07Gei01
07Gei02
07Gei03
07Gei05
07Gei08
07Swe01
07Swe02
07Swe04
07Swe09
07Swe10
07LSS01
07LSS02
07LSS04
10NEx01
10NEx02
10NEx03
10NEx05
TGGTCTAGTCGGAACTGGTTTAAGATTATTAATTCGATTTGAATTAGGAACTTCTGGTGCTTTCCTGGGTGATGATCATTTCTACAATGTTATTGTTAC
...CT....A..G..C..A.....G..................C....T..............TT.A..............T..T.....C........
....T....A..G..C..A.....G...C..............C.T..T..............TT.A..............T..T..............
....T....A..G..C..A.....G...C..............C.T..T..............TT.A..............T..T..............
....T....A..G..C..A.....G...C..............C.T..T..............TT.A..............T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.T...........C........G...........C..TT.A........C..C.....T..............
....T....T..G............C.C....................G...........C..TT.A.....C..C..C..T..T..............
....T....T..G............C.C....................G...........C..TT.A.....C..C..C..T..T..............
....T....T..G............C.C....................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.T...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.T..............GC....G...........C..TT.A........C.....T..T..............
....T.......T............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T....T..G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......T............C.CC.......G...........G........C.....TT.A........C..C..T..T..............
...CT.......T............C.CC................G..G...........C..TT.A........C..C.....T..............
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......G............C.C...............C....G...........C..TT.A........C..C..T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C..C..T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C..C..T..T..............
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
1) Most observed substitutions in real data
are transitions (A G, C T)
2) changes at 3rd codon position > 1st >> 2nd
Sources of variation 1: mutation
1) transitions are more common than transversions
2) synonymous substitutions are more common than
non-synonymous changes when comparing sequences of
individuals in natural populations (for most loci)
3) non-synonymous substitutions resulting in conservative
amino acid changes are more likely to survive
- conservative = swapping one residue for another of the
same size, charge, and/or polarity
- for the same reasons as discussed previously, such
changes are less likely to drastically alter protein function,
and so are less likely to be removed promptly by selection
Sources of variation 1: mutation
3) conservative amino acid changes are more likely to survive
- swapping non-polar for another non-polar amino acid
- swapping a polar for another polar residue
DNA sequence alignment
E_sp19
Edio_2
Edio_4
Edio_3
Edio_1
04Pan01
04Pan04
04Pan05
04Pan07
04Pan09
04Pan10
04Pan16
06Jam01
06Jam02
06Jam09
06Jam12
06Dom02
06Dom03
06Dom04
06Dom05
06Dom10
07Gei01
07Gei02
07Gei03
07Gei05
07Gei08
07Swe01
07Swe02
07Swe04
07Swe09
07Swe10
07LSS01
07LSS02
07LSS04
10NEx01
10NEx02
10NEx03
10NEx05
TGGTCTAGTCGGAACTGGTTTAAGATTATTAATTCGATTTGAATTAGGAACTTCTGGTGCTTTCCTGGGTGATGATCATTTCTACAATGTTATTGTTAC
...CT....A..G..C..A.....G..................C....T..............TT.A..............T..T.....C........
....T....A..G..C..A.....G...C..............C.T..T..............TT.A..............T..T..............
....T....A..G..C..A.....G...C..............C.T..T..............TT.A..............T..T..............
....T....A..G..C..A.....G...C..............C.T..T..............TT.A..............T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C........T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.T...........C........G...........C..TT.A........C..C.....T..............
....T....T..G............C.C....................G...........C..TT.A.....C..C..C..T..T..............
....T....T..G............C.C....................G...........C..TT.A.....C..C..C..T..T..............
....T....T..G............C.C....................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.T...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.T..............GC....G...........C..TT.A........C.....T..T..............
....T.......T............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......G............C.TC...................G...........C..TT.A.....C..C..C..T..T..............
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T....T..G............C.C...............C....G...........C..TT.A........C.....T..T..............
....T.......T............C.CC.......G...........G........C.....TT.A........C..C..T..T..............
...CT.......T............C.CC................G..G...........C..TT.A........C..C.....T..............
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......G............C.C...............C....G...........C..TT.A........C..C..T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C..C..T..T..............
....T.......G............C.C...............C....G...........C..TT.A........C..C..T..T..............
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
....T.......T............C.TC...................G...........C..TT.A........C..C..T..T...........C..
Amino acid alignment
E_sp19
Edio_2
Edio_4
Edio_3
Edio_1
04Pan01
04Pan04
04Pan05
04Pan07
04Pan09
04Pan10
04Pan16
06Jam01
06Jam02
06Jam09
06Jam12
06Dom02
06Dom03
06Dom04
06Dom05
06Dom10
07Gei01
07Gei02
07Gei03
07Gei05
07Gei08
07Swe01
07Swe02
07Swe04
07Swe09
07Swe10
07LSS01
07LSS02
07LSS04
10NEx01
10NEx02
10NEx03
10NEx05
10NPr01
GLVGTGLSLLIRFELGTSGAFLGDDHFYNVIVTAHAFVMIFFMVMPLMIGGFGNWMVPILIGAPDMSFPRMNNMSFWLLPPSFIFLLSSSLVEGGAGTG
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
.........................................................................................N.........
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
Amino acid alignment
E_sp19
Edio_2
Edio_4
Edio_3
Edio_1
04Pan01
04Pan04
04Pan05
04Pan07
04Pan09
04Pan10
04Pan16
06Jam01
06Jam02
06Jam09
06Jam12
06Dom02
06Dom03
06Dom04
06Dom05
06Dom10
07Gei01
07Gei02
07Gei03
07Gei05
07Gei08
07Swe01
07Swe02
07Swe04
07Swe09
07Swe10
07LSS01
07LSS02
07LSS04
10NEx01
10NEx02
10NEx03
10NEx05
10NPr01
GLVGTGLSLLIRFELGTSGAFLGDDHFYNVIVTAHAFVMIFFMVMPLMIGGFGNWMVPILIGAPDMSFPRMNNMSFWLLPPSFIFLLSSSLVEGGAGTG
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
.........................................................................................N.........
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
...................................................................................................
one S  N change
serine (-OH) to
glutamine (-CONH2)
Sources of variation 1: mutation
1) transitions are more common than transversions
2) synonymous substitutions are more common than
non-synonymous changes when comparing sequences of
individuals in natural populations (for most loci)
3) conservative amino acid changes are more likely to survive
4) different positions on a protein evolve at very different rates
- surface may evolve more easily than core
- sites involved with catalysis, or contacting partner proteins
Some genes are more toleratant of substitutions (many changes
have little effect on protein structure/function); other genes
show fewer substitutions (amino acid change is usually harmful)
Mutation rates also vary among chromosomes, and between
chromosome regions, creating “local rates” of mutation
- variation in chromatin structure affects local rates
Prendergast et al. 2007
- reporter genes inserted in different places throughout the
genome in human cell lines had 60-fold difference in
mutation rate, depending on where it was inserted
Lichtenauer-Kaligis et al. 1993
Whitkopp & Kilay 2011,
Nat. Rev. Gen.
mutation rate
(per DNA site per generation)
10-9 low
ribosomal
rRNA
conservative
replicators
high 10-8
genes
individuals
immune
receptors
mutator
strains
species
C. elegans
C. briggsae
[cited rates are for animals; for bacteria, rates are 10x higher]
Mutation rates are higher for DNA passed through males than
through females, as predicted by Haldane in 1947
- more cycles of cell division are involved in spermatogenesis,
allowing for more errors in DNA replication per generation
- generally confirmed by genetic studies:
- ‘male-driven evolution’ or male-biased substitution rates
are ~5-fold higher for male primates
Makova & Li 2002, Nature
- not a function of the Y-chromosome; male mutation rate
~5x higher than female’s rate in birds, yet female birds
are the heterogametic sex (WZ, vs. WW for male birds)
Ellegren & Fridolfsson 1997, Nature Genet.
In 2011, the 1st whole-genome sequencing comparison of two
human familes (two parents, one child) found:
- 49 and 35 germline de novo mutations (in kid, not parents)
“germline” = in sperm/eggs; will be passed to offspring
- 1,586 non-germline mutations (somatic origin; non-heritable)
- in one family, 92% of germline mutations were from paternal
germline
- in 2nd family, 64% of germline mutations were from maternal
germline
Conrad et al. 2011, Nat. Gen.
 high variation in mutation rates between human families
human genome size: 3 x 109 per gamete
Mutation as a cost of sexuality
Given all these constraints on evolution, why do proteins ever
evolve at all..?
Mutation is generally understood as an evolutionary
consequence of costs of DNA proof-reading and repair,
especially for sexually-reproducing organisms
- meiosis allows sex, which creates new allele combinations,
but appears to be very hard to achieve without a certain
level of error (mistakes in copying DNA = mutation)
- we could mutate less, but at too high of a cost to be
favored by selection
Mutation via hitch-hiking in asexuals
In asexual or clonally reproducing organisms (e.g., bacteria)
mutator alleles can become linked to rare, beneficial mutations
that they create, and thus “hitch-hike” to high frequency
DNA proofreading gene
wt = wild-type
(normal)
mutation creates a sloppy allele
of proof-reading enzyme..“mutator”
wt wt
M wt
other genes
affecting fitness
wt wt wt
1) “Mutator” allele arises randomly; creates a strain that
mutates at a higher frequency
Mutation via hitch-hiking in asexuals
1) “Mutator” allele arises randomly, creates a high-mutation strain
2) Most mutations will lower fitness, and be lost...
wt wt
M wt
M wt
M wt
wt wt wt
wt wt
wt = wild-type
(normal)
= deleterious
(bad) mutation
wt
M
wt wt
wt
Mutation via hitch-hiking in asexuals
1) “Mutator” allele arises randomly, creates a high-mutation strain
2) Most mutations will lower fitness, and be lost...
wt wt
M wt
M wt
M wt
wt wt wt
wt wt
wt
M
wt wt
wt
Mutation via hitch-hiking in asexuals
1) “Mutator” allele arises randomly, creates a high-mutation strain
2) Most mutations will lower fitness, and be lost...
wt wt
M wt
wt wt
wt wt wt
wt wt wt
wt wt
wt wt
wt wt
wt wt wt
wt wt wt
wt wt wt
wt wt
wt wt wt
wt wt wt
wild-type would appear to win,
and out-reproduce Mutator strain
Mutation via hitch-hiking in asexuals
1) “Mutator” allele arises randomly, creates a high-mutation strain
2) Most mutations will lower fitness, and be lost...
wt wt
new allele that
doubles fitness
M wt
wt wt wt
M
3) However, beneficial mutations
(normally rare) will occur more
wt wt wt
often by chance in Mutator strains..
4) Now, Mutator allele of DNA proof-reading enzyme is linked
to the beneficial allele it created through ‘sloppy work’
Mutation via hitch-hiking in asexuals
1) “Mutator” allele arises randomly, creates a high-mutation strain
2) Most mutations will lower fitness, and be lost
3) However, beneficial mutations (normally rare) will occur more
often by chance in Mutator strains
4) Now, Mutator allele of DNA proof-reading enzyme is linked
to the beneficial allele it created through ‘sloppy work’
5) Wildtype proof-readers get out-competed due to lower fitness
M
M
M
M
wt wt
wt wt wt
wt wt
wt wt wt
M
wt wt wt
M
wt wt wt
wt wt wt M
wt wt wt
wt wt wt
Mutation via hitch-hiking in asexuals
1) “Mutator” allele arises randomly, creates a high-mutation strain
2) Most mutations will lower fitness, and be lost
3) However, beneficial mutations (normally rare) will occur more
often by chance in Mutator strains
4) Now, Mutator allele of DNA proof-reading enzyme is linked
to the beneficial allele it created through ‘sloppy work’
5) Wildtype proof-readers get out-competed due to lower fitness
6) This phenomenon – “genetic hitch-hiking” – occurs easily in
clonal species because recombination (sex) rarely separates
the mutator alleles, and the beneficial alleles
with which they become associated
M
(by chance) on the chromosome
wt wt wt
For most of the last century, it was assumed that differences
between species were due to different, species-specific
proteins that each had evolved
 since the genomic revolution, it has become clear that
instead, many evolutionarily important mutations affect
not the protein product itself, but gene expression
- how much is a gene expressed?
- in what tissues?
- when during development?
Whitkopp & Kilay 2011,
Nat. Rev. Gen.
Mutations affecting any of these can produce dramatic
changes in phenotype, behavior, and life history
Such mutational changes affect enhancers and promoters,
non-coding DNA that controls gene expression
A given gene may be under the control of multiple enhancers,
each one switching that gene on in a subset of tissues, or
at different times in development
- mutations in an enhancer can alter the timing or expression
pattern of a gene in one tissue type without affecting the
expression at other times or in other tissues, which will be
controlled by other enhancers
- this restricts the effects of a mutation to one tissue or stage,
whereas a change in the protein itself will affect all cells
all else being equal, a mutation in an enhancer is less
likely to kill you than a mutation in the gene that enhancer
controls (modularity)
promoters and enhancers regulate gene expression (a) in a
tissue-specific manner, that (b) can vary during development
A given gene may be under the control of multiple enhancers,
each one switching that gene on in a subset of tissues, or
at different times in development
early development, leg buds
mid-development, antennae
late development, wing buds
late development, tail
four enhancers direct tissue-specific
and timing-dependent expression of
a key gene (turn on gene where colored)
Consider a gene that triggers cell growth early in development
of body appendages; 4 different tissue-specific enhancers
turn this gene on at different times, in different places
A mutation in the protein ITSELF, causing let’s say, too much
cell growth... causes too much cell growth EVERYWHERE
Happy normal bug
Horrible freak monster; dies
A mutation in one enhancer can cause substantial changes in
phenotype without disrupting the rest of normal development
Happy normal bug
Few extra legs (possibly beneficial);
otherwise normal
 “modularity” means restricted change
 these kinds of cis-regulatory mutations are now known to
explain the bulk of phenotypic differences between species
Mutations in the enhancer HACNS1 led to a novel activity in
human evolution
 this enhancer now activates genes in the developing limbs
of humans, but does not have this activity in our closest
relatives
Prabhakar et al.
2008,Science
Loss of enhancers can lead to significant evolutionary change
- 509 regions containing possible enhancers are highly
conserved between chimps and other mammals, but
are missing from humans
- these deletions are associated with some loss of function
(sensory organs, penile spines) and gain of other function
(expanded brain capacity)
McLean et al. 2011, Nature
Transposons (“jumping genes”) can move enhancers into
novel locations, creating rapid evolutionary change
All these kinds of non-traditional mutations can produce
dramatic changes in phenotype, behavior, and life history
Another evolutionarily important type of mutation involves sites
that regulate alternative splicing of proteins
Barbosa-Morais et al. 2012, Science
exon 1
exon 2
intron
mRNA splicing
exon 3
intron
unspliced
mRNA
(removes intron sequences)
spliced
mRNA
encodes full-length
version of protein
goes to ribosome
Another evolutionarily important type of mutation involves sites
that regulate alternative splicing of proteins
Barbosa-Morais et al. 2012, Science
exon 1
exon 2
intron
exon 3
intron
unspliced
mRNA
mutations can occur at the intron-exon
border that cause a different pattern of
splicing in some tissues, leaving that
exon out of the final mRNA message
Another evolutionarily important type of mutation involves sites
that regulate alternative splicing of proteins
Barbosa-Morais et al. 2012, Science
exon 1
exon 2
intron
exon 3
intron
unspliced
mRNA
alternative
splicing
encodes full-length
version of protein
encodes shorter
version of protein with
modified function
Another evolutionarily important type of mutation involves sites
that regulate alternative splicing of proteins
- primates have more
alternative splicing
sites (AS) in their
genes than other
vertebrates
- frequency of AS
increased sharply
during the evolution
of humans in brain
and testes tissues,
compared with other
primates
Barbosa-Morais et al. 2012, Science