Download Population Genetics 2: Linkage disequilibrium Consider two loci and

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nutriepigenomics wikipedia , lookup

Gene desert wikipedia , lookup

RNA-Seq wikipedia , lookup

Inbreeding avoidance wikipedia , lookup

Ridge (biology) wikipedia , lookup

Tag SNP wikipedia , lookup

History of genetic engineering wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Genome evolution wikipedia , lookup

Polyploid wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Koinophilia wikipedia , lookup

Gene wikipedia , lookup

Genomic imprinting wikipedia , lookup

Gene expression profiling wikipedia , lookup

Human genetic variation wikipedia , lookup

X-inactivation wikipedia , lookup

Polymorphism (biology) wikipedia , lookup

Dominance (genetics) wikipedia , lookup

Designer baby wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Genome (book) wikipedia , lookup

Genetic drift wikipedia , lookup

Gene expression programming wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Population genetics wikipedia , lookup

Microevolution wikipedia , lookup

Hardy–Weinberg principle wikipedia , lookup

Transcript
Population Genetics 2:
Linkage disequilibrium
Consider two loci and 1 generation of random mating:
A gene: AA, Aa, and aa
B gene: BB, Bb, and bb
Genotype frequencies in a population
A gene
B gene
2
2
fAA = p
fAa = 2pq
2
faa = q
fBB = x
fBb = 2xy
2
fbb = y
p+q=1
x+y=1
Random association of alleles at a single locus: HWE
What about random association of alleles at different loci after
random mating?
1
Consider two loci and 1 generation of random mating:
Random association in gametes
Alleles at B locus
Alleles at A locus
A(p)
a(q)
B
(x)
AB
(px)
aB
(qx)
b
(y)
Ab
(py)
ab
(qy)
remember: p + q =1 and x + y = 1
Random association of alleles at a different locus:
LINKAGE EQUILIBRIUM
GAMETIC PHASE EQUILIBRIUM
Consider two loci and 1 generation of random mating:
Surprisingly common result:
Gene A: HWE
Gene B: HWE
Gene A + Gene B: disequilibrium
2
Consider two loci (on different chromosomes) and 1 generation of
random mating:
Example:
Population 1: 100% AABB
Population 2: 100% aabb
Mix populations equally:
50% AABB + 50% aabb
1 generation of random mating (only three matings possible) :
AABB x AABB = AABB
aabb x aabb = aabb
AABB x aabb = AaBb
Nine genotypes are possible:
AaBB
aaBB
aaBb
AABb
AAbb
Aabb
AABB
aabb
AaBb
We only see 1/3 after 1
generation of random mating!
They did not reach equilibrium after one generation of random mating.
With continued random mating the “missing” genotypes would appear, but not
immediately at their equilibrium frequencies!
Consider two loci and 1 generation of random mating:
- attainment of linkage equilibrium is gradual
- about 50% of disequilibrium “breaks down” per generation
- linkage disequilibrium (LD) persists in populations for many generations
- LD = gametic phase disequilibrium
3
LD in individuals (BIOL 2030 stuff):
We need a new symbolism
Case 1: AB gamete + ab gamete = AaBb
Case 2: Ab gamete + aB gamete = AaBb
New symbolism:
AB/ab
indicates the union of AB gamete + ab gamete
LD in individuals:
Let’s take an AB/ab individual as an
example:
Physical linkage:
A B
What types of gametes can the AB/ab make?
a b
By the way, lets assume physical linkage.
(1)  AB
(2)  ab
(3) Ab
(4) aB
Notation = AB/ab
Parental or non-recombinant gametes
Non-parental or recombinant gametes
4
Review meiosis in a single meiocyte:
Gametes
Notation:
Parental (non-recombinant): A B
Parental
configuration
Recombinant: a B
Recombinant: A b
Parental (non-recombinant): a b
1 crossing over event: 50% parental and 50% recombinant !!!
LD in individuals:
When genes are on the same chromosome:
Physical linkage:
A B
fAB = fab ≥ fAb = faB
a b
f (non-recombinant) ≥ f (recombinant)
Notation = AB/ab
Recombination fraction (r) is the proportion of recombinant gametes
produced by an individual.
When r = 0: fAB + fab = 100%
[fAb + faB = 0%]
When r = 0.5: fAB + fab = 50% [fAb + faB = 50%]
5
From:
iGenetics
P. J. Russell (2002)
page 350
LD in individuals:
When genes are on different chromosomes:
Un linked genes:
A
fAB = fab = fAb = faB
f (non-recombinant) = f (recombinant)
a
B
b
r = 0.5
•  when genes are on different chromosomes
•  when genes are on same chromosome and recombination is high
enough for independent assortment
6
LD in individuals:
Individual AB/ab produces the following:
(1) AB: fAB = 0.38
(2) ab: fab = 0.38
(3) Ab: fAb = 0.12
(4) aB: faB = 0.12
r = 0.12 + 0.12 = 0.24
What do we expect for individuals if allelic association is random?
What do we expect in a population if allelic association is random?
LD in populations:
Random association in gametes
Alleles at B locus
Alleles at A locus
A(p)
a(q)
B
(x)
AB
(px)
aB
(qx)
b
(y)
Ab
(py)
ab
(qy)
remember: p + q =1 and x + y = 1
fAB =px
fab = qy
fAb = py
faB =qx
fAB + fab + fAb + faB = 1
7
LD in populations:
The frequency of an AB gamete in a population has two sources:
A B
AB
•  Some individuals have this configuration: non-recombinant [parental]
•  Some individuals produce this as a recombinant configuration.
LD in populations:
f AB
'
- recombinants
recombinants
non

 

= (1 − r )
f AB
+ (
r)
px



frequency of AB
probability of
no recombination gametes in last
generation
prob of prob of putting
recomb together A and B
at random from
recombinants
8
'
f AB =
- recombinants
recombinants
non

 

(1 − r )
f AB
+ (
r)
px



frequency of AB
probability of
no recombination gametes in last
generation
Obs. – exp.
prob of prob of putting
recomb together A and B
at random from
recombinants
'
f AB − px = (1 − r )( f AB − px )
D = (1 − r )( f AB − px) = the linkage disequilibrium parameter
fAB = px + D
In excess due to LD
fab = qy + D
fAb = py - D
faB =qx - D
Deficient due to LD
Remember:
Individual AB/ab produces the following:
(1) AB: fAB = 0.38
(2) ab: fab = 0.38
(3) Ab: fAb = 0.12
(4) aB: faB = 0.12
r = 0.12 + 0.12 = 0.24
What do we expect for an individuals if association is random?
What do we expect in a population if association is random?
9
Forces that increase D in populations:
1. Migration
2. Natural selection
3. Genetic Drift
LD in populations:
•  Comparing D among populations is difficult.
•  Standardize D as a fraction of the theoretical maximum for the popn
D
Dmax
D = f AB × f ab − f Ab × f aB






excess due to LD
deficient due to LD
Dmax = qx or py (whichever is smaller)
10
LD in a population:
Example: blood group polymorphism in a sample of 1000 British people
MN blood group
Gamete frequencies
expected
fM = p = 0.5425
MS = 474/2000 = 0.2370 (+)
px = 0.1671
fn = q = 0.4575
Ms = 611/2000 = 0.3055 (-)
py = 0.3751
nS = 142/2000 = 0.0710 (-)
qx = 0.1409
ns = 773/2000 = 0.3865 (+)
qy = 0.3166
Ss blood group
D = ef
×
ef
× ef
MS
ns − ef
Ms
nS




non recombinant
recombinant
Dmax = qx or py
€
fS = x = 0.3080
fs = y = 0.6920
D = (0.2370)(0.3865) – (0.3055)(0.0710) = 0.07
Dmax: qx = 0.14 or py = 0.37; so Dmax = 0.14
D is (0.07/0.14)*100 = 50% of the theoretical maximum
Homework:
Genotype counts in the population
Gametes
MN locus
Ss locus
MS = 474
MM = 298
MN = 489
NN = 213
SS = 483
Ss = 418
ss = 99
NS = 142
Ms = 611
Ns = 773
Use chi-square test to:
1. 
determine if each locus is in HWE
2. 
determine if gamete frequencies are in equilibrium
11
Recombination reduces LD:
Rate of decay of LD under various recombination rates
Standardized disequilibrium
D/Dmax
1
0.9
r= 0.001
0.8
0.7
0.6
r= 0.01
0.5
0.4
0.3
0.2
r= 0.1
0.1
r= 0.5
0
1
9
17
25
33
41
49
57
65
73
81
89
97
generations
Recombination reduces LD:
“Hitchhiking” of a mutator gene with and without recombination
No recombination
r=0
Recombination
r = 0.5
Mutator allele that increase the mutation rate
Beneficial allele subject to strong positive selection
Adapted from Sniegowski et al. (2000) BioEssays 22:1057-1066.
12
Mapping disease genes:
1.  Family studies
• 
Uses family pedigrees
• 
Co-segregation of disease and a marker on the pedigree
2.  Allelic association studies (LD mapping)
• 
Uses population data
• 
Relies on strong LD only among closely linked loci
• 
Sample affected and unaffected individuals
• 
Very large samples are required!
• 
Look for markers with more LD in affected individuals
Both approached have powers and pitfalls
Mapping disease genes:
r = 0.01 (1 centiMorgan) is about 1million bp in humans
13
female-limited Batesian mimicry
5-locus linkage group
(supergene):
§  Tail length
§  Hind-wing pattern
§  Forewing pattern
§  Epaulet color
§  Body color
Selection:
Bitter tasting
Only certain complex color
morphs provide gains in
fitness
Tasty mimics
(Papilio memnon
females)
LD:
Few maladaptive patterns
produced per generation due
to linkage
male
Class I
Class III
Class II
The MHC locus on human chromosome 6
LD over 10s-100s million of bp due
to selection for combinations of loci
14
Sexual reproduction reduces LD
Linage disequilibrium:
Keynotes
•
Attainment of equilibrium at different loci is gradual; > 1 generation of random mating.
•
Physical linkage slows the rate to equilibrium even more!
•
“r” determines the rate to equilibrium, the lower the fraction, the longer to equilibrium.
•
When r = 0.5 the loci are said to be un-linked; such loci are very far apart on the same chromosome, or
in different chromosomes. When r < 0.5 the genes are said to be linked. When r =0 the loci are in
permanent disequilibrium.
•
Disequilibrium can arise from sources other than linkage:
o
o
o
o
•
Admixture of populations
Natural selection acting on one or more of the loci
Inbreeding in plants that regularly undergo self-fertilization
Genes located in a chromosomal inversion (SUPERGENE)
The term LINKAGE DISEQUILIBRIUM is used to describe any source of disequilibrium, regardless of whether
the two genes are physically linked or not.
15