* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Chapter 12. Population Genetics
Neuronal ceroid lipofuscinosis wikipedia , lookup
Inbreeding avoidance wikipedia , lookup
Genetics and archaeogenetics of South Asia wikipedia , lookup
Fetal origins hypothesis wikipedia , lookup
Quantitative trait locus wikipedia , lookup
Public health genomics wikipedia , lookup
Polymorphism (biology) wikipedia , lookup
Genome-wide association study wikipedia , lookup
Population genetics wikipedia , lookup
Microevolution wikipedia , lookup
Genetic drift wikipedia , lookup
Chapter 8. Population Genetics Population : group of organisms of the same species living in the same geographical area. Population genetics is the study of the distribution of genes in populations and of the factors that maintain or change the frequency of genes and genotypes from generation to generation. Medical population genetics OUTLINE 1. Hardy-Weinberg Equilibrium 2. Factors affect H-W Equilibrium 1.Hardy-Weinberg Equilibrium important concepts Allele frequency: the proportion of a specific allele at a given locus, considering that the population may contain from one to many alleles at that locus. Genotype frequency: the proportion of a specific genotype at a given locus, considering that many different genotypes may be possible. Phenotype frequency: the proportion of individuals in a population that exhibit a given phenotype. 1. Hardy-Weinberg Equilibrium -allele frequency 1. two alleles at a gene - A and a 2.p =frequency of the A allele 3.q =frequency of the a allele 4.p + q = 1 A a p=fA= A A+a q=fa= a A+a More than two alleles at a gene 1. 2. 3. 4. let p = the frequency of the A allele let q = the frequency of the B allele, let r = the frequency of the i allele. such that p + q + r = 1 IA p=fI = A IA IA + IB + i q=fI = B IB IA + IB + i r=fi= IB i IA + IB + i i 1. Hardy-Weinberg Equilibrium -genotype frequency genotype A a genotype frequency AA Aa aa AA Aa aa AA+Aa+aa AA+Aa+aa AA+Aa+aa Let fAA=D, fAa=H, faa=R, such that D+H+R=1。 Allele A frequency p= Allele a frequency q= 2D+H 2D+2H+2R 2R+H 2D+2H+2R = 2D+H 2(D+H+R) = R+½H = D+½H In a population of 747 individuals,there were 233 type M individuals (31.2%), 129 type N individuals (17.3%), 485 type MN individuals (51.5%), calculate the allele frequencies. Let fLM=p, fLN=q p=MM+½MN=0.312+½×0.515=0.57 q=NN+½MN=0.173+½×0.515=0.43=1-p When the phenotype is equal to the genotype (esp. Codominant inheritance), the calculation of gene frequency is based on the gene counting method. In a population, 50% people were AA, 20%people were Aa, 30%people were aa, calculate the frequencies of alleles . Let fA=p, fa=q p=fAA+½ fAa=0.5+½ ×0.2=0.6 q=faa+½ fAa=0.3 +½ ×0.2=0.4=1-p=1-0.6 1. Hardy-Weinberg Equilibrium Hardy-Weinberg equilibrium: Frequencies of alleles & genotypes in an interbreeding population remain constant from generation to generation. To be in equilibrium a population must match 5 main conditions: Very large population size -to insure no sampling error from one generation to the next No migration No new mutations -or mutational equilibrium, even new mutations have little effect on allele frequencies from one generation to the next Random mating -no assortive mating or mating by genotype No natural selection -all genotypes reproduce with equal success The H-W conditions also imply that when the gene frequencies are p and q, the genotype frequencies will be p2, 2pq and q2 respectively for the dominant, the heterozygotes and the recessive in a two allele system. A(p) a(q) A(p) AA(p2) Aa(pq) Aa(pq) aa(q2) a(q) fAA=p2, fAa=2pq , faa=q2, so p2+2pq+q2=1. In a population there are 3 genotypes (AA, Aa, aa) and 6 mating types. AA AA Aa Aa aa aa Frequency of Mating types (random mating) Paternal genotypes Maternal genotypes AA(p2) Aa(2pq) aa(q2) AA (p2) AA X AA (p4) AA X Aa (2p3q) AA X aa (p2q2) Aa (2pq) Aa X AA (2p3q) Aa X Aa (4p2q2) Aa X aa (2pq3) aa (q2) aa X AA (p2q2) aa X Aa (2pq3) aa X aa (q4) offspring's genotypes frequency AA Aa aa p4 0 0 AA×AA p2•p2=p4 AA×Aa 2(p2• 2pq)=4p3q 2p3q 2p3q 0 AA×aa Aa×Aa 2(p2•q2)=2p2q2 0 2p2q2 0 2pq • 2pq=4p2q2 p2q2 2p2q2 p2q2 Aa×aa 2(2pq• q2)=4pq3 0 2pq3 2pq3 aa×aa q2•q2=q4 0 0 q4 (p+q)4=1 q2(p2+2pq+q2) p2(p2+2pq+q2) =q2 =p2 2pq(p2+2pq+q2) =2pq In a population study, 100 individuals were genotyped for A locus. There were 60 AA individuals, 20 Aa individuals, and 20 aa individuals. Is this population in Hardy-Weinberg equilibrium? (1) fAA=0.6, fAa=0.2 , faa=0.2 allele A frequency: p=0.6+½ ×0.2=0.7, allele a frequency: q=0.2+½ ×0.2=0.3 (2) If this population were at Hardy-Weinberg equilibrium, one would predict that fAA=0.72=0.49, fAa=2×0.7×0.3=0.42, faa=0.32=0.09. It is quite different than the observation. Thus the population is not at Hardy-Weinberg equilibrium. offspring’s genotype Mating types AA AA×AA 0.62 AA×Aa 2(0.6×0.2) AA×aa 2(0.6×0.2) Aa×Aa 0.2×0.2 Aa×aa 2(0.2×0.2) aa×aa 0.22 0.36 0.12 1 0.49 0.01 Aa aa 0.12 0.24 0.02 0.01 0.04 0.04 0.04 0.42 0.09 After one generation with random mating, the population is at Hardy-Weinberg equilibrium . 1. Hardy-Weinberg Equilibrium - application Application of Hardy-Weinberg Equilibrium Determine allele frequency in a population Determine inheritance pattern Determine allele frequency - Autosomal recessive diseases Genotype AA Genotype frequency p2 Phenotype phenotype frequency Aa aa 2pq q2 Unaffected Fu affected Fa Disease allele frequency(q)=(Fa)1/2 e.g. disease frequency (Fa)=1/10000 disease allele frequency=1/100 carrier frequency=2pq=2X0.01X0.99=0.0198 Determine allele frequency - Codominant alleles MN blood group: In a population, there were 233 type M individuals, 485 type MN individuals, and 129 type N individuals. IM allele frequency= 233X2+485 =0.57 2X(233+485+129) IN allele frequency= 129X2+485 =0.43 2X(233+485+129) Determine allele frequency - Multiple alleles ABO blood group: type A=41.72%; type B=8.56%; type O=46.68%; type AB=3.04% IA allele frequency= p; IB allele frequency= q i allele frequency= r Phenotype Genotype Genotype frequency Type A IAIA, IAi Type B IBIB, IBi Type AB IAIB Type O ii p2+2pr q2+2qr 2pq r2 r=(O)1/2=(0.4668)1/2=0.683 A+O= p2+2pr+r2=(p+r)2=(1-q)2 q=1-(A+O)1/2 =1-(0.4172+0.4668)1/2=0.06 p=1-(B+O)1/2 =1-(0.0856+0.4668)1/2=0.257 Determine allele frequency - X-linked recessive disease Phenotype affected Unaffected Genotype XAXA Genotype frequency p2 XAXa 2pq XAY X a Xa Xa Y p q2 q Disease allele frequency (q)= disease frequency in male or Disease allele frequency (q)= (disease frequency in female)1/2 Example - Color blindness: disease frequency is 7% in male , 0.49% in female Disease allele frequency = 7% For a rare AD disease, almost all the affected individuals are heterozygotes (Aa genotype). 2pq ≈1 p2+2pq For a rare AR disease, the genotype frequency of Aa is : 2pq=2q(1-q)=2q-2q2 ≈ 2q。 For a rare AR disease, the ratio of carriers to affected individuals is: 2pq 2 2q ≈ = 2 q2 q q For a rare XD disease, the ratio of affected males to affected females is: p 1 = = 2 p +2pq p +2q For 1 1 1 ≈ = p +2(1- p) 2- p 2 a rare XR disease, the ratio of affected males to q 1 affected females is: = q2 q Determine inheritance pattern Example - high myopia population study Disease frequency: 0.724% In the families with one affected parent, there were 104 offspring, of which 8 affected, frequency was 0.769%; In the families with unaffected parents, there were 1637 offspring, of which 10 affected, frequency was 0.61%. Based on the these results, please determine inheritance pattern of high myopia in the population. Determine inheritance pattern Suppose high myopia in the population was inherited in AR Disease allele frequency (q) = (0.724%)1/2=0.085 For the families with one affected parent Mating type frequency AA X aa 2p2q2 Aa X aa 4pq3 Offspring AA Aa 2p2q2 2pq3 aa 2pq3 Expected disease frequency in these families =2pq3/(2p2q2+4pq3)=q/(p+2q)=q/(1+q)=0.085/(1+0.085)=7.83% Observed disease frequency in these families was 7.69%, which is close to 7.83%. Determine inheritance pattern For the families with unaffected parent Mating type frequency AA X AA p4 AA X Aa 4p3q Aa X Aa 4p2q2 Offspring AA Aa p4 2p3q p2q2 aa 2p3q 2p2q2 Expected disease frequency in these families =p2q2/(p4+4p3q+4p2q2)=q2/(p2+4pq+4q2)=q2/(1+q)2 =0.0852/(1+0.085)2=0.647% 0.647% (expected) Vs. 0.61% (observed) p2q2 OUTLINE 1. Hardy-Weinberg Equilibrium 2. Factors affect H-W Equilibrium Factors affect H-W Equilibrium -Mutation Generation I A A A a a p=0.6; q=0.4 II A A a a a p=0.4; q=0.6 Factors affect H-W Equilibrium -Mutation A u v a fA=p A→a: ∆q=pu=(1-q)u a→A: ∆p=qv pu=(1-q)u > qv pu=(1-q)u < qv (1-q)u = qv fa ↑ fA ↑ q=u/(u+v), p=v/(u+v) fa=q Factors affect H-W Equilibrium -Selection Selection represents the action of environmental factors on a particular phenotype, and hence its genotype Selection may be positive or negative Selection is the consequence of differences of biological fitness (f). Fitness (f) : the probability of transmitting one’s genes to the next generation as compared with the average probability for the population. is a measure of fertility certain genotypes produce more offspring leads to adaptation selection coefficient S =1-f Selection for/against dominant alleles is efficient Mutant allele encoding dominant traits are expressed in heterozygotes and thus exposed to direct selection Aa aa A very weak selective change can rapidly alter the allele frequency H=2pq≈2p v=Sp=S×½H Selection for/against recessive alleles is inefficient AA Aa aa Why? - because then most recessive alleles are in heterozygotes Thus, rare disease-causing recessive alleles persist in the population in heterozygote carriers, even if they are lethal when homozygous u=Sq2 Selection against /for X-linked recessive alleles XAXA XAXa p2 2pq XaXa XAY XaY p q q2 fXa=⅓fM+⅔fF =⅓q+⅔(q2 +1/2 2pq) = q+2pq+2q2 3 =q Affected males: XaY; affected female: XaXa u=S*⅓q Efficiency: AD>XR>AR Selection against /for X-linked dominant alleles XAXA XAXa p2 2pq XaXa XAY q2 fXA=⅓fM+⅔fF=⅓p+⅔(p2 +1/2 2pq) = XaY q p p+2pq+2p2 3 =p Affected males: XAY; affected female: XAXa, XAXA v=S*(⅓p+⅔(p2 +1/2 2pq) )=S(⅓p+ ⅔p)=Sp Mutation affects genetic population AD: Aa 0 AR: aa ?? XR: XaY 1/3 XD: XAXa, XAY 0 Efficiency: AD or XD>XR>AR Factors affect H-W Equilibrium -Genetic drift Random fluctuation of gene frequencies in small population Factors affect H-W Equilibrium -Founder effect Founder effect A high frequency of a mutant gene in a population founded by a small ancestral group when one or more of the founders was o carrier of the mutant gene. Probably accounts for high frequency of inherited disorders in some human population Factors affect H-W Equilibrium -migration Generation I II A A A a a A A A a a A a a p=0.6; q=0.4 p=0.5; q=0.5 Gene flow: gradual diffusion of genes from one population to another across a barrier. The barrier may be physical or cultural and may be breached by migration or mixing Factors affect H-W Equilibrium -Nonrandom mating In human population, mating is seldom random Defined by racial, ethnic, religious, or other criteria Consanguinity- mating among close relatives, a special form of nonrandom mating in human population Consanguinity does increase the proportion of homozygotes in the next generation, thereby exposing disadvantageous recessive phenotypes to selection. Such selection may in turn alter allele frequencies in subsequent generations. 堂兄妹 姑表 姨表 舅表 二级表兄妹 Factors affect H-W Equilibrium -Nonrandom mating 1. Coefficient of relationship (r) : expresses the fraction of genes shared by two individuals 1st degree (parent, sib, child) 1/2 2nd degree (uncle, niece, grandchild) 1/4 3rd degree (first cousin, great-grand) 1/8 12.2 Factors affect H-W Equilibrium -Nonrandom mating 1/2 1/2 1/4 1/2 1/4 1/8 Factors affect H-W Equilibrium -Nonrandom mating 2. Inbreeding Coefficient (F) : The probability of identical homozygosity due to common ancestor Coefficient of Inbreeding (F) can be calculated by the formula n F= Σ(1/2) Where n = the number of steps between the proband and the common ancestor and Σ says sum it for the number of “alleles” that can become homozygous by descent First cousin P1 A1A2 P2 A1: A3A4 P1 ½ ½ F1 F2 G1 A3A3 S A1A1 A4A4 A2A2 G2 F1 ½ F2 ½ G1 ½ G2 ½ A1A1: (½)6=1/64 A2A2: (½)6=1/64 A3A3: (½)6=1/64 A4A4: (½)6=1/64 S F=4 X 1/64=1/16 A1A2 Second cousin P1 1/2 A1 A1:(½ )8=1/128 A2 A2:(½ )8=1/128 A3 A3:(½ )8=1/128 A4 A4:(½ )8=1/128 F=4×(½ )8=1/64 For autosome loci:F=r/2 B1 1/2 C1 1/2 A3A4 P2 1/2 B2 1/2 C2 1/2 D1 1/2 S D2 1/2 A 1A 1 A 2A 2 A 3A 3 A4A4 X-linked X1: P1 P2 X1 X2X3 F1 F2 G1 X3X3 P1 1 G2 S X1X1 X2X2 1 F1 ½ F2 ½ G1 1 G2 ½ S X2: P2 ½ ½ F1 ½ F2 ½ X1X1: (½)3=1/8 X2X2: (½)5=1/32 X3X3: (½)5=1/32 G1 1 G2 ½ S F=3/16 X-linked X1Y P1 X2X3 0 P2 1/2 1 1/2 B1 B2 1 1/2 C1 C2 1 1/2 S F=2×1/16=1/8 X2X2 X3X3 (½)4=1/16 (½)4=1/16 P1 P2 X1 X2X3 F1 F2 F=0 G1 X3X3 G2 S X1X1 X2X2 X1Y P1 X2X3 0 P2 1/2 1/2 0 B1 B2 1 0 C1 C2 0 S 1/2 F=0 例:已知Ⅱ1为AR患者,该病的PF=1/10000,求Ⅱ2与 其姨表兄弟结婚后代的再发风险? Ⅱ2与群体中一 个无关个体婚配,后代的再发风险又是多少? Aa Aa aa Aa 2/3 Aa 1/2 Aa 1/4 ? 1)如果Ⅱ2与其姨表兄弟婚配,子代发病风险为 2 1 1 1 = × × 3 4 4 24 2)如果Ⅱ2与群体中无亲缘关系的人婚配,子代发 病风险为: 由于PF=1/10000则:q2=1/10000, 则q=1/100,2pq≈2q=1/50。 2 1 1 1 = × × 3 50 4 300 Genetic Load Genetic load is the reduction in fitness for a population caused by mutant genes. mutational load segregation load