Download Genetics of quantitative traits and the Central Limit Theorem

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Epigenetics of neurodegenerative diseases wikipedia , lookup

Gene nomenclature wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Pathogenomics wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Twin study wikipedia , lookup

Epistasis wikipedia , lookup

Public health genomics wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Gene desert wikipedia , lookup

History of genetic engineering wikipedia , lookup

Essential gene wikipedia , lookup

Nutriepigenomics wikipedia , lookup

RNA-Seq wikipedia , lookup

Heritability of IQ wikipedia , lookup

Genomic imprinting wikipedia , lookup

Gene expression programming wikipedia , lookup

Gene wikipedia , lookup

Genome evolution wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Ridge (biology) wikipedia , lookup

Minimal genome wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Genome (book) wikipedia , lookup

Microevolution wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Designer baby wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Gene expression profiling wikipedia , lookup

Transcript
1
Plolygenic traits and the central limit theorem
José Miguel Ponciano: [email protected]
University of Florida, Biology Department
2
Darwin was once also frustrated with math. . .
Mathematics . . . was repugnant to me . . . [but] I have deeply regretted that I did not proceed
far enough at least to understand something of the great leading principles of mathematics;
for men thus endowed seem to have an extra sense. -Charles Darwin (Autobiography)
But this quote shows he came to understand why math was important, even for him.
Have you heard of Darwin Finches from the Galapagos Islands and how they are used to
illustrate the interaction between evolution and ecology (beak length variation)?
Have you heard explanations of evolutionary changes that go all the way to the effect of
multiple genes? Well, Ricklefs, my undergrad ecology class book author brings genes into
the the explanation of evolutionary phenomena
3
An example: polygenic traits
Why (to the first sentence of the caption)? What does the book author means here????
4
From the effect of one gene to the effect of many genes
• The beak size of Darwin’s finches is known to be controlled by many genes.
5
From the effect of one gene to the effect of many genes
• The beak size of Darwin’s finches is known to be controlled by many genes
• Suppose that the contribution of a single gene to the total beak size of a polygenic trait
can be either Small, Medium or Large (say 1, 2 or 3 cms. of width respectively) depending
on the physiology, other genes present, etc...
6
From the effect of one gene to the effect of many genes
• The beak size of Darwin’s finches is known to be controlled by many genes
• Suppose that the contribution of a single gene to the total beak size of a polygenic trait
can be either Small, Medium or Large (say 1, 2 or 3 cms. of width respectively) depending
on the physiology, other genes present, etc...
• Then, suppose that for a single gene involved in the beak size, the probability that the
conditions are such that its effect is small is 0.6, medium 0.3, large 0.1
7
From the effect of one gene to the effect of many genes
• The beak size of Darwin’s finches is known to be controlled by many genes
• Suppose that the contribution of a single gene to the total beak size of a polygenic trait
can be either Small, Medium or Large (say 1, 2 or 3 cms. of width respectively) depending
on the physiology, other genes present, etc...
• Then, suppose that for a single gene involved in the beak size, the probability that the
conditions are such that its effect is small is 0.6, medium 0.3, large 0.1
• This probabilistic model for the trait effect is what in probability is known as a “discrete
probability distribution”
8
The probability distribution of the effect of a single gene
1000 2000 3000 4000 5000 6000
0
Frequency
10000 draws from the gene effect prob. distribution
1.0
1.5
2.0
Gene effect size
2.5
3.0
9
From the effect of one gene to the effect of many genes
• The beak size of Darwin’s finches is known to be controlled by many genes
• Suppose that the contribution of a single gene to the total beak size of a polygenic trait
can be either Small, Medium or Large (say 1, 2 or 3 cms. of width respectively) depending
on the physiology, other genes present, etc...
• Then, suppose that for a single gene involved in the beak size, the probability that the
conditions are such that its effect is small is 0.6, medium 0.3, large 0.1
• This probabilistic model for the trait effect is what in probability is known as a “discrete
probability distribution”
• Calculation and definition of the mean effect size and the variance in the effect size for a
single gene.
10
From the effect of one gene to the effect of many genes
• The beak size of Darwin’s finches is known to be controlled by many genes
• Suppose that the contribution of a single gene to the total beak size of a polygenic trait
can be either Small, Medium or Large (say 1, 2 or 3 cms. of width respectively) depending
on the physiology, other genes present, etc...
• Then, suppose that for a single gene involved in the beak size, the probability that the
conditions are such that its effect is small is 0.6, medium 0.3, large 0.1
• This probabilistic model for the trait effect is what in probability is known as a “discrete
probability distribution”
• Calculation and definition of the mean effect size and the variance in the effect size for a
single gene.
• What would the distribution of the sum of two identical genes look like? And three?
11
The probability distribution of the effect of 2 genes
20000
10000
0
Frequency
30000
100000 draws adding the effect of two genes
2
3
4
Two genes effect size
5
6
12
The probability distribution of the effect of 3 genes
15000
5000
0
Frequency
25000
100000 draws adding the effect of three genes
3
4
5
6
7
Three genes effect size
8
9
13
The probability distribution of the effect of 4 genes
5000 10000
0
Frequency
20000
100000 draws adding the effect of four genes
4
6
8
Four genes effect size
10
12
14
The probability distribution of the effect of 5 genes
10000 15000 20000 25000
5000
0
Frequency
100000 draws adding the effect of five genes
6
8
10
Five genes effect size
12
14
15
The probability distribution of the effect of many genes
10000
5000
0
Frequency
15000
20000
100000 draws adding the effect of 30 genes
35
40
45
50
30 genes effect size
55
60
16
And that’s the explanation of the first sentence of the
book figure’s caption
• Suppose one starts with a single random variable (like the distribution of one gene’s effect
on the overall size of the beak), with a given mean (say m) and variance (say v). Then, the
Central Limit Theorem (CLT) that you learn in your most basic stats class tells us that if
we add the outcome of a large number (say n) of random variables that each have the same
distribution and are independent from each other, then the resulting sum will be normally
distributed. The mean of that normal distribution will be nm and its variance will be nv.
17
Now, in a dry year, mostly finches with larger beaks
survive, and the population distribution shifts to the
right. Did that shift (evolution) really happen?
We saw that the frequency distribution of the effect of 30 genes affecting beak size can be well
described with
distribution with mean µ = 45 and variance σ 2 = 13.5 (standard
p a Normal
deviation = (σ 2) ≈ 3.674235
Suppose that to gather evidence regarding whether evolution happened after drought we
capture n = 30 birds and measure the beak, obtaining a sample mean of x̄ = 46.5. Is that
evidence to show that evolution occurred?
Amazingly, the very same piece of math (the Central Limit Theorem) that allows us to describe
a polygenic trait with a normal distribution, allows us to answer this scientific question
18
The central limit theorem and the sample mean
• If we want to answer our biological question of interest (did evolution occurred after a dry
year), we’ll need the following result.
• Suppose one starts with a single random variable (like the distribution of one gene’s effect
on the overall size of the beak), with a given mean (say m) and variance (say v). Then, the
Central Limit Theorem (CLT) that you learn in your most basic stats class tells us that if
we add the outcome of a large number (say n) of random variables that each have the same
distribution and are independent from each other, then the resulting sum will be normally
distributed. The mean of that normal distribution will be nm and its variance will be nv.
• Now, before answering the question of interest, we need to learn about sampling
distributions, and the distribution of the sample mean!