Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
PBG 650 Advanced Plant Breeding Mathematical Statistics Concepts –Probability laws –Binomial distributions –Mean and Variance of Linear Functions Probability laws • the probability that either A or B occurs (union) Pr( A B) Pr( A ) Pr(B) Pr( A B) • • the probability that both A and B occur (intersection) the joint probability of A and B Pr( A B) Pr( A,B) • the conditional probability of B given A Pr( A,B) Pr(B A ) Pr( A ) Statistical Independence • If events A and B are independent, then Pr(B A ) Pr(B) Pr( A B) Pr( A ) Pr( A,B) Pr( A ) Pr(B) Bayes’ Theorum (Bayes’ Rule) • • Pr( A B ) Conditional probability Bayes’ Theorum Pr( A B ) Pr( A, B ) Pr(B ) Pr(B A)Pr A Pr(B ) . Pr(A) is called the prior probability Pr(A|B) is called the posterior probability Pr( Aj B ) Pr(B Aj )Pr Aj k Pr(B A )Pr A i 1 i i Probabilities Plant Height A1A1 A1A2 A2A2 Marginal Prob. height ≤ 50 cm 0.10 0.14 0.06 0.30 50 < height ≤ 75 0.04 0.18 0.10 0.32 height > 75 cm 0.02 0.16 0.20 0.38 Marginal Prob. 0.16 0.48 0.36 1.00 Marginal Probability: Pr(Genotype= A1A1) = 0.16 Joint Probability: Pr(Genotype= A1A1, height ≤ 50) = 0.10 Conditional Probability: Prheight 50 Genotype A1A1 Pr(height 50, Genotype A 1A1 ) 0.10 0.625 Pr(Genotype A 1A 1 ) 0.16 Statistical Independence If X is statistically independent of Y, then their joint probability is equal to the product of the marginal probabilities of X and Y A1A1 A1A2 A2A2 Marginal Prob. height ≤ 50 cm 0.0480 0.1440 0.1080 0.30 50 < height ≤ 75 0.0512 0.1536 0.1152 0.32 height > 75 cm 0.0608 0.1824 0.1368 0.38 Marginal Prob. 0.16 0.48 0.36 1.00 Plant Height Discrete probability distributions • Let x be a discrete random variable that can take on a value Xi, where i = 1, 2, 3,… • The probability distribution of x is described by specifying Pi = Pr(Xi) for every possible value of Xi • • • 0 ≤ Pr(Xi) ≤ 1 for all values of Xi ΣiPi = 1 The expected value of X is E(X) = ΣiXiPr(Xi) =X Binomial Probability Function • A Bernoulli random variable can have a value of one or zero. The Pr(X=1) = p, which can be viewed as the probability of success. The Pr(X=0) is 1-p. • A binomial distribution is derived from a series of independent Bernouli trials. Let n be the number of trials and y be the number of successes. – Calculate the number of ways to obtain that result: n! n y y! (n y )! – Calculate the probability of that result: n! n y Pr( y ) p y 1 p y! (n y )! Probability Function Binomial Distribution Probability Binomial Distribution (n=20, p=0.5) 0.20 0.18 0.16 0.14 0.12 0.10 0.08 0.06 0.04 0.02 0.00 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Number of successes Average = np = 20*0.5 = 10 Variance = np(1-p) = 20*0.5*(1-0.5) = 5 For a normal distribution, the variance is independent of the mean For a binomial distribution, the variance changes with the mean Mean and variance of linear functions • Mean and variance of a constant (c) E (c ) c 0 2 c • Adding a constant (c) to a random variable Xi E( X c ) E( X ) c the mean increases by the value of the constant the variance remains the same 2 X c 2 X Mean and variance of linear functions • Multiplying a random variable by a constant E (cX ) c[E ( X )] multiply the mean by the constant 2 cX c 2 X2 multiply the variance by the square of the constant Adding two random variables X and Y E ( X Y ) E ( X ) E (Y ) mean of the sum is the sum of the means (2X Y ) VAR( X ) VAR(Y ) 2COVXY variance of the sum the sum of the variances if the variables are independent Variance - definition • The variance of variable X V ( X ) E ( X i X )2 E ( X i2 ) X2 • Usual formula X 2 X i X n 2 2 X i n X i2 n • Formula for frequency data (weighted) V ( X ) X2 f i X i - X2 Covariance - definition • The covariance of variable X and variable Y Cov ( X ,Y ) E ( X X )(Y Y ) E ( XY ) X Y • Usual formula X XY i X Yi Y n X i Yi X i Yi n n • Formula for frequency data (weighted) Cov ( X , Y ) XY f i X iYi - X Y