Download Corporate Profile

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
PBG 650 Advanced Plant Breeding
Mathematical Statistics Concepts
–Probability laws
–Binomial distributions
–Mean and Variance of Linear Functions
Probability laws
•
the probability that either A or B occurs (union)
Pr( A  B)  Pr( A )  Pr(B)  Pr( A  B)
•
•
the probability that both A and B occur (intersection)
the joint probability of A and B
Pr( A  B)  Pr( A,B)
•
the conditional probability of B given A
Pr( A,B)
Pr(B A ) 
Pr( A )
Statistical Independence
• If events A and B are independent, then
Pr(B A )  Pr(B)
Pr( A B)  Pr( A )
Pr( A,B)  Pr( A ) Pr(B)
Bayes’ Theorum (Bayes’ Rule)
•
•
Pr( A B ) 
Conditional probability
Bayes’ Theorum
Pr( A B ) 
Pr( A, B )
Pr(B )
Pr(B A)Pr  A 
Pr(B )
.
Pr(A) is called the prior probability
Pr(A|B) is called the posterior probability
Pr( Aj B ) 
Pr(B Aj )Pr  Aj 
k
 Pr(B A )Pr  A 
i 1
i
i
Probabilities
Plant Height
A1A1
A1A2
A2A2
Marginal
Prob.
height ≤ 50 cm
0.10
0.14
0.06
0.30
50 < height ≤ 75
0.04
0.18
0.10
0.32
height > 75 cm
0.02
0.16
0.20
0.38
Marginal Prob.
0.16
0.48
0.36
1.00
Marginal Probability:
Pr(Genotype= A1A1) = 0.16
Joint Probability:
Pr(Genotype= A1A1, height ≤ 50) = 0.10
Conditional Probability:
Prheight  50 Genotype  A1A1 

Pr(height  50, Genotype  A 1A1 ) 0.10

 0.625
Pr(Genotype  A 1A 1 )
0.16
Statistical Independence
If X is statistically independent of Y, then their joint
probability is equal to the product of the marginal
probabilities of X and Y
A1A1
A1A2
A2A2
Marginal
Prob.
height ≤ 50 cm
0.0480
0.1440
0.1080
0.30
50 < height ≤ 75
0.0512
0.1536
0.1152
0.32
height > 75 cm
0.0608
0.1824
0.1368
0.38
Marginal Prob.
0.16
0.48
0.36
1.00
Plant Height
Discrete probability distributions
•
Let x be a discrete random variable that can take on
a value Xi, where i = 1, 2, 3,…
•
The probability distribution of x is described by
specifying Pi = Pr(Xi) for every possible value of Xi
•
•
•
0 ≤ Pr(Xi) ≤ 1 for all values of Xi
ΣiPi = 1
The expected value of X is
E(X) = ΣiXiPr(Xi) =X
Binomial Probability Function
•
A Bernoulli random variable can have a value of one or zero.
The Pr(X=1) = p, which can be viewed as the probability of
success. The Pr(X=0) is 1-p.
•
A binomial distribution is derived from a series of
independent Bernouli trials. Let n be the number of trials and
y be the number of successes.
– Calculate the number of ways to obtain that result:
n!
n 
y
y! (n  y )!
– Calculate the probability of that result:


n!
n y
Pr( y ) 
p y 1  p
y! (n  y )!

Probability Function
Binomial Distribution
Probability
Binomial Distribution (n=20, p=0.5)
0.20
0.18
0.16
0.14
0.12
0.10
0.08
0.06
0.04
0.02
0.00
0
1
2
3
4
5
6
7
8
9 10 11 12 13 14 15 16 17 18 19 20
Number of successes
Average = np = 20*0.5 = 10
Variance = np(1-p) = 20*0.5*(1-0.5) = 5
For a normal distribution, the variance is independent of the mean
For a binomial distribution, the variance changes with the mean
Mean and variance of linear functions
•
Mean and variance of a constant (c)
E (c )  c
 0
2
c
•
Adding a constant (c) to a random variable Xi
E( X  c )  E( X )  c
the mean increases by the
value of the constant

the variance remains the same
2
X c

2
X
Mean and variance of linear functions
•
Multiplying a random variable by a constant
E (cX )  c[E ( X )]
multiply the mean by the constant
2
 cX
 c 2 X2
multiply the variance by the
square of the constant
Adding two random variables X and Y
E ( X  Y )  E ( X )  E (Y )
mean of the sum is the sum
of the means
 (2X Y )  VAR( X )  VAR(Y )  2COVXY
variance of the sum  the sum of the variances if the variables
are independent
Variance - definition
• The variance of variable X
V ( X )  E ( X i  X )2   E ( X i2 )  X2
• Usual formula
X


 
2
X
i
 X
n
2
2


X


 i  n
   X i2 
n




• Formula for frequency data (weighted)
V ( X )   X2   f i X i -  X2
Covariance - definition
• The covariance of variable X and variable Y
Cov ( X ,Y )  E ( X  X )(Y  Y )  E ( XY )  X Y
• Usual formula
X


XY 
i
 X  Yi  Y 
n

X i  Yi 

   X i Yi 
 n
n


• Formula for frequency data (weighted)
Cov ( X , Y )   XY   f i X iYi -  X Y
Related documents