Download ppt - UBC Zoology

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
How do I know which
distribution to use?
Three discrete distributions
Proportional
Poisson
Binomial
Proportional
Given a number of categories
Probability proportional to number of opportunities
Days of the week, months of the year
Binomial
Number of successes in n trials
Have to know n, p under the null hypothesis
Punnett square, many p=0.5 examples
Poisson
Number of events in interval of space or time
n not fixed, not given p
Car wrecks, flowers in a field
Proportional
Binomial
Binomial with large n, small p converges
to the Poisson distribution
Poisson
Examples: name that distribution
•
•
•
•
•
Asteroids hitting the moon per year
Babies born at night vs. during the day
Number of males in classes with 25 students
Number of snails in 1x1 m quadrats
Number of wins out of 50 in rock-paper-scissors
Proportional
Binomial
Poisson
Generate
expected
values
Calculate
2 test statistic
Sample
Test statistic
Null hypothesis
compare
Null distribution
How unusual is this test statistic?
P < 0.05
Reject Ho
P > 0.05
Fail to reject Ho
Chi-squared goodness of fit test
Null hypothesis:
Data fit a particular
Discrete distribution
Sample
Calculate expected values
Chi-squared
Test statistic
compare
Null distribution:
2 With
N-1-p d.f.
How unusual is this test statistic?
P < 0.05
Reject Ho
P > 0.05
Fail to reject Ho
The Normal Distribution
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
Babies are “normal”
Babies are “normal”
Normal distribution
0.4
0.3
0.2
0.1
-2
-1
0
1
2
3
Normal distribution
• A continuous probability distribution
• Describes a bell-shaped curve
• Good approximation for many biological
variables
Continuous Probability
Distribution
0.4
0.3
0.2
0.1
-2
-1
0
1
2
3
The normal distribution is very
common in nature
Human body temperature
Human birth weight
Number of
bristles on a
Drosophila
abdomen
Discrete probability
distribution
Continuous probability
distribution
Poisson
Probability density
Probability
0.4
0.3
Normal
0.2
0.1
-2
Pr[X=2] = 0.22
-1
0
1
2
3
Pr[X=2] = ?
Probability that X is EXACTLY
2 is very very small
Discrete probability
distribution
Continuous probability
distribution
Poisson
Probability density
Probability
0.4
0.3
Normal
0.2
0.1
-2
Pr[X=2] = 0.22
-1
0
1
2
Pr[1.5≤X≤2.5] =
Area under the curve
3
Discrete probability
distribution
Continuous probability
distribution
Poisson
Probability density
Probability
0.4
0.3
Normal
0.2
0.1
-2
Pr[X=2] = 0.22
-1
0
Pr[1.5≤X≤2.5] =
0.06
1
2
3
Normal distribution
Probability density
0.4
0.3
0.2
0.1
-2
-1
0
1
2
3
x
f x  

1
2
2
e
x  2
2
2
Normal distribution
Probability density
0.4
0.3
0.2
0.1
-2
-1
0
1
2
3
x
Area under the curve:
Pr[a  X  b] 

b
a
f [X]dX
Pr[a  X  b] 


b
1
a
2

2
e

b
a
x  
2
2
f [X]dX
2
dX
But don’t worry about this for now!
A normal distribution is fully
described by its mean and
variance
Fig ure 10.3. The normal distribution with different values of the
mean and variance. (a)  =5,  =4; (b) = -3; =1/2.
A normal distribution is
symmetric around its mean
Probability
Density
Y
About 2/3 of random draws from
a normal distribution are within
one standard deviation of the
mean
About 95% of random draws
from a normal distribution are
within two standard deviations of
the mean
(Really, it’s 1.96 SD.)
Properties of a Normal
Distribution
•
•
•
•
Fully described by its mean and variance
Symmetric around its mean
Mean = median = mode
2/3 of randomly-drawn observations fall
between - and +
• 95% of randomly-drawn observations fall
between -2 and +2
Standard normal distribution
• A normal distribution with:
• Mean of zero. ( = 0)
• Standard deviation of one. (= 1)
0.4
0.3
0.2
0.1
-2
-1
0
1
2
3
Standard normal table
• Gives the probability of getting a random
draw from a standard normal distribution
greater than a given value
Standard normal table: Z = 1.96
x.x0
x.x1
x.x2
.x3
x.x4
x.x5
x.x6
x.x7
x.x8
x.x9
...
1.5 0.06681 0.06552 0.06426 0.06301 0.06178 0.06057 0.05938 0.05821 0.05705 0.05592
1.6
0.0548
0.0537
0.05262 0.05155
0.0505
0.04947 0.04846 0.04746 0.04648 0.04551
1.7 0.04457 0.04363 0.04272 0.04182 0.04093 0.04006
0.0392
0.03836 0.03754 0.03673
1.8 0.03593 0.03515 0.03438 0.03362 0.03288 0.03216 0.03144 0.03074 0.03005 0.02938
1.9 0.02872 0.02807 0.02743
0.0268
0.02619 0.02559
2.0 0.02275 0.02222 0.02169 0.02118 0.02068 0.02018
2.1 0.01786 0.01743
2.2
0.0139
0.017
0.025
0.0197
0.01659 0.01618 0.01578 0.01539
0.01355 0.01321 0.01287 0.01255 0.01222 0.01191
0.02442 0.02385
0.0233
0.01923 0.01876 0.01831
0.015
0.0116
0.01463 0.01426
0.0113
0.01101
Standard normal table: Z = 1.96
Pr[Z>1.96]=0.025
Normal Rules
• Pr[X < x] =1- Pr[X > x]
+
=1
Pr[X<x] + Pr[X>x]=1
Standard normal is symmetric, so...
• Pr[X > x] = Pr[X < -x]
Normal Rules
• Pr[X > x] = Pr[X < -x]
• Pr[X < x] =1- Pr[X > x]
Sample standard normal
calculations
•
•
•
•
•
Pr[Z > 1.09]
Pr[Z < -1.09]
Pr[Z > -1.75]
Pr[0.34 < Z < 2.52]
Pr[-1.00 < Z < 1.00]
What about other normal
distributions?
• All normal distributions are shaped alike,
just with different means and variances
What about other normal
distributions?
• All normal distributions are shaped alike,
just with different means and variances
• Any normal distribution can be converted to
a standard normal distribution, by
Z
Y

Z-score
Z
Y

Z tells us how many standard
deviations Y is from the mean

The probability of getting a value
greater than Y is the same as the
probability of getting a value
greater than Z from a standard
normal distribution.
Z
Y

Z tells us how many standard
deviations Y is from the mean

Pr[Z > z] = Pr[Y > y]
Example: British spies
MI5 says a man has to be shorter
than 180.3 cm tall to be a spy.
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
Mean height of British men is
177.0cm, with standard deviation
7.1cm, with a normal distribution.
What proportion of British men are
excluded from a career as a spy by
this height criteria?
Draw a rough sketch of the
question
 = 177.0cm
 = 7.1cm
y = 180.3
Pr[Y > y]
Z
Y 

180.3 177.0
Z
7.1
Z  0.46
Part of the standard normal table
x.x0
x.x1
x.x2
.x3
x.x4
x.x5
x.x6
x.x7
x.x8
x.x9
0.0
0.5
0.49601
0.49202
0.48803
0.48405
0.48006
0.47608
0.4721
0.46812
0.46414
0.1
0.46017
0.4562
0.45224
0.44828
0.44433
0.44038
0.43644
0.43251
0.42858
0.42465
0.2
0.42074
0.41683
0.41294
0.40905
0.40517
0.40129
0.39743
0.39358
0.38974
0.38591
0.3
0.38209
0.37828
0.37448
0.3707
0.36693
0.36317
0.35942
0.35569
0.35197
0.34827
0.4
0.34458
0.3409
0.33724
0.3336
0.32997
0.32636
0.32276
0.31918
0.31561
0.31207
0.5
0.30854
0.30503
0.30153
0.29806
0.2946
0.29116
0.28774
0.28434
0.28096
0.2776
Pr[Z > 0.46] = 0.32276,
so Pr[Y > 180.3] = 0.32276
Sample problem
MI5 says a woman has to be shorter than
172.7 cm tall to be a spy.
The mean height of women in Britain is
163.3 cm, with standard deviation 6.4
cm. What fraction of women meet the
height standard for application to MI5?
Related documents