Download z - Sylmar High School

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
Chapter Seven
Normal Curves
and Sampling
Distributions
Understanding Basic Statistics
Fourth Edition
By Brase and Brase
Prepared by: Lynn Smith
Gloucester County College
Properties of The Normal Distribution
The curve is bell-shaped with the
highest point over the mean, μ.
Copyright © Houghton Mifflin Company. All rights reserved.
7|2
Properties of The Normal Distribution
The curve is symmetrical about a
vertical line through μ.
Copyright © Houghton Mifflin Company. All rights reserved.
7|3
Properties of The Normal Distribution
The curve approaches the horizontal
axis but never touches or crosses it.
Copyright © Houghton Mifflin Company. All rights reserved.
7|4
Properties of The Normal Distribution
The transition points between
cupping upward and downward occur
above μ + σ and μ – σ .
Copyright © Houghton Mifflin Company. All rights reserved.
7|5
The Empirical Rule
• Applies to any symmetrical and bellshaped distribution
Copyright © Houghton Mifflin Company. All rights reserved.
7|6
The Empirical Rule
Approximately 68% of the data values lie within
one standard deviation of the mean.
Copyright © Houghton Mifflin Company. All rights reserved.
7|7
The Empirical Rule
Approximately 95% of the data values lie within
two standard deviations of the mean.
Copyright © Houghton Mifflin Company. All rights reserved.
7|8
The Empirical Rule
Almost all (approximately 99.7%) of the data
values will be within three standard deviations
of the mean.
Copyright © Houghton Mifflin Company. All rights reserved.
7|9
Application of the Empirical Rule
The life of a particular type of light bulb is
normally distributed with a mean of
1100 hours and a standard deviation of
100 hours.
What is the probability that a light bulb of
this type will last between 1000 and
1200 hours?
Approximately 68%
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 10
Standard Score
• The z value or z score tells the number
of standard deviations between the
original measurement and the mean.
• The z value is in standard units.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 11
Formula for z score
x
z

Copyright © Houghton Mifflin Company. All rights reserved.
7 | 12
x Values and Corresponding z Values
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 13
Calculating z scores
The amount of time it takes for a pizza
delivery is approximately normally distributed
with a mean of 25 minutes and a standard
deviation of 2 minutes. Convert 21 minutes
to a z score.
x   21  25
z

 2.00

2
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 14
Calculating z scores
Mean delivery time = 25 minutes
Standard deviation = 2 minutes
Convert 29.7 minutes to a z score.
x   29.7  25
z

 2.35

2
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 15
Raw Score
• A raw score is the result of converting
from standard units (z scores) back to
original measurements, x values.
• Formula:
x=z+
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 16
Interpreting z-scores
Mean delivery time = 25 minutes
Standard deviation = 2 minutes
Interpret a z score of 1.60.
x  z     1 .6( 2 )  25  28 .2
The delivery time is 28.2 minutes.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 17
Standard Normal Distribution:

=0

=1
Any x values are converted to z
scores.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 18
Importance of the
Standard Normal Distribution:
Any Normal Curve:
Areas will be equal.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 19
Areas of a Standard Normal Distribution
• Appendix
• Table 3
• Pages A6 - A7
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 20
Use of the Normal Probability Table
Appendix Table 3 is a left-tail style table.
Entries give the cumulative areas to the
left of a specified z.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 21
To Find the area to the
Left of a Given z score
• Find the row associated with the sign,
units and tenths portion of z in the left
column of Table 3.
• Move across the selected row to the
column headed by the hundredths digit
of the given z.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 22
Find the area to the left of
z = – 2.84
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 23
To find the area to the left of
z = – 2.84
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 24
To find the area to the left of
z = – 2.84
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 25
The area to the left of
z = – 2.84 is .0023
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 26
To Find the Area to the Left of a Given
Negative z Value:
Use Table 3 of the Appendix directly.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 27
To Find the Area to the Left
of a Given Positive z Value:
Use Table 3 of the Appendix directly.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 28
To Find the Area to the Right
of a Given z Value:
Subtract the area to the left of z from
1.0000.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 29
Alternate Way To Find the Area to the Right
of a Given Positive z Value:
Use the symmetry of the normal distribution.
Area to the right of z
= area to left of –z.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 30
To Find the Area Between Two z Values
Subtract area to left of z1 from area to left
of z2 . (When z2 > z1.)
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 31
Convention for Using Table 3
• Treat any area to the left of a z value
smaller than 3.49 as 0.000
• Treat any area to the left of a z value
greater than 3.49 as 1.000
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 32
Use of the Normal Probability Table
a.
.9495
P( z < 1.64 ) = __________
b.
.0034
P( z < - 2.71 ) = __________
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 33
Use of the Normal Probability Table
c.
.3925
P(0 < z < 1.24) = ______
d.
.4452
P(0 < z < 1.60) = _______
e.
.4911
P( 2.37 < z < 0) = ______
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 34
Use of the Normal Probability Table
f.
.9974
P( 3 < z < 3 ) = ________
g.
.9322
P( 2.34 < z < 1.57 ) = _____
h.
.0774
P( 1.24 < z < 1.88 ) = _______
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 35
Use of the Normal Probability Table
i.
P( 2.44 < z < 0.73 ) = _______
.2254
j.
P( z > 2.39 ) = ________
.0084
k.
P( z > 1.43 ) = __________
.9236
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 36
To Work with Any Normal Distributions
• Convert x values to z values using the
formula:
x
z

Use Table 3 of the Appendix to find
corresponding areas and probabilities.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 37
Rounding
• Round z values to the hundredths
positions before using Table 3.
• Leave area results with four digits to
the right of the decimal point.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 38
Application of the Normal Curve
The amount of time it takes for a pizza delivery is
approximately normally distributed with a mean of 25
minutes and a standard deviation of 2 minutes. If you
order a pizza, find the probability that the delivery time
will be:
a. between 25 and 27 minutes.
.3413
a. __________
b. less than 30 minutes.
.9938
b. __________
c. less than 22.7 minutes.
.1251
c. __________
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 39
Inverse Normal Probability Distribution
• Finding z or x values that correspond to
a given area under the normal curve
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 40
Inverse Normal Left Tail Case
Look up area A in body of Table 3 and use
corresponding z value.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 41
Inverse Normal Right Tail Case:
Look up the number 1 – A in body of
Table 3 and use corresponding z value.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 42
Inverse Normal Center Case:
Look up the number (1 – A)/2 in body of
Table 3 and use corresponding ± z value.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 43
Using Table 3 for
Inverse Normal Distribution
• Use the nearest area value rather than
interpolating.
• When the area is exactly halfway between
two area values, use the z value exactly
halfway between the z values of the
corresponding table areas.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 44
When the area is exactly
halfway between two area values
• When the z value corresponding to an
area is smaller than 2, use the z value
corresponding to the smaller area.
• When the z value corresponding to an
area is larger than 2, use the z value
corresponding to the larger area.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 45
Find the indicated z score:
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 46
Find the indicated z score:
– 2.57
z = _______
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 47
Find the indicated z score:
2.33
z = _______
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 48
Find the indicated z scores:
–1.23
–z = _____
Copyright © Houghton Mifflin Company. All rights reserved.
1.23
z = ____
7 | 49
Find the indicated z scores:
± 2.58
± z =__________
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 50
Find the indicated z scores:
± 1.96
± z = ________
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 51
Application of Determining z Scores
The Verbal SAT test has a mean score
of 500 and a standard deviation of 100.
Scores are normally distributed. A
major university determines that it will
accept only students whose Verbal SAT
scores are in the top 4%. What is the
minimum score that a student must
earn to be accepted?
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 52
Application of Determining z Scores
The cut-off score is 1.75 standard deviations
above the mean.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 53
Application of Determining z Scores
Mean = 500
standard
deviation = 100
The cut-off score is 500 + 1.75(100) = 675.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 54
Introduction to Sampling Distributions
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 55
Review of Statistical Terms
• Population
• Sample
• Parameter
• Statistic
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 56
Population
• the set of all measurements or counts
• (either existing or conceptual)
• under consideration
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 57
Sample
• a subset of measurements from a
population
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 58
Parameter
• a numerical descriptive measure of a
population
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 59
Statistic
• a numerical descriptive measure of a
sample
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 60
We use a statistic to make
inferences about a population parameter.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 61
Some Common Statistics
and Corresponding Parameters
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 62
Principal Types of Inferences
• Estimation: estimate the value of a
population parameter
• Testing: formulate a decision about the
value of a population parameter
• Regression: Make predictions or
forecasts about the value of a statistical
variable
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 63
Sampling Distribution
• a probability distribution for the sample
statistic
• based on all possible random samples
of the same size from the same
population
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 64
Example of a Sampling Distribution
• Select samples with two elements each
(in sequence with replacement) from
the set
• {1, 2, 3, 4, 5, 6}.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 65
Constructing a Sampling Distribution of the
Mean for Samples of Size n = 2
List all samples (with 2 items in each) and compute
the mean of each sample.
sample:
mean:
sample:
mean
{1,1}
{1,2}
{1,3}
{1,4}
{1,5}
1.0
1.5
2.0
2.5
3.0
{1,6}
{2,1}
{2,2}
…
3.5
1.5
4
...
How many different samples are there? 36
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 66
Sampling Distribution of the Mean
x
p
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
5.5
6.0
1/36
2/36
3/36
4/36
5/36
6/36
5/36
4/36
3/36
2/36
1/36
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 67
Let x be a random variable with a
normal distribution with mean  and
standard deviation . Let x be the
sample mean corresponding to
random samples of size n taken from
the distribution .
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 68
Theorem 7.1:
• The x distribution is a normal distribution.
• The mean of the x distribution is  (the
same mean as the original distribution).
• The standard deviation of the x distribution
is / n (the standard deviation of the
original distribution, divided by the square
root of the sample size).
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 69
We can use this theorem to draw
conclusions about means of
samples taken from normal
distributions.
If the original distribution is
normal, then the sampling
distribution will be normal.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 70
The Mean of the Sampling Distribution
x
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 71
The mean of the sampling distribution is
equal to the mean of the original
distribution.
x  
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 72
The Standard Deviation
of the Sampling Distribution
x
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 73
The standard deviation of the sampling
distribution is equal to the standard
deviation of the original distribution divided
by the square root of the sample size.

x 
n
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 74
To Calculate z Scores
x  x x  
z

x
 n
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 75
The time it takes to drive between cities A and
B is normally distributed with a mean of 14
minutes and a standard deviation of 2.2
minutes.
1. Find the probability that a trip between
the cities takes more than 15 minutes.
2. Find the probability that mean time of
nine trips between the cities is more
than 15 minutes.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 76
Mean = 14 minutes,
standard deviation = 2.2 minutes
• Find the probability that a trip between
the cities takes more than 15 minutes.
15  14
z
 0.45
2.2
P( z  0.45)  1  0.6736  0.3264
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 77
Mean = 14 minutes,
standard deviation = 2.2 minutes
• Find the probability that the mean time
of nine trips between the cities is more
than 15 minutes.
 x    14

2 .2
x 

 0.73
n
9
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 78
Mean = 14 minutes, standard deviation =
2.2 minutes
• Find the probability that mean time of nine
trips between the cities is more than 15
minutes.
15  14
z
 1.37
0.73
P ( z  1.37 )  1  0.9147  0.0853
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 79
Standard Error of the Mean
• The standard error of the mean is the
standard deviation of the sampling
distribution
• For the x sampling distributi on, the standard error 
of the mean
x  
Copyright © Houghton Mifflin Company. All rights reserved.
n
7 | 80
What if the Original
Distribution Is Not Normal?
• Use the Central Limit Theorem.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 81
Central Limit Theorem
If x has any distribution with mean  and
standard deviation , then the sample
mean based on a random sample of
size n will have a distribution that
approaches the normal distribution
(with mean  and standard deviation 
divided by the square root of n) as n
increases without limit.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 82
How large should the sample size be to
permit the application of the Central Limit
Theorem?
• In most cases a sample size of n = 30
or more assures that the distribution
will be approximately normal and the
theorem will apply.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 83
Central Limit Theorem
• For most x distributions, if we use a
sample size of 30 or larger, the x
distribution will be approximately
normal.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 84
Central Limit Theorem
• The mean of the sampling distribution
is the same as the mean of the original
distribution.
• The standard deviation of the sampling
distribution is equal to the standard
deviation of the original distribution
divided by the square root of the
sample size.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 85
Central Limit Theorem Formula
x  
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 86
Central Limit Theorem Formula

x 
n
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 87
Central Limit Theorem Formula
x x x
z

 x / n
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 88
Application of the Central Limit Theorem
Records indicate that the packages shipped by
a certain trucking company have a mean weight
of 510 pounds and a standard deviation of 90
pounds. One hundred packages are being
shipped today. What is the probability that their
mean weight will be:
a.
b.
c.
more than 530 pounds?
less than 500 pounds?
between 495 and 515 pounds?
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 89
Are we authorized to
use the Normal Distribution?
• Yes, we are attempting to draw
conclusions about means of large
samples.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 90
Applying the Central Limit Theorem
What is the probability that the mean weight will
be more than 530 pounds?
Consider the distribution of sample means:
 x  510 ,  x  90 / 100  9
P( x > 530): z = 530 – 510 = 20 = 2.22
9
9
.0132
P(z > 2.22) = _______
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 91
Applying the Central Limit Theorem
What is the probability that their mean weight
will be less than 500 pounds?
P( x < 500): z = 500 – 510 = –10 = – 1.11
9
9
P(z < – 1.11) = _______
.1335
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 92
Applying the Central Limit Theorem
What is the probability that their mean weight
will be between 495 and 515 pounds?
P(495 < x < 515) :
for 495: z = 495 – 510 = - 15 = - 1.67
9
9
for 515: z = 515 – 510 = 5 = 0.56
9
9
.6648
P(–1.67 < z < 0.56) = ______
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 93
Normal Approximation of
The Binomial Distribution:
• If n (the number of trials) is sufficiently
large, a binomial random variable has a
distribution that is approximately
normal.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 94
Define “sufficiently large”
The sample size, n, is considered to be
"sufficiently large" if
np and nq
are both greater than 5.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 95
Mean and Standard
Deviation: Binomial Distribution
  np and   npq
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 96
Experiment: tossing a coin 20 times
Problem: Find the probability of getting
exactly 10 heads.
Distribution of the number of heads appearing should look
like:
0
Copyright © Houghton Mifflin Company. All rights reserved.
10
20
7 | 97
Using the Binomial Probability Formula
n=
20
x=
10
P(10) = 0.176197052
p=
0.5
q = 1 – p = 0.5
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 98
Normal Approximation of the Binomial
Distribution
First calculate the mean
and standard deviation:
 = np = 20 (.5) = 10

np(1  p ) 
20(. 5 )(. 5 ) 
Copyright © Houghton Mifflin Company. All rights reserved.
5  2.24
7 | 99
The Continuity Correction
Continuity Correction allows us to approximate
a discrete probability distribution with a
continuous distribution.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 100
The Continuity Correction
• We are using the area under the curve
to approximate the area of the
rectangle.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 101
The Continuity Correction
• If r is the left-point of an interval,
subtract 0.5 to obtain the
corresponding normal variable.
• x = r  0.5
• If r is the right-point of an interval, add
0.5 to obtain the corresponding normal
variable.
• x = r + 0.5
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 102
The Continuity Correction
• Continuity Correction: to compute the
probability of getting exactly 10 heads,
find the probability of getting between
9.5 and 10.5 heads.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 103
Using the Normal Distribution
P(9.5 < x < 10.5 ) = ?
For x = 9.5: z = – 0.22
For x = 10.5: z = 0.22
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 104
Using the Normal Distribution
P(9.5 < x < 10.5 ) =
P( – 0.22 < z < 0.22 ) =
.5871 – .4129 = .1742
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 105
Application of Normal Distribution
If 22% of all patients with high blood pressure
have side effects from a certain medication,
and 100 patients are treated, find the
probability that at least 30 of them will
have side effects.
Using the Binomial Probability Formula we would
need to compute:
P(30) + P(31) + ... + P(100) or 1 - P( x < 29)
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 106
Using the Normal Approximation to the
Binomial Distribution
Is n sufficiently large?
Check: np =
nq =
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 107
Using the Normal Approximation to the
Binomial Distribution
Is n sufficiently large?
np = 22
nq = 78
Both are greater than five.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 108
Find the mean and standard deviation
 = 100(.22) = 22
and  = 100(.22)(.78) 
17.16  4.14
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 109
Applying the Normal Distribution
To find the probability that at least 30 of
them will have side effects, find P(x  29.5)
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 110
Applying the Normal Distribution
z = 29.5 – 22 = 1.81
4.14
Find P(z  1.81)
Copyright © Houghton Mifflin Company. All rights reserved.
The probability that
at least 30 of the
patients will have
side effects is
0.0351.
7 | 111
Reminders:
• Use the normal distribution to
approximate the binomial only if both
np and nq are greater than 5.
• Always use the continuity correction
when using the normal distribution to
approximate the binomial distribution.
Copyright © Houghton Mifflin Company. All rights reserved.
7 | 112