Download Chapter 7: The Central Limit Theorem

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Probability wikipedia , lookup

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Transcript
7.1
Sampling Distributions (Page 1 of 10)
7.1 Sampling Distributions
Parameters versus Statistics
A parameter is a
Measure
Statistic Parameter
numerical measure of a

x
population. A statistic is Mean
2
s2
a numerical measure of a Variance
Standard Deviation
s

sample. For a given

p
population, a parameter is Proportion
fixed, while the value of a
statistic may vary from sample to sample. Statistics estimate the
value of population parameters and provide a basis for us to make
inferences about the parameters.
Types of Inferences
1. Estimation: In this type of inference, we estimate the value
of a population parameter.
2. Testing: In this type of inference, we formulate a decision
about the value of a population parameter.
3. Regression:
In this type of inference we make
predictions or forecasts about the value of a statistical
variable.
Sampling Distribution
A sampling distribution is a probability distribution of a sample
statistic based on all possible simple random samples of the same
size from the same population.
7.1
Sampling Distributions (Page 2 of 10)
Example 1 – Sampling Distribution for x
The Pinedale Children’s fishing pond has a five fish limit. Every
child catches the limit. The data in table below is the length of each
fish caught by 100 randomly selected children. The mean x of the
5-fish sample for each child is also tabulated.
7.1
Sampling Distributions (Page 3 of 10)
a. What makes up the members of the sample?
There are 100 members in the sample - the 100 means ( x ’s) of
the 5-fish samples.
b. What is the sample statistic corresponding to each sample?
The sample statistic of each sample is the sample mean, x .
c. What population parameter is being estimated by the sample
statistic x . What population parameter are we are trying to
make an inference about?
The population mean,  .
d. What is the sampling distribution?
A sampling distribution is a probability distribution for a
sample statistic (in this case x ). It is given in table below and
its graph is given in Figure 7-1.
x
Class
Boundaries
8.39-8.86
8.77-9.14
9.15-9.12
9.53-9.90
9.91-10.28
10.29-10.66
10.67-11.04
11.05-11.42
11.43-11.80
f
Frequency
1
5
10
19
27
18
12
5
3
f/n
Relative
Frequency
0.01
0.05
0.10
0.19
0.27
0.18
0.12
0.05
0.03
7.2 The Central Limit Theorem (Page 4 of 10)
7.2 Central Limit Theorem (Part 1)
Let x be a random variable with a normal distribution whose mean
is  and standard deviation is  . Let x be the sample mean
corresponding to a random sample of size n taken from the xdistribution. Then the following are true:
(a) The x -distribution is a normal distribution.
(b) The mean of the x -distribution is the population mean (  ).
(c) The standard error is the standard deviation of the x distribution which is  x   / n .
Thus, If x has a normal distribution, then the x -distribution is a
normal distribution for any sample size n and
x   / n
x  
and
Example 2
Suppose a team of biologists has been studying the Pinedale
Children’s fishing pond of example 1 in section 7.1. Let x
represent the length of a single trout taken at random from the
pond. Assume x has a normal distribution with a  = 10.2 in. and
standard deviation  = 1.4 in.
(a) What is the probability that a single trout
taken at random from the pond is between
8 and 12 inches?
(b) What is the probability that the mean
length x of 5 trout taken at random
is between 8 and 12 inches?
(c)
Explain the difference between (a) and
(b).
7.2 The Central Limit Theorem (Page 5 of 10)
Central Limit Theorem (Part 2)
If x is any distribution with mean  and standard deviation  ,
then, as n increases without limit, the sample mean x , based on a
random sample of size n, will have a distribution that approaches a
normal distribution with mean of  x   and standard deviation
(standard error) of  x   / n .
That is, no matter what the original distribution on x, as the sample
size n gets larger and larger, the distribution of sample means
( x ’s) will approach a normal distribution.
Empirically, it has been found that in the vast majority of cases
picking a sample size of thirty or more 4( n  30 ) will yield an x distribution that is approximately normal. The larger n gets, the
more normal the x -distribution will become. Thus, for any sample
( n  30 ) on any x-distribution, we have the following
approximations:  x   , and  x   / n .
Guided Exercise 2
(a) Suppose x has a normal distribution with mean of 18 and
standard deviation of 3. If we draw random samples of size 5
from the x-distribution and x represents the sample mean,
what can be said about the x -distribution?
(b) Suppose the x distribution has mean of 75 and standard
deviation of 12. If we draw random samples of size 30 from
the x-distribution and x represents the sample mean, what can
be said about the x -distribution?
(c) Suppose you did not know if x had a normal distribution.
Would you be justified in saying that the x -distribution is
approximately normal if the sample size n = 8?
7.2 The Central Limit Theorem (Page 6 of 10)
Example 3
A strain of bacteria occurs in all raw milk. Let x be the bacteria
count per milliliter of milk. The health department has found that
if the milk is not contaminated, then the x-distribution is
approximately normal. The mean of the x-distribution is 2500, and
the standard deviation is 300. In a large dairy an inspector takes 42
random samples of milk each day and averages the bacteria count
of the 42 samples to obtain x .
(a) Assuming the milk is not contaminated, what is (describe) the
distribution of x ?
(b) Assuming the milk is not contaminated, what is the probability
that the average bacteria count for one day is between 2350
and 2650 bacteria per milliliter?
(c) Suppose one day the x for 42 samples was not between 2350
and 2650. Would you pass the sample? Why/Why Not?
7.2 The Central Limit Theorem (Page 7 of 10)
Guided Exercise 3
Tunnels are often used instead of long
roads over high passes. However, too
many vehicles in a tunnel at the same
time can be hazardous due to emissions.
If x represents the time for a vehicle to
go through a tunnel, it is known that the
x-distribution is normal with a mean of
12.1 minutes and a standard deviation of 3.8 minutes. Safety
Engineers have said that vehicles should spend between 11 and 13
minutes in the tunnel. Under ordinary conditions about 50 vehicles
are in the tunnel at a time.
a. What is the probability that the time for one vehicle (x) to pass
through the tunnel is between 11 and 13 minutes?
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
b. What is the probability that the mean time for 50 vehicles ( x )
to pass through the tunnel will be between 11 and 13 minutes?
c. Comment on the results.
7.2 The Central Limit Theorem (Page 8 of 10)
Example A
Suppose x = a person’s I.Q. score. The x-distribution is normal
with a mean of 100 and standard deviation of 14.
a. What is the probability that a person selected at random has an
I.Q. (x) between 96 and 104?
b. If a random sample of size n = 5 is drawn, find the probability
that the mean ( x ) of the sample will be between 96 and 104.
c. Repeat part b for n = 10, 15, 20, and 30.
n
x
1
5
10
15
20
30
100 14
100 6.2610
100
100
100
100
x
P(96  x  104) P(x  108)
0.2249
0.4771
7.2 The Central Limit Theorem (Page 9 of 10)
How the x distribution
changes as the
Sample Size
Increases
n = 30
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
n=5
n=1
x = IQ score
x = mean IQ score
for sample size n
d. As the sample size n
increases what happens to:
i.  x
ii.  x
iii. The probability of an x
occurring in an interval
containing  and near  .
iv. The probability of an x
occurring in an interval
not containing  and
away from  (i.e. in the
tails)?
n
1
5
10
15
20
30
x
14
6.2610
4.4272
3.6148
3.1305
2.5560
P(96  x  104)
P(x  108)
0.2249
0.4771
0.6337
0.7315
0.7987
0.8824
0.2839
0.1007
0.0354
0.0134
0.0053
0.0009
7.2 The Central Limit Theorem (Page 10 of 10)
Exercise 7.2 #17
Let x be a random variable that represents the checkout time in
minutes at the express line at a grocery store. Based on extensive
surveys the mean of the x-distribution is 2.7 minutes with a
standard deviation of 0.6 minutes. What is the probability that the
total checkout time of the next 30 customers is less than 90
minutes? Answer the question in the following steps.
a. Let x i (i = 1, 2, 3, . . . , 30) be the checkout times for the next
30 customers and w  x1  x2  x3   x30 . Explain why we
are being asked to compute P(w < 90).
b. Show that w < 90 is equivalent to x = (w/30) < 3.
c. What three things does the central limit theorem say about the
x -distribution?
d. Compute P( x < 3) = P(w < 90).