Download Document

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Confidence interval wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
Chapter
9
Estimating the
Value of a
Parameter Using
Confidence
Intervals
© 2010 Pearson Prentice Hall. All rights reserved
Section
9.2
Confidence Intervals
about a Population
Mean When the
Population Standard
Deviation is
Unknown
© 2010 Pearson Prentice Hall. All rights reserved
Objectives
1. Know the properties of Student’s
t-distribution
2. Determine t-values
3. Construct and interpret a confidence interval
for a population mean
© 2010 Pearson Prentice Hall. All rights reserved
9-3
Objective 1
• Know the Properties of Student’s
t-Distribution
© 2010 Pearson Prentice Hall. All rights reserved
9-4
Student’s t-Distribution
Suppose that a simple random sample of size n is
taken from a population. If the population
from which the sample is drawn follows a
normal distribution, the distribution of
x 
t
s
n
follows Student’s t-distribution with n-1
degrees of freedom where x is the sample
mean and s is the sample standard deviation.

© 2010 Pearson Prentice Hall. All rights reserved
9-5
Parallel Example 1: Comparing the Standard Normal
Distribution to the t-Distribution Using Simulation
a)
Obtain 1,000 simple random samples of size n=5 from a
normal population with =50 and =10.
b)
Determine the sample mean and sample standard deviation
for each of the samples.
c)
d)
Compute z 
x 

n
and
x 
for each sample.
t
s
n
Draw a histogram for both z and t.


© 2010 Pearson Prentice Hall. All rights reserved
9-6
Histogram for z
© 2010 Pearson Prentice Hall. All rights reserved
9-7
Histogram for t
© 2010 Pearson Prentice Hall. All rights reserved
9-8
CONCLUSIONS:
• The histogram for z is symmetric and bell-shaped
with the center of the distribution at 0 and virtually all
the rectangles between -3 and 3. In other words, z
follows a standard normal distribution.
• The histogram for t is also symmetric and bell-shaped
with the center of the distribution at 0, but the
distribution of t has longer tails (i.e., t is more
dispersed), so it is unlikely that t follows a standard
normal distribution. The additional spread in the
distribution of t can be attributed to the fact that we
use s to find t instead of . Because the sample
standard deviation is itself a random variable (rather
than a constant such as ), we have more dispersion
in the distribution of t.
© 2010 Pearson Prentice Hall. All rights reserved
9-9
Properties of the t-Distribution
1. The t-distribution is different for different degrees of
freedom.
2. The t-distribution is centered at 0 and is symmetric
about 0.
3. The area under the curve is 1. The area under the
curve to the right of 0 equals the area under the curve
to the left of 0 equals 1/2.
4. As t increases without bound, the graph approaches,
but never equals, zero. As t decreases without bound,
the graph approaches, but never equals, zero.
© 2010 Pearson Prentice Hall. All rights reserved
9-10
Properties of the t-Distribution
5. The area in the tails of the t-distribution is a little
greater than the area in the tails of the standard normal
distribution, because we are using s as an estimate of
, thereby introducing further variability into the tstatistic.
6. As the sample size n increases, the density curve of t
gets closer to the standard normal density curve. This
result occurs because, as the sample size n increases,
the values of s get closer to the values of , by the
Law of Large Numbers.
© 2010 Pearson Prentice Hall. All rights reserved
9-11
© 2010 Pearson Prentice Hall. All rights reserved
9-12
Objective 2
• Determine t-Values
© 2010 Pearson Prentice Hall. All rights reserved
9-13
© 2010 Pearson Prentice Hall. All rights reserved
9-14
Parallel Example 2: Finding t-values
Find the t-value such that the area under the tdistribution to the right of the t-value is 0.2 assuming
10 degrees of freedom. That is, find t0.20 with 10
degrees of freedom.
© 2010 Pearson Prentice Hall. All rights reserved
9-15
Solution
The figure to the left
shows the graph of the
t-distribution with 10
degrees of freedom.
The unknown value of t is labeled, and the area under
the curve to the right of t is shaded. The value of t0.20
with 10 degrees of freedom is 0.8791.
© 2010 Pearson Prentice Hall. All rights reserved
9-16
Objective 3
• Construct and Interpret a Confidence Interval
for a Population Mean
© 2010 Pearson Prentice Hall. All rights reserved
9-17
Constructing a (1-)100% Confidence
Interval for ,  Unknown
Suppose that a simple random sample of size n is taken
from a population with unknown mean  and
unknown standard deviation . A (1-)100%
confidence interval for  is given by
Lower
bound:
s
x  t 
n
2
Upper
bound:
s
x  t 
n
2
Note: The interval is exact when the population is normally
distributed. It is approximately correct for nonnormal
populations, provided that n is large enough.


© 2010 Pearson Prentice Hall. All rights reserved
9-18
Parallel Example 3: Constructing a Confidence Interval
about a Population Mean
The pasteurization process reduces the amount of
bacteria found in dairy products, such as milk. The
following data represent the counts of bacteria in
pasteurized milk (in CFU/mL) for a random sample of
12 pasteurized glasses of milk. Data courtesy of Dr.
Michael Lee, Professor, Joliet Junior College.
Construct a 95% confidence interval for the bacteria
count.
© 2010 Pearson Prentice Hall. All rights reserved
9-19
NOTE: Each observation is in tens of thousand.
So, 9.06 represents 9.06 x 104.
© 2010 Pearson Prentice Hall. All rights reserved
9-20
Solution: Checking Normality and
Existence of Outliers
Normal Probability Plot for CFU/ml
© 2010 Pearson Prentice Hall. All rights reserved
9-21
Solution: Checking Normality and
Existence of Outliers
Boxplot of CFU/mL
© 2010 Pearson Prentice Hall. All rights reserved
9-22
• x  6.41 and s  4.55
•   0.05, n  12, so t 0.05  2.201
2
Lower
bound:
4.55
6.41 2.201
 3.52
12
Upper
bound:

4.55
6.41 2.201
 9.30
12
The 95% confidence interval for the mean bacteria
count in pasteurized milk is (3.52, 9.30).

© 2010 Pearson Prentice Hall. All rights reserved
9-23
Parallel Example 5: The Effect of Outliers
Suppose a student miscalculated the amount of
bacteria and recorded a result of 2.3 x 105. We
would include this value in the data set as 23.0.
What effect does this additional observation have on
the 95% confidence interval?
© 2010 Pearson Prentice Hall. All rights reserved
9-24
Solution: Checking Normality and
Existence of Outliers
Boxplot of CFU/mL
© 2010 Pearson Prentice Hall. All rights reserved
9-25
Solution
• x  7.69 and s  6.34
•   0.05, n 13, so t 0.05  2.179
2
Lower
bound:
6.34
7.69  2.179 
 3.86
13
6.34
Upper
7.69  2.179 
 11.52
bound:
13

The 95% confidence interval for the mean bacteria
count in pasteurized milk, including the outlier

is (3.86, 11.52).
© 2010 Pearson Prentice Hall. All rights reserved
9-26
CONCLUSIONS:
• With the outlier, the sample mean is larger because
the sample mean is not resistant
• With the outlier, the sample standard deviation is
larger because the sample standard deviation is not
resistant
• Without the outlier, the width of the interval
decreased from 7.66 to 5.78.
Without
Outlier

With
Outlier
x
s
95% CI
6.41
4.55
(3.52, 9.30)
7.69
6.34
(3.86, 11.52)
© 2010 Pearson Prentice Hall. All rights reserved
9-27