Download Estimating population mean

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Degrees of freedom (statistics) wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Taylor's law wikipedia , lookup

Resampling (statistics) wikipedia , lookup

German tank problem wikipedia , lookup

Misuse of statistics wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
Determining Sample Size
E=
zα / 2
Slide 1
pˆ qˆ
n
(solve for n by algebra)
n=
( zα / 2)2 pˆ qˆ
E2
Sample Size for Estimating
Proportion p
ˆ
When an estimate of p is known:
n=
( zα / 2 )2 pˆ qˆ
Formula 6-2
E2
When no estimate of p is known:
n=
( zα / 2)2 0.25
E2
Formula 6-3
Slide 2
Slide 3
Example: Suppose a sociologist wants to determine
the current percentage of U.S. households using e-mail.
How many households must be surveyed in order to be
95% confident that the sample percentage is in error by no
more than four percentage points?
a) Use this result from an earlier study: In 1997, 16.9% of U.S.
households used e-mail (based on data from The World
Almanac and Book of Facts).
b) Assume that we have no prior information suggesting a
possible value of p.
ˆ
1- α=95%=.95 , α=0.05
z α/2=z.025=1.96.
E=4%=0.04
(a) phat=16.9%=.169
(b) Use phat=0.5
Slide 4
a) Use this result from an earlier study: In 1997, 16.9% of U.S.
households used e-mail (based on data from The World
Almanac and Book of Facts).
ˆˆ
n = [za/2 ]2 p q
E2
= [1.96]2 (0.169)(0.831)
0.042
= 337.194
= 338 households
To be 95% confident that our
sample percentage is within
four percentage points of the
true percentage for all
households, we should
randomly select and survey
338 households.
Slide 5
b) Assume that we have no prior information suggesting a
possible value of p.
ˆ
n = [za/2 ]2 • 0.25
E2
= (1.96)2 (0.25)
0.042
= 600.25
= 601 households
With no prior information,
we need a larger sample to
achieve the same results
with 95% confidence and an
error of no more than 4%.
Finding the Point Estimate
and E from a
Confidence Interval
ˆ
(upper confidence limit) + (lower confidence limit)
Point estimate of p:
ˆ
p=
2
Margin of Error:
E = (upper confidence limit) — (lower confidence limit)
2
Slide 6
Section 6-3
Estimating a Population Mean:
σ Known
Slide 7
Assumptions:
1. The sample is a simple random
sample.
2. The value of the population standard
deviation σ is known.
3. Either or both of these conditions is
satisfied: The population is
normally distributed or n > 30.
Point estimate
Slide 8
! Population mean: µ (unknown)
! Point Estimate
is a single value (or point) used to approximate
a population parameter.
Example:
The sample mean x is the best point estimate of the
population mean µ.
Slide 9
Example:
A study found the body temperatures of 106
healthy adults. The sample mean was 98.2 degrees and the
sample standard deviation was 0.62 degrees. Find the point
estimate of the population mean µ of all body temperatures.
Because the sample mean x is the
best point estimate of the population
mean µ, we conclude that the best
point estimate of the population
mean µ of all body temperatures is
98.20o F.
Interval Estimate
Slide 10
Level of Confidence
! The confidence level =1 - α,
!where α is the complement of the confidence level.
! For a 0.95(95%) confidence level, α = 0.05. For
a 0.99(99%) confidence level, α = 0.01.
Slide 11
Margin of Error
is the maximum likely difference observed
between sample mean x and population
mean µ, and is denoted by E.
E = zα/2 •
σ
n
Standard Error of the sample mean
Confidence Interval
Slide 12
(or Interval Estimate) for
Population Mean µ when σ is known
x –E <µ< x +E
x +E
(x – E, x + E)
Slide 13
Example:
A study found the body temperatures of 106
healthy adults. The sample mean was 98.2 degrees and the
sample standard deviation was 0.62 degrees. Find the
margin of error E and the 95% confidence interval for µ.
n = 106
x = 98.20o
s = 0.62o
α = 0.05
α /2 = 0.025
z α/ 2 = 1.96
E = z α/ 2 • σ
n
= 1.96 • 0.62
106
= 0.12
x –E <µ< x +E
98.08 < µ < 98.32
o
o
98.20o – 0.12
98.08o
<µ<
< µ <
98.20o + 0.12
98.32o
Sample Size for Estimating
Mean µ
n=
(zα/2) • σ
E
2
Slide 14
Round-Off Rule for
Sample Size n
Slide 15
When finding the sample size n, if the use of
Formula 6-5 does not result in a whole number,
always increase the value of n to the next larger
whole number.
Finding the Sample Size n
when σ is unknown
Slide 16
1. Use the range rule of thumb (see Section 2-5) to
estimate the standard deviation as follows: σ ≈
range/4.
2. Conduct a pilot study by starting the sampling
process. Based on the first collection of at least
31 randomly selected sample values, calculate the
sample standard deviation s and use it in place of
σ.
3. Estimate the value of σ by using the results of
some other study that was done earlier.
Slide 17
Example:
Assume that we want to estimate the mean
IQ score for the population of statistics professors. How
many statistics professors must be randomly selected for
IQ tests if we want 95% confidence that the sample mean is
within 2 IQ points of the population mean? Assume that σ
= 15, as is found in the general population.
α = 0.05
α /2 = 0.025
z α/ 2 = 1.96
E = 2
σ = 15
n =
1.96 • 15 2= 216.09 = 217
2
With a simple random sample of only
217 statistics professors, we will be
95% confident that the sample mean
will be within 2 points of the true
population mean µ.
6-4: σ Not Known
Assumptions
Slide 18
1) The sample is a simple random sample.
2) Either the sample is from a normally
distributed population, or n > 30.
Use Student t distribution
Student t Distribution
Slide 19
If the distribution of a population is
essentially normal, then the distribution of
t =
x-µ
s
n
! is essentially a Student t Distribution for all
samples of size n, and is used to find critical
values denoted by tα/2.
Slide 20
Degrees of Freedom (df )
corresponds to the number of sample values
that can vary after certain restrictions have
been imposed on all data values
df = n – 1
in this section.
Margin of Error E
for Estimate of µ
Slide 21
Based on an Unknown σ and a Small Simple Random
Sample from a Normally Distributed Population
E = tα/
s
2
n
where tα / 2 has n – 1 degrees of freedom.
Student t Distributions for
n = 3 and n = 12
Slide 22
Important Properties of the
Student t Distribution
Slide 23
1. The Student t distribution is different for different sample sizes
(see Figure 6-5 for the cases n = 3 and n = 12).
2. The Student t distribution has the same general symmetric bell
shape as the normal distribution but it reflects the greater
variability (with wider distributions) that is expected with small
samples.
3. The Student t distribution has a mean of t = 0 (just as the
standard normal distribution has a mean of z = 0).
4. The standard deviation of the Student t distribution varies with
the sample size and is greater than 1 (unlike the standard normal
distribution, which has a σ = 1).
5. As the sample size n gets larger, the Student t distribution gets
closer to the normal distribution.
Confidence Interval for
the Estimate of E
Slide 24
Based on an Unknown σ and a Small Simple Random
Sample from a Normally Distributed Population
x–E <µ<x +E
where
E = tα/2 s
n
tα/2 found in Table A-3
Procedure for Constructing a
Confidence Interval for µ
when σ is not known
Slide 25
1. Verify that the required assumptions are met.
2. Using n — 1 degrees of freedom, refer to Table A3 and find the critical value tα/2 that corresponds
to the desired degree of confidence.
3. Evaluate the margin of error E = tα/2 • s / n .
4. Find the values of x - E and x + E. Substitute those
values in the general format for the confidence
interval:
x –E <µ< x +E
5. Round the resulting confidence interval limits.
Slide 26
Example:
A study found the body temperatures of 106
healthy adults. The sample mean was 98.2 degrees and
the sample standard deviation was 0.62 degrees. Find the
margin of error E and the 95% confidence interval for µ.
n = 106
x = 98.20o
s = 0.62o
α = 0.05
α /2 = 0.025
t α/ 2 = 1.984
E = t α/ 2 • s = 1.984 • 0.62 = 0.1195
n
106
x–E <µ< x +E
98.20o – 0.1195 < µ < 98.20o + 0.1195
98.08o <
µ < 98.32o
The interval is the same here as in Section 6-2, but in some
other cases, the difference would be much greater.
Using the Normal and
t Distribution
Figure 6-6
Slide 27