Download Confidence Interval for

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Chapter Contents
Chapter 8
Confidence Intervals
Confidence Interval for a Mean (μ) with Known σ
Confidence Interval for a Mean (μ) with Unknown σ
Confidence Interval for a Proportion (π)
Estimating from Finite Populations
Sample Size Determination for a Mean
Sample Size Determination for a Proportion
8-1
Chapter 8
Confidence Intervals
Chapter Learning Objectives (LO’s)
Construct a 90, 95, or 99 percent confidence interval for μ.
Know when to use Student’s t instead of z to estimate μ.
Construct a 90, 95, or 99 percent confidence interval for π.
Construct confidence intervals for finite populations.
Calculate sample size to estimate a mean or proportion.
Construct a confidence interval for a variance (optional).
8-2
Chapter 8
Confidence Interval for a Mean () with
known ()
What is a Confidence Interval?
8-3
What is a Confidence Interval?
•
Chapter 8
Confidence Interval for a Mean () with known ()
The confidence interval for  with known  is:
8-4
•
A higher confidence level leads to a wider confidence interval.
•
Greater confidence
implies loss of precision
(i.e. greater margin of
error).
95% confidence is
most often used.
•
Chapter 8
Choosing a Confidence Level
Confidence Intervals for Example 8.2
8-5
•
•
•
Chapter 8
Interpretation
A confidence interval either does or does not contain .
The confidence level quantifies the risk.
Out of 100 confidence intervals, approximately 95% may contain ,
while approximately 5% might not contain  when constructing 95%
confidence intervals.
When Can We Assume Normality?
If  is known and the population is normal, then we can safely use the
formula to compute the confidence interval.
• If  is known and we do not know whether the population is normal, a common
rule of thumb is that n  30 is sufficient to use the formula as long as the
distribution
Is approximately symmetric with no outliers.
• Larger n may be needed to assume normality if you are sampling from a strongly
skewed population or one with outliers.
•
8-6
Chapter 8
Confidence Interval for a Mean () with Unknown ()
Student’s t Distribution
•
Use the Student’s t distribution instead of the normal distribution
when the population is normal but the standard deviation  is
unknown and the sample size is small.
8-7
Chapter 8
Student’s t Distribution
8-8
Chapter 8
Student’s t Distribution
•
•
t distributions are symmetric and shaped like the standard normal
distribution.
The t distribution is dependent on the size of the sample.
Comparison of Normal and Student’s t
Figure 8.11
8-9
•
•
•
•
Chapter 8
Degrees of Freedom
Degrees of Freedom (d.f.) is a parameter based on the sample
size that is used to determine the value of the t statistic.
Degrees of freedom tell how many observations are used to
calculate , less the number of intermediate estimates used in
the calculation. The d.f for the t distribution in this case, is given
by d.f. = n -1.
As n increases, the t distribution approaches the shape of the
normal distribution.
For a given confidence level, t is always larger than z, so a
confidence interval based on t is always wider than if z were used.
8-10
•
•
•
Chapter 8
Comparison of z and t
For very small samples, t-values differ substantially from the
normal.
As degrees of freedom increase, the t-values approach the
normal z-values.
For example, for n = 31, the degrees of freedom, d.f. = 31 – 1 =
30.
So for a 90 percent confidence interval, we would use
t = 1.697, which is only slightly larger than z = 1.645.
8-11
Chapter 8
Example GMAT Scores Again
Figure 8.13
8-12
•
Construct a 90% confidence interval for the mean GMAT score of
all MBA applicants.
x = 510
•
•
Chapter 8
Example GMAT Scores Again
s = 73.77
Since  is unknown, use the Student’s t for the confidence interval
with d.f. = 20 – 1 = 19.
First find t/2 = t.05 = 1.729 from T-table.
8-13
Chapter 8
•
For a 90% confidence
interval, use Appendix
D to find t0.05 = 1.729
with d.f. = 19.
Note: One can use Excel,
Minitab, etc. to
obtain these values
as well as to
construct confidence
Intervals.
We are 90 percent confident
that the true mean GMAT
score might be within the
interval [481.48, 538.52]
8-14
•
•
Chapter 8
Confidence Interval Width
Confidence interval width reflects
- the sample size,
- the confidence level and
- the standard deviation.
To obtain a narrower interval and more precision
- increase the sample size or
- lower the confidence level (e.g., from 90% to 80% confidence).
8-15
•
•
•
•
Chapter 8
Using T-Table
Beyond d.f. = 50, Appendix D shows d.f. in steps of 5 or 10.
If the table does not give the exact degrees of freedom, use the
t-value for the next lower degrees of freedom.
This is a conservative procedure since it causes the interval to be
slightly wider.
A conservative statistician may use the t distribution for
confidence intervals when σ is unknown because
using z would underestimate the margin of error.
8-16
•
Chapter 8
Confidence Interval for a Proportion ()
A proportion is a mean of data whose only values are 0 or 1.
8-17
•
Chapter 8
Applying the CLT
The distribution of a sample proportion p = x/n is symmetric if  = .50
and regardless of , approaches symmetry as n increases.
8-18
Chapter 8
When is it Safe to Assume Normality of p?
•
Rule of Thumb: The sample proportion p = x/n may be assumed to
be normal if both n 10 and n(1- ) 10.
Sample size to assume
normality:
Table 8.9
8-19
•
Chapter 8
Confidence Interval for 
Since  is unknown, the confidence interval for p = x/n
(assuming a large sample) is
8-20
Chapter 8
Example Auditing
8-21
Chapter 8
N = population size; n = sample size
8-22
Chapter 8
Sample Size determination for a Mean
Sample Size to Estimate 
•
To estimate a population mean with a precision of + E (allowable
error), you would need a sample of size. Now,
8-23
Chapter 8
How to Estimate ?
•
Method 1: Take a Preliminary Sample
Take a small preliminary sample and use the sample s in place of
 in the sample size formula.
•
Method 2: Assume Uniform Population
Estimate rough upper and lower limits a and b and set
 = [(b-a)/12]½.
•
Method 3: Assume Normal Population
Estimate rough upper and lower limits a and b and set  = (b-a)/4.
This assumes normality with most of the data with  ± 2 so the
range is 4.
•
Method 4: Poisson Arrivals
In the special case when  is a Poisson arrival rate, then  =  .
8-24
To estimate a population proportion with a precision of ± E
(allowable error), you would need a sample of size
•
Since  is a number between 0 and 1, the allowable error E is
also between 0 and 1.
Chapter 8
•
8-25
•
•
•
Chapter 8
How to Estimate ?
Method 1: Assume that  = .50
This conservative method ensures the desired precision. However,
the sample may end up being larger than necessary.
Method 2: Take a Preliminary Sample
Take a small preliminary sample and use the sample p in place of 
in the sample size formula.
Method 3: Use a Prior Sample or Historical Data
How often are such samples available? Unfortunately,  might be
different enough to make it a questionable assumption.
8-26