Download Section 8.2 - TopCatMath

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Taylor's law wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

German tank problem wikipedia , lookup

Student's t-test wikipedia , lookup

Misuse of statistics wikipedia , lookup

Transcript
Math 160 - Cooley
Intro to Statistics
OCC
Section 8.2 – Confidence Intervals for One Population Mean When σ Is Known
Statistical Inference – Provides methods for drawing conclusions about a population from sample data.
Confidence Interval For The Mean Of A Normal Population
Draw an SRS of size n from a Normal population having unknown mean µ and known standard deviation σ. A
level (1 – α) confidence interval for µ is
x  z
The critical value z
2

2
n
is illustrated below and found in Table II.
INTERVAL PROCEDURE #1 – The One-Mean z-Interval Procedure
Purpose: To find a confidence interval for a population mean, μ.
Assumptions
1) Simple random sample
2) Normal population or large sample (n ≥ 30)
3) σ known
Step 1 – For a confidence level (1 – α), use Table II to find z 2 .
Step 2 – The confidence interval for µ is from
x  z
where z
2

2
to x  z

2
n
n
is found in Step 1, n is the sample size, and x is computed from the sample data.
Step 3 – Interpret the confidence interval.
The confidence interval is exact for normal populations and is approximately correct for large samples from
nonnormal populations.
The One-Mean z-Interval Procedure For A Population Mean
 For small samples–say, of size less than 15–the z-interval procedure should be used only when the
variable under consideration is normally distributed or very close to being so.
 For samples of moderate size–say, between 15 and 30–the z-interval procedure can be used unless the
data contains outliers or the variable under consideration is far from being normally distributed.
 For large samples–say, of size 30 or more–the z-interval procedure can be used essentially without
restriction. However, if outliers are present and their removal is not justified, you should compare the
confidence intervals obtained with and without the outliers to see what effect the outliers have. If the
effect is substantial, use a different procedure or take another sample.
 If outliers are present but their removal is justified and results in a data set for which the z-interval
procedure is appropriate (as previously stated), the procedure can be used.
-1-
Math 160 - Cooley
Intro to Statistics
OCC
Section 8.2 – Confidence Intervals for One Population Mean When σ Is Known
A Fundamental Principle of Data Analysis
Before performing a statistical-inference procedure, examine the sample data. If any of the conditions required
for using the procedure appear to be violated, do not apply the procedure. Instead use a different, more
appropriate procedure, or, if you are unsure of one, consult a statistician.
Confidence and Precision
For a fixed sample size, decreasing the confidence level improves the precision, and vice-versa.
Margin of Error for the Estimate of μ
The margin of error for the estimate of μ is
E  z 2 

n
Note: This is simply half the length of the confidence interval.
Margin of Error, Precision, and Sample Size
The length of a confidence interval for a population mean, μ, and therefore the precision with which x
estimates μ, is determined by the margin of error, E. For a fixed confidence level, increasing the sample size
improves the precision, and vice versa.
Sample Size For Estimating μ
The sample size required for a (1 – α)-level confidence interval for μ with a specified margin of error, E, is given
by the formula
 z 2  
n  
 E




2
rounded up to the nearest whole number.
 Exercises:
Later on in the text, the term(s),  z / 2 , – z or z , will be referred to as critical value(s).
1)
For a 90% confidence level, find both critical values.
2)
For a 95% confidence level, find both critical values.
3)
For a 99% confidence level, find both critical values.
4)
For a 97% confidence level, find both critical values.
-2-
Math 160 - Cooley
Intro to Statistics
OCC
Section 8.2 – Confidence Intervals for One Population Mean When σ Is Known
 Exercises:
5)
6)
You measure the lifetime of a random sample of 25 tires of a certain brand. The sample mean is x = 50
months. Suppose that the lifetimes for tires of this brand follow a normal distribution, with unknown
mean µ and standard deviation σ = 5 months.
a)
Find a 95% confidence interval for µ.
b)
Determine the margin of error, E.
c)
Explain the meaning of E in this context in terms of the accuracy of the estimate.
d)
Determine the sample size required to have a margin of error of 0.1 months and a 95%
confidence level.
e)
Obtain a 95% confidence interval for the mean lifetime of this certain brand of tire if a sample
of the size determined in part (d) has a mean of 50.8 months.
The time (in number of days) until maturity of a certain variety of tomato plant is normally distributed
with mean µ and standard deviation σ = 2.4. If you select a simple random sample of four plants of this
variety and measure the time until maturity. The four times, in days are: 63, 69, 62, 66. Based on these
data, find a 99% confidence interval for µ.
-3-
Math 160 - Cooley
Intro to Statistics
OCC
Section 8.2 – Confidence Intervals for One Population Mean When σ Is Known
Interpretation and the Meaning of Confidence Intervals for µ
Consider example #5:
You measure the lifetime of a random sample of 25 tires of a certain brand. The sample mean is
x = 50 months. Suppose that the lifetimes for tires of this brand follow a normal distribution,
with unknown mean µ and standard deviation σ = 5 kg. Find a 95% confidence interval for µ.
During class we found that the 95% confidence interval for the population mean, µ, was (48.04, 51.96).
Students and teachers sometimes refer to this by saying that “We are 95% confident that µ lies in the interval
(48.04, 51.96)”.
However, while this wording is okay for student use in the classroom, it is technically incorrect, and would not
be used by actual statisticians when interpreting the meaning of a confidence interval. Because you would never
describe yourself as being 75% happy or 50% in love with someone, you also can’t be 95% confident about
something.
In actuality, a 95% confidence interval means that the method used to obtain the confidence interval
produces an interval that encapsulates the unknown, µ, 95% of the time. To be more precise a statistician
would say that if we were to take 100 samples of size 25 from the population, and from those samples construct
100 sample means, and then construct 100 confidence intervals around the sample means at the 95% level, then
95 of the 100 confidence intervals constructed from the samples would include the population mean, while the
other 5 of the 100 would not. So, for an individual sample of size 25 from the tire population there is a 95%
chance that its corresponding confidence interval encapsulates the population mean.
When interpreting the wording of confidence intervals for µ taken at level 1 – α taken from a sample of size n,
the following phrases are equivalent:


 
 .
 We are (1 – α) % confident that µ lies in the interval  x  z 2
, x  z 2
n
n

 (1 – α) % of all samples of size n taken produce an interval that contains the population mean µ.
 The method used to obtain the confidence interval produces an interval that encapsulates the unknown,
µ, (1 – α) % of the time.
 In 100 samples of size n taken from the population, (1 – α) of then will produce confidence intervals
that contain the population mean µ.
-4-
Math 160 - Cooley
Intro to Statistics
OCC
Section 8.2 – Confidence Intervals for One Population Mean When σ Is Known
Simulation of confidence intervals
Below is a computer simulation of 20 samples of 36 new mobile home prices each. From there, 20 confidence
intervals were constructed. For the simulation, we assumed   65 (i.e., $65 thousand) and   7.2 (i.e., $7.2
thousand). We assumed these numbers based off last year’s mean and standard deviation. In reality, we don’t
know the current  , so we are assuming last year’s value for  to illustrate a point.
As you can see for the 20 samples, each one had a different sample mean. The standard deviation of the sample
mean was calculated at a constant of 2.4. The simulation shows that 19 of the 20 confidence intervals actually
contained the population mean of 65, which is 95%. If we simulated a much greater sample size, then the
confidence would be closer to the true actually 95.44%.
So, theoretically, if we took 10000 samples of 36 new mobile home prices each and constructed confidence
intervals, then we should expect 9544 of them or 95.44% of them to contain the true population mean  .
However, this is a theoretical probability. If it is not 95.44%, then it will be very close to it. Think of this model
rolling a single die 6 times. Theoretically, we should expect the outcomes to be 1,2,3,4,5, and 6.
-5-
Math 160 - Cooley
Intro to Statistics
OCC
Section 8.2 – Confidence Intervals for One Population Mean When σ Is Known
 Sample Test Multiple Choice Questions:
7)
When a confidence level decreases,
A) the confidence interval increases
B) the confidence interval decreases
C) the margin of error increases
D) the margin of error decreases
E) Both A and C.
F) Both A and D.
G) Both B and C.
H) Both B and D.
8)
What generally happens to the sampling error as the sample size is increased?
A) It gets smaller.
B) It gets larger
C) It gets more predictable.
D) It gets less predictable.
E) None of these.
9)
Suppose a confidence interval is (4.11 , 5.77). Then the margin of error for  is:
A) 9.88
B) 23.7147
C) 1.66
D) 0.83
E) None of these.
10)
Weights of women in one age group are normally distributed with a standard deviation of 17 lb. A
researcher wishes to estimate the mean weight of all women in this age group. Find how large a sample
must be drawn in order to be 90 percent confident that the sample mean will not differ from the
population mean by more than 3.8 lb.
A) 32
B) 33
C) 54
D) 55
E) 76
AB) 77
11)
In actuality, a 95% confidence interval means that the method used to obtain the confidence interval
produces an interval that encapsulates the unknown, µ, 95% of the time. To be more precise a
statistician would say
A) that if we were to take 100 samples of size 25 from the population, and from those
samples construct 100 sample means, and then construct 100 confidence intervals around the
sample means at the 95% level, then 95 of the 100 confidence intervals constructed from the
samples would include the population mean, while the other 5 of the 100 would not.
B) we can be 95% confident that the population mean, µ, will be contained in the confidence interval.
-6-