Download LESSON 18: CONFIDENCE INTERVAL ESTIMATION ESTIMATION

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

Taylor's law wikipedia , lookup

Student's t-test wikipedia , lookup

Resampling (statistics) wikipedia , lookup

German tank problem wikipedia , lookup

Transcript
LESSON 18: CONFIDENCE INTERVAL ESTIMATION
Outline
• Confidence interval: mean
– Known s
– Selecting sample size
– Unknown s
– Small population
• Confidence interval: proportion
• Confidence interval: variance
1
ESTIMATION
• Point estimator: A point estimator draws inferences about a
population by estimating the value of an unknown
parameter using a single value or point.
• Interval estimator: An interval estimator draws inferences
about a population by estimating the value of an unknown
parameter using an interval.
2
1
ESTIMATION
• Example: A manager of a plant making cellular phones
wants to estimate the time to assemble a phone. A sample
of 30 assemblies show a mean time of 400 seconds. The
sample mean time of 400 seconds is a point estimate. An
alternate estimate is a range e.g., 390 to 410. Such a range
is an interval estimate. The computation method of interval
estimate is discussed in Chapter 10.
3
ESTIMATION
• Interval estimates are reported with the end points e.g.,
[390, 410]
or, equivalently, with a central value and its difference from
each end point e.g.,
400±10
4
2
ESTIMATION
• Precision of an interval estimate: The limits indicate the
degree of precision. A more precise estimate is the one
with less spread between limits e.g., [395,405] or 400±5
• Reliability of an interval estimate: The reliability of an
interval estimate is the probability that it is correct.
5
ESTIMATION
• Unbiased estimator: an unbiased estimator of a population
parameter is an estimator whose expected value is equal to
that parameter.
• In Chapter 2, the sample variance is defined as follows:
n
s2 =
∑(X
i
− X )2
i =1
n −1
• The use of n-1 in the denominator is necessary to get an
unbiased estimator of variance. The use of n in the
denominator produces a smaller value of variance.
6
3
ESTIMATION
• Consistent estimators: An estimator is consistent if the
precision and reliability improves as the sample size is
increased. The estimators X and P are consistent.
• Efficient estimators: An estimator is more efficient than
another if for the same sample size it will provide a greater
sampling precision and reliability.
7
INTERVAL ESTIMATOR OF MEAN (KNOWN s )
• For some confidence level 1 -α, sample size n, sample
mean, X and the population standard deviation, σ the
confidence interval estimator of mean, µ is as follows:
σ

σ
σ 
X ± zα / 2
also written as  X − zα / 2
, X + zα / 2
n
n
n 

• Recall that zα / 2 is that value of z for which area in the upper
tail is α/2
• Lower confidence limit (LCL)
X − zα / 2
• Upper confidence limit (UCL)
X + zα / 2
σ
n
σ
n
8
4
Area=1-α
σX =
f(x)
CONFIDENCE
INTERVAL
σ
n
Area
Area
=0.5-α/2 =0.5-α/2
Area=α/2
Area=α/2
-zα/2
X − zα / 2
n
X
zα/2
X + zα / 2
σ
n
9
Area=0.82
σX =
f(x)
AREAS FOR
THE 82%
CONFIDENCE
INTERVAL
σ
z=0
Area
=0.41
Area
=0.41
Area=0.09
Area=0.09
-z0.09 z=0
X − z0.09
σ
n
σ
n
X
z0.09
X + z0.09
σ
n
10
5
Area=0.82
σX =
f(x)
AREAS AND z
AND X
VALUES FOR
THE 82%
CONFIDENCE
INTERVAL
Area
=0.41
Area
=0.41
Area=0.09
Area=0.09
-1.34 z=0
X −1.34
σ
n
σ
n
X
1.34
X + 1.34
σ
n
11
INTERVAL ESTIMATOR OF MEAN (KNOWN s )
• Interpretation:
– There is (1-α) probability that the sample mean will be
equal to a value such that the interval (LCL, UCL) will
include the population mean
– If the same procedure is used to obtain a confidence
interval estimate of the population mean for a sufficiently
large number of k times, the interval (LCL, UCL) is
expected to include the population mean (1-α)k times
• Wrong interpretation: It’s wrong to interpret that there is
(1-α) probability that the population mean lies between LCL
and UCL. Population mean is fixed, not uncertain /
probabilistic.
12
6
INTERVAL ESTIMATOR OF MEAN (KNOWN s )
• Interpretation of the 95% confidence interval:
– There is 0.95 probability that the sample mean will be
equal to a value such that the interval (LCL, UCL) will
include the population mean
– If the same procedure is used to obtain a confidence
interval estimate of the population mean for a sufficiently
large number of k times, the interval (LCL, UCL) is
expected to include the population mean 0.95k times –
• Wrong interpretation: It’s wrong to interpret that there is
0.95 probability that the population mean lies between LCL
and UCL. Population mean is fixed, not uncertain /
probabilistic.
13
INTERVAL ESTIMATOR OF MEAN (KNOWN s )
Example 1: The following data represent a random sample of
10 observations from a normal population whose standard
deviation is 2. Estimate the population mean with 90%
confidence: 7,3,9,11,5,4,8,3,10,9
f(x)
σx =
σ
n
x
14
7
SELECTING SAMPLE SIZE
• A narrow confidence interval is more desirable.
• For a given a confidence level, a narrow confidence interval
can be obtained by increasing the sample size.
• Desired precision or maximum error: If the confidence
interval has the form of X ± d then, d is the desired
precision or the maximum error.
• For a given confidence level (1-α), desired precision d and
the population standard deviation σ the sample size
necessary to estimate population mean, µ is
z σ
n =  α /2 
 d 
2
An approximation for σ: σ =
Q.999 − Q. 001
6
15
SELECTING SAMPLE SIZE
Example 2: Determine the sample size that is required to
estimate a population mean to within 0.2 units with 90%
confidence when the standard deviation is 1.0.
16
8
INTERVAL ESTIMATOR OF MEAN (UNKNOWN s )
• If the population standard deviation s is unknown, the
normal distribution is not appropriate and the mean is
estimated using Student t distribution. Recall that
X −µ
t=
s/ n
• For some confidence level 1 -α, sample size n, sample
mean, X and the sample standard deviation, s the
confidence interval estimator of mean, µ is as follows:
s

s
s 
X ± tα / 2
also written as  X − tα / 2
, X + tα / 2
n
n
n 

• Where, tα / 2 is that value of t for which area in the upper tail
is α/2 at degrees of freedom, d.f. = n-1.
17
SMALL POPULATION
• For small, finite population, a correction factor is applied in
computing σ X . So, the confidence interval is computed as
follows:
µ = X ± zα / 2
σ
n
N −n
N −1
(σ
known )
µ = X ± tα / 2
s
n
N −n
N −1
(σ
unknown )
18
9
UNKNOWN s AND SMALL POPULATION
Example 3: An inspector wishes to estimate the mean
weight of the contents in a shipment of 16-ounce cans of
corn. The shipment contains 500 cans. A sample of 25
cans is selected, and the contents of each are weighed.
The sample mean and standard deviation were compute
to be X = 16.1 ounces and s = 0.25 ounce. Construct a
90% confidence interval of the population mean.
19
INTERVAL ESTIMATOR OF PROPORTION
• Confidence interval of the proportion for large population:
P(1 − P )
n
• Confidence interval of the proportion for small population:
π = P ± zα / 2
P(1 − P ) N − n
n
N −1
• Required sample size for estimating the proportion:
π = P ± zα / 2
zα2 / 2π (1 − π )
n=
d2
20
10
INTERVAL ESTIMATOR OF PROPORTION
Example 4: The controls in a brewery need adjustment
whenever the proportion p of unfulfilled cans is 0.01 or
greater. There is no way of knowing the true proportion,
however. Periodically, a sample of 100 cans is selected
and the contents are measured.
(a) For one sample, 3 under-filled can were found.
Construct the resulting 95% confidence interval estimate
of p.
(b) What is probability of getting as many or more underfilled cans as in (a) when in fact p is only 0.01.
21
INTERVAL ESTIMATOR OF VARIANCE
• The chi-square distribution is asymmetric. As a result, two
critical values are required to compute the confidence
interval of the variance.
• Confidence interval of the variance:
(n − 1)s 2 ≤ σ 2 ≤ (n − 1)s 2
χα2 / 2
χ12−α / 2
22
11
INTERVAL ESTIMATOR OF VARIANCE
Example 5: The sample standard deviation for n = 25
observations was computed to be s = 12.2. Construct a
98% confidence interval estimate of the population
standard deviation.
23
READING AND EXERCISES
Lesson 18
Reading:
Section 10-1 to 10-4, pp. 295-319
Exercises:
10-9, 10-10, 10-13, 10-21, 10-24, 10-26, 10-31, 10-32
24
12