Download CHAPTER SIX SAMPLING DISTRIBUTION FOR MEANS AND

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Probability wikipedia , lookup

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Transcript
CHAPTER SIX
SAMPLING DISTRIBUTION FOR MEANS AND PROPORTIONS
In general the population characteristics will be represented by letters
from the Greek alphabet while sample characteristics will be represented
by latin letters.
In statistical inference, the mean and the variance calculated from sample
data are used to estimate the population mean and variance, hence
is called a point estimate for μ
2
s is called a point estimate for 2.
The mean of all sample means is written as
.
The standard deviation of all sample mean is written as
.
is called the finite population correction factor and must be used
when sampling from finite population.
As a rule of thumb, when the sample size is less than 5% of the
population size.
It can be proved (mathematically) that the probability distribution of the
sample means for sample size greater than 30, selected from any
population (whose mean and variance are known) approaches a Normal
distribution with mean μ and standard deviation
This is called the Central Limit Theorem.
The distribution of sample means
for sample size n≥30.
In addition, it can also be proved (mathematically) that the Central Limit
Theorem applies for small samples selected from Normal populations
when the population variance σ2 is known.
for samples of any size from a Normal distribution, known
variance σ2
A direct application of the Central Limit Theorem is the
(i)
(ii)
calculation of probabilities regarding sample means;
calculations of the limits that contain various percentages of the
mean.
Example
An importer of Herbs and Spices claims that the average weight of
packets of Saffron is 20 gms. However, packets are actually filled to an
average weight μ= 19,5 gm., standard deviation σ = 1,8 gm. A random
sample of 36 packets is selected, calculate:
a) the probability that the average weight is 20 gms or more;
b) the two limits within which 95% of all packets weight;
c) the two limits within which 99% of all average weights fall.
a) The question ask for probability P(x≥20). It is necessary to use the
probability distribution of the sample mean, that is Normal with μ =
19,5 and σ= 1,8/(n)1/2 .
For
From Tables the area in the tail is 0,0475.
b) The question asks for individual weights, not average.
The Z values that correspond to a tail area of 0,025 are -1,96 and
+1,96..
The upper limit is μ+Zσ= 19,5 + 1,96 (1,8) = 19,5 + 3,528 = 23,028.
The lower limit is μ-Zσ = 19,5 – 1,96(1,8) = 19,5 – 3,528 = 15,672.
c) This part asks for sample averages. The method is the same as in
part b) except that the standard for average is
.
The upper limit is given by μ+Z = 19,5 + 1,96 (0,3) = 19,5 +0,588=
20,088.
The lower limit is given by μ-Z = 19,5 -1,96 (0,3) = 19,5 -0,588=
18,912.
Sampling distribution of proportion
A proportion is the number of elements with a given characteristic
divided by the total number of elements in the group.
The sample proportion, p, is the point estimate of the population
proportion, π.
The standard error for proportion is given by the formula
If N is large, the second term goes to zero and
The sampling distribution of proportion
The list of every possible sample proportion, with its probability, is called
the sample distributions of proportions.
For large samples (n≥30)
Example
In a certain neighbourhood, it is known that 12% of youths aged from 16
to 24 are unemployed.
If a random sample of 150 youth are selected what is the probability that
the sample contains
At most 10% unemployed
At most 15 unemployed
If 12% are unemployed π=0,12 e σp =
= 0,0265.
P(p≤0,10).
Z=
= -0,75
The tail area is 0,2266.
The probability that at most 10% of sample is unemployed is 0,2266.
Mean and standard errors for the distribution of proportion when π is
unknown
The calculation of the mean and standard error of sample proportion
depends on knowing the value of the population proportion, π. Since π is
seldom known, the mean and standard error for proportions are
approximated by substituting p for π. Hence
Some desirable properties of estimators
Two of the most important properties of point estimators are that they
should be:
1. Unbiased
2. Minimum variance
Unbiased estimators
An estimator is said to be unbiased if the average value of all the point
estimates is equal to the population parameter to be estimated.
The mean is an unbiased estimator of μ and the sample proportion p is
an unbiased estimator for π.
Minimum variance
The values of sample statistics (sample means and proportios) can vary
greatly about the population parameter being estimated. It is obviously
desiderable to keep this variation (as measured) by standard error as
small as possible (minimum variance). Increasing sample size reduces the
standard error.
An estimator is described as precise when the values of the estimates are
close. The standard deviation of all the estimates (called standard error) is
a measure of the precision of an estimator. Ideally an estimator should be
unbiased and have minimum variance (both accurate and precise).
The sample mean and proportion are unbiased and minimum variance
estimators of the population mean and proportion.