Download Let`s Do It

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Degrees of freedom (statistics) wikipedia , lookup

History of statistics wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Gibbs sampling wikipedia , lookup

Taylor's law wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
SAMPLING DISTRIBUTION OF THE MEAN
Recall,
What Do We Expect of Sample Means?
 The values of the sample mean x vary from random
sample to random sample in a predictable way.
 The center of the distribution of the x values is at the
true mean  (for any sample size n).
 With a larger sample size n, the x values tend to be
closer to the true population mean  . That is, the x
values vary less around the true mean  . The
variation will also depend on how much variation there
 When the sample size n is large, the sample mean
can take on many possible values, so the random
variable x can be viewed as a continuous random
variable with a density curve as its model.
 When the sample size n is large, the distribution of
the sample mean can be modeled approximately with
a normal distribution.
 For any sample size (even small), if the distribution of
the original population is normal, the distribution of
the sample mean can be modeled with a normal
distribution.
Sampling Distribution of X the Sample Mean
If simple random samples of size n are taken from a population
having population mean  and population standard deviation ,
then the sampling distribution of the sample mean has the following
properties:
1.
X  
2.  X 

n
The average of all the possible sample mean
values is equal to the parameter  . In other
words, the sample mean X is an unbiased
estimator of  .
The standard deviation of all of the possible
sample mean values decreases as the sample
size n increases.
Note: Keep in mind the difference between

(the standard
deviation for the original population values) and
standard deviation of the sample means).

n
(the
3. If the original population is normally distributed, then for
any sample size n the distribution of the sample mean is also
normal with mean and standard deviation as given.
 

X is N   ,

n

4. If the original population is not necessarily normally
distributed, but n is “sufficiently” large, the distribution of
the sample mean is approximately normal with mean and
standard deviation, as given.
  
X is approx. N   ,

n

Sufficiently large means that the normality does not hold exactly for
any one sample size n, but as n increases, the distribution
eventually starts to look like a normal distribution. A sample size of
30 or more is often considered large enough. If there are extreme
outliers in the observed data, it is better to have a larger sample.
This last result is known as the central limit theorem, abbreviated
CLT.
Example Average Expenditure
Let X be the random variable representing the amount spent by
patients at a drug store. The distribution of the X variable is
skewed to the right with a mean of   $60 and a standard
deviation $35. We plan to take a simple random sample of size
n=100 expenditures and will consider the sample mean
expenditure X .
(a) What is the value of the mean, the standard deviation, and
the (approximate) shape of the distribution of X , the sample
mean expenditure?
Mean of the distribution of X = $60
Standard deviation of the distribution of X =
$35
100
=$3.5
Shape of the distribution of X = N($60, $3.5) because the
sample is large ( n  30),
the CLT applies.
(b) Consider the following probability statement:
35
35
P( 60 
 X  60 
)  0.68
100
100
This statement is (circle one) .Explain.
True.
False
Can’t tell.
Let’s Do It! 3
The length of human pregnancy is approximately normally
distributed with mean  =266 days and a standard deviation
 =16.
a. What is the probability that a random sample of 20
pregnancies has a mean gestational period of 260 days or
less?
b. What is the probability that a random sample of 50
pregnancies has a mean gestational period within 10 days of
the mean?
Let’s Do It! 4
The mean weight of 15-year-old males is 142 pounds, and the
standard deviation is 12.3 pounds. If a sample of 36 15-year-old
males is selected, find the probability that the mean of the
sample will be greater than 144.5 pounds. Based on you answer,
would you consider the group overweight?
The Usefulness of Sampling Distributions
Did you discover the usefulness of sampling distributions?
As you see, in real life, you will NOT KNOW the true
population parameter, because if you did know it, you would
not need to take a sample. You take just ONE SAMPLE, not
many samples as in constructing the empirical sampling
distribution of its estimator. One sample may cost thousands
of dollars or take a lot of time to select.
So, what are we doing in this section?
We were discovering the behavior of a sampling distribution of
an estimator in the case of simple random samples. We
have observed that the larger the sample size taken, the more
the distribution looks “normal” and also the more concentrated
about the “true parameter value.” This guarantees that the
larger the sample size that we take, the greater the
chance that the estimator will be close to the true
parameter value.
In general, we take a sample from a population because we do not
know the true parameter value. If we have some theory regarding the
value of the parameter, we can use the expected sampling distribution
and the observed data to assess if the theory is supported or not.
Homework
page
5-8close
all, 63,any
64, one
76, 116
We cannot
say217:
how
particular estimate is to the true
parameter value. However, we can calculate the probability that the
true parameter value will be contained in the range:
This “something” is called the margin of error, and
we will learn more about it in the next chapter.