Download Sampling Distribution

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
MAT 2379 (Spring 2012)
Sampling Distribution
Definition : Let X1 , . . . , Xn be a collection of random variables. We say
that they are identically distributed if they have a common distribution.
Definition : Let X1 , . . . , Xn be random variables. We say that X1 , . . . , Xn
is a random sample, if they are independent and identically distributed.
Remarks :
– We will model an experiment with n independent trial as a random
sample.
– The common distribution is called the population.
– The mean of this common distribution is called the population mean
and is denoted µ, in other words E(Xi ) = µ, for i = 1, 2, . . . , n.
– The variance of this common distribution is called the population variance and it is denoted σ 2 , in other words V (Xi ) = σ 2 for i = 1, 2, . . . , n.
Definition : A fonction of the random sample is called a statistic. A
statistic is a random variable and its probability distribution is called its
sampling distribution.
Remark : In these note, we will discuss some results concerning the sampling distribution of the mean X of a sample of size n from a population with
a mean µ and a variance σ 2 .
We have already shown that
E(X) = µ
and
Var(X) =
σ2
.
n
But what is the sampling distribution of X ? Here a few results that we
help us answer this question.
1
Theorem : Let X1 , . . . , Xn be independent normal random variables
such that Xi ∼ N (µi , σi2 ) for i = 1, . . . , n. Let
Y = a1 X 1 + a2 X 2 + . . . + an X n ,
that is Y is a linear combination of X1 , . . . , Xn . Then, Y has a the following
normal distribution :
!
n
n
X
X
N (E[Y ], Var[Y ]) = N
ai µ i ,
a2i σi2 .
i=1
i=1
Consequence : Consider a random sample X1 , . . . , Xn from a normal
population with a mean µ and a variance σ 2 . The mean of the n normal
random variables,
X1 + . . . + Xn
X=
,
n
is a normal random variable. That is, X ∼ N (µ, σ 2 /n). Thus,
X − E(X)
X −µ
√ ∼ N (0, 1).
=
Z= q
σ/ n
V (X)
Remark : We often do not know the population standard deviation and
so we will rather work with the following statistic
T =
X −µ
√ .
S/ n
The sampling distribution of T is not a normal distribution. However, if the
population is normal, then T has a T distribution (often called Student)
with ν = n − 1 degrees of freedom.
Remarks :
– The probability density of T is centered about zero, but it is more
dispersed than a standard normal distribution N (0, 1). In fact, it can
be shown that, for ν > 2,
p
σT = ν/(ν − 2) > 1.
2
– When the degrees of freedom ν → ∞, the T distribution tends to a
standard normal distribution N (0, 1).
– There is a table for the T distribution in the textbook, see Table 17.4.
– Notation : tA,ν is a quantile such that
P (T > tA,ν ) = A,
where T has a T distribution with ν degrees of freedom.
Example 1 : Consider a certain large population of mammals such that
their weight is normally distributed with a mean of 25 kg and a standard
deviation of 5 kg. We will select a random sample of 20 of these mammals.
(a) What is the probability that the mean weight of the 25 mammals is
between 23 kg and 27 kg ?
(b) Let X and S, be the sample mean and the sample standard deviation of
a sample of size n = 25, respectively. Define
Z=
X − 25
√
5/ 25
and T =
X − 25
√ .
S/ 25
(i) What is the distribution of Z and of T ?
(ii) Compute P (Z > 2) and P (T > 2).
(iii) Compute P (Z > 4) and P (T > 4)
(iv) Compute P (Z < −1,55) and P (T < −1,55).
3
Remark : When using Table 17.4 to compute probabilities involving a
T random variable, we will often not be able to give the exact probability.
We will only be able to find an interval where the probability might reside.
However, we can find the probability with Minitab.
Question : If the population is not normal, then what is the sampling
distribution of X ? A partial answer is giving in the following theorem.
Theorem : Consider a random sample X1 , . . . , Xn from a population
with a mean µ and a variance σ 2 . Let Φ be the cumulative distribution function for the standard normal distribution N (0, 1).
(a) If
Zn =
X −µ
√ ,
σ/ n
then
lim FZn = Φ.
n→∞
(b) If
Tn =
X −µ
√ ,
S/ n
then
lim FTn = Φ.
n→∞
Remarks :
– The result in (a) is often called the central limit theorem.
– As long as n is sufficiently large, then we can approximate the sampling
distribution of the sample mean with a normal distribution. We will use
the following rule-of-thumb : if n ≥ 30 then
X −µ
√ ∼ N (0, 1)approximately.
σ/ n
4
– With n sufficiently large, we can also approximate the sampling distribution of the T statistic with a standard normal distribution. We will
use the following rule-of-thumb : if n ≥ 40, then
X −µ
√ ∼ N (0, 1)approximately.
S/ n
Example 2 : Suppose we observe the age of 30 female grizzly bears in
particular region at the time of their first surviving litter. Suppose that the
population has a mean of 8.4 years and a standard deviation of 3 years.
Approximate the probability that the average age of 30 females, when they
produced their first surviving litter, is less than 7 years ?
5
Related documents