Download PDF

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Statistics wikipedia , lookup

History of statistics wikipedia , lookup

Transcript
t distribution∗
CWoo†
2013-03-21 17:34:56
Let X and Y be random variables such that
1. X and Y are independent
2. X ∼ N(0, 1), the standard normal distribution (with mean 0 and variance
1)
3. Y ∼ χ2 (n), the chi-square distribution with n degrees of freedom
Define a new random variable Z by
Z = X/
Y 12
n
Then the distribution of Z is called the t distribution with n degrees of freedom,
denoted by Z ∼ t(n).
By transformation of the random variables X and Y , one can show that the
probability density function of the t distribution of Z has the form:
fZ (x) = √
1
x2 −
1+
n 1
n
n B( 2 , 2 )
n+1
2
,
where B(α, β) is the beta function.
Below are graphs of some t probability density functions for various degrees
(d) of freedom.
d = 1, d = 2, d = 3, d = 500
Remarks
∗ hTDistributioni
created: h2013-03-21i by: hCWooi version: h35962i Privacy setting:
h1i hDefinitioni h62A01i
† This text is available under the Creative Commons Attribution/Share-Alike License 3.0.
You can reuse this document or portions thereof only if you do so under terms that are
compatible with the CC-BY-SA license.
1
• the t distribution is also known as the Student’s t distribution. The name
Student came from the 19th Century research chemist William Sealy Gossett, who was employed by the brewing company Guinness to improve the
yield of crops used to produce its beer. Gossett conducted agricultural experiments and used random numbers to help determine the sampling distribution of the data he collected. Because the brewing company wanted
to keep the research results confidential for competitve reasons, Gossett
had to use a pen name to publish his findings. Student was his pen name
and the distribution he found turned out to be the t distribution.
• t(1) = Cauchy(0, 1), the Cauchy distribution with parameters 0 and 1.
• t(n) is asymptotically (as n → ∞) N(0, 1), the standard normal distribution with mean 0 and variance 1.
• If X ∼ t(n), E[|X|k ] exists iff k < n. Therefore, a t distribution has no
mean if it has one degree of freedom. For n > 1, E[X] = 0. For n > 2,
n
.
Var[X] = n−2
• If X1 , . . . , Xn are random samples from a normal distribution with mean
µ and variance σ 2 . Let µ̂ be the sample mean and σ̂ 2 the sample variance,
then
µ̂ − µ
√ ∼ t(n − 1)
U=
σ̂/ n
Please note that the statistic U does not depend on σ 2 , and thus is used
often in testing hypotheses involving comparison of the sample mean to
the true mean, given a set of random samples that are normally distributed
with an unknown mean and an unknown variance. This is an example of
a t test.
2