Download The Moments of Student`s t distribution

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Degrees of freedom (statistics) wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Taylor's law wikipedia , lookup

Gibbs sampling wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
The Moments of Student’s t Distribution

The kurtosis of Student’s t is infinite when df ≤ 4. Otherwise it is 6/(df – 4)

The skewness of Student t is 0, unless df ≤ 3, in which case it is not defined.

The standard deviation of Student’s t distribution is the square root of df / (df – 2),
unless df ≤ 2, in which case it is not defined.

The mean of Student’s t distribution is 0, unless df = 1, in which case it is not
defined.
I have my students run a little Monte Carlo to create the sampling distribution of t
and then fiddle with sample size and the shape of the population from which the scores
are sampled. This allows them to demonstrate the consistency of the mean and the
variance, the central limit theorem, and so on. When I ran it with sample sizes of 4, I
found that the obtained distributions of sample means (100,000 to 1,000,000 samples in
each distribution) had outrageous values for g1 and g2, and these values changed wildly
from one replication to the next. For example, on one run, g1 = .04 and g2 = 102. On
the next run g1 = .4.66 and g2 = 707. On a third run g1 = -26 and g2 = 8,063. Now I
know why – neither skewness nor kurtosis are defined for Student’s t on 3 degrees of
freedom.
It was Robert J. MacG. Dawson who explained this to me, answering a query
about it on Edstat-L:
The skewness (in the third-moment sense) is only defined
for df >= 4. It's a matter of convergence of improper integrals.
T_1 (two data, one degree of freedom) is the Cauchy
distribution, and has no mean. This is because in computing it
you are dividing one normally distributed RV by another - hence
[in polar coordinates] the CDF is basically arctangent - hence
the pdf is 1/(1+x^2)and the mean is the integral from -infinity
to infinity of x/(1+x^2)which is easily seen to diverge. Its
Cauchy principle value is 0 as befits a symmetric function, but
the limits at -infinity and infinity cannot be evaluated
independently. If you try to compute a sample mean for a T_1
simulation it will not converge as the sample size increases.
By a similar argument T,_2 (3 data, pdf 1/{(2+x^2)^(3/2)}
is of order x^-3 at infinity and has a mean but no standard
deviation; and T_3 (4 data) has mean, SD, but no third moment.
There are also measures of skewness (eg, Q1+Q3-2M, which is
completely order based) that are defined even for T_1. They are
also zero.
Karl L. Wuensch, East Carolina University, November, 2009