Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
t distribution∗ CWoo† 2013-03-21 17:34:56 Let X and Y be random variables such that 1. X and Y are independent 2. X ∼ N(0, 1), the standard normal distribution (with mean 0 and variance 1) 3. Y ∼ χ2 (n), the chi-square distribution with n degrees of freedom Define a new random variable Z by Z = X/ Y 12 n Then the distribution of Z is called the t distribution with n degrees of freedom, denoted by Z ∼ t(n). By transformation of the random variables X and Y , one can show that the probability density function of the t distribution of Z has the form: fZ (x) = √ 1 x2 − 1+ n 1 n n B( 2 , 2 ) n+1 2 , where B(α, β) is the beta function. Below are graphs of some t probability density functions for various degrees (d) of freedom. d = 1, d = 2, d = 3, d = 500 Remarks ∗ hTDistributioni created: h2013-03-21i by: hCWooi version: h35962i Privacy setting: h1i hDefinitioni h62A01i † This text is available under the Creative Commons Attribution/Share-Alike License 3.0. You can reuse this document or portions thereof only if you do so under terms that are compatible with the CC-BY-SA license. 1 • the t distribution is also known as the Student’s t distribution. The name Student came from the 19th Century research chemist William Sealy Gossett, who was employed by the brewing company Guinness to improve the yield of crops used to produce its beer. Gossett conducted agricultural experiments and used random numbers to help determine the sampling distribution of the data he collected. Because the brewing company wanted to keep the research results confidential for competitve reasons, Gossett had to use a pen name to publish his findings. Student was his pen name and the distribution he found turned out to be the t distribution. • t(1) = Cauchy(0, 1), the Cauchy distribution with parameters 0 and 1. • t(n) is asymptotically (as n → ∞) N(0, 1), the standard normal distribution with mean 0 and variance 1. • If X ∼ t(n), E[|X|k ] exists iff k < n. Therefore, a t distribution has no mean if it has one degree of freedom. For n > 1, E[X] = 0. For n > 2, n . Var[X] = n−2 • If X1 , . . . , Xn are random samples from a normal distribution with mean µ and variance σ 2 . Let µ̂ be the sample mean and σ̂ 2 the sample variance, then µ̂ − µ √ ∼ t(n − 1) U= σ̂/ n Please note that the statistic U does not depend on σ 2 , and thus is used often in testing hypotheses involving comparison of the sample mean to the true mean, given a set of random samples that are normally distributed with an unknown mean and an unknown variance. This is an example of a t test. 2