Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Statistical Reasoning for everyday life Intro to Probability and Statistics Mr. Spering – Room 117 5.3 The Central Limit Theorem Is it Normal? Times in the 100-meter dash… YES--NORMAL The weights of new Casino Dice… NO--UNIFORM Number of candy bars in a Twix package… NO--UNIFORM Heights of Sequoia trees… YES--NORMAL Weights of 5000 randomly selected Boston Terriers… YES--NORMAL 5.3 The Central Limit Theorem THE CENTRAL LIMIT THEOREM When we take many random samples of size n for a variable with any distribution and record the distribution of the means of each sample. Then, 1. The distribution of means will be approximately a normal distribution for large sample sizes. The mean of the distribution of means approaches the population mean, μ, for large sample sizes. The standard deviation of the distribution of means approaches σ/√n for large sample sizes, where σ is the standard deviation of the population. 2. 3. practical purposes--the distribution will need more than 30 valesccccccp… Ooops, I think I feel asleep…> 30 values. Recall, If normal then the 68-95-99.7 rule holds true. 5.3 The Central Limit Theorem Summary: Within the central limit theorem, we always start with a particular variable, such as the outcomes of a die roll or weights of people, that varies randomly over a population. The variable has a certain mean, μ, and standard deviation, σ, which we may or may not know. This variable can have any sort of distribution. When we take many samples of that variable, with n items on each sample, and make a histogram of the means from the many samples, we will see a distribution that is close to “normal”. The larger the sample size, n, the more closely the distribution of means approximates a normal distribution. Check out this website on rolling dice & the C.L.T. --- http://www.stat.sc.edu/~west/javahtml/CLT.html 5.3 The Central Limit Theorem VISUAL EXAMPLE: Pictures of a distribution being "smoothed out" by summation (showing original density of distribution and three subsequent summations, obtained by convolution “averaging” of density functions) Works Cited: http://en.wikipedia.org/wiki/Ce ntral_limit_theorem More visuals: Page 218 TheCentralLimitTheorem.nbp 5.3 The Central Limit Theorem EXAMPLE: Suppose you are a principal of a middle school and your 100 eighth-graders are about to take a national standardized test. (sound familiar?) The test is designed so that the mean score is μ = 400 with a standard of deviation σ = 70. Assume the scores are normally distributed. A) What is the likelihood that one of your eighth-graders, selected at random, will score below 380 on the exam? How do we start? Percentage? Z-scores correspond to a percentage from the table 5.1 on page 232. Therefore, the standard score of 380 is x x 380 400 0.29 -0.29 is approximately 38th percentile, or 70 38% of all students (about 4 out of 10). 5.3 The Central Limit Theorem EXAMPLE: B) Your performance as a principal depends on how well your entire group of eighth-graders scores on the exam. What is the likelihood that your group of 100 eighth-graders will have a mean score below 380? How do we start? Central Limit Theorem? According to the C.L.T. if we take random groups of say 100 students and study their means, then the means distribution will approach normal. Hence, the μ = 400 and its standard of deviation is according to the C.L.T. 70 7 n 100 x x 380 400 2.9 Therefore, the z-score for a mean of 380 is 7 The percentile for the z-score on page 232 is 0.19th percentile or 0.0019, which is very small, about 1 in 500. EMPHASIS POINT: THERE IS MORE VARIATION IN THE SCORES OF INDIVIDUALS THAN IN THE MEANS OF GROUPS OF INDIVIDUALS 5.3 The Central Limit Theorem When listening to corn pop, you are hearing normal distribution and the C.L.T. -- William A. Massey {Princeton} HOMEWORK: Pg 220 # 1-20 all - Z-Score Percentile ≈ -1.2 and -1.3 12th ≈ -0.7 and -0.65 25th ≈ -0.35 and -0.30 37th ≈ 0.0 and 0.0 50th ≈ 0.30 and 0.35 62nd ≈ 0.65 and 0.70 75th ≈ 1.1 and 1.2 87th ≈ 3.5 and above 99.98th The larger the sample size, the of the means approaches: n