Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Z-Scores, Shifting and Scaling ο Z score calculated as follows: οπ§= ο (π = π¦βπ¦ π (π¦βπ¦)2 πβ1 ) ο Shifting: ο Adding/Subtracting ο Changes Measures of Center ο DOES NOT change measures of spread ο Scaling ο Multiplying/Dividing ο Changes Measures of Center ο Changes measures of Spread Normal Models ο βbell-shaped curvesβ are called Normal Models ο Appropriate for distributions whose shapes are unimodal and roughly symmetric Normal Models ο Each is a model ο For symmetric, unimodal distributions a normal model provide a measure of how extreme a z-score is ο There is a normal model for every possible combination of mean and standard deviation Notation ο π π, π ο This represents a normal model with a mean of π and a standard deviation of π. ο Why the greek? ο This mean and standard deviation are not numerical summaries of data. ο They are part of a model. ο They donβt come from data. ο They are numbers that we choose to specify our model ο They are called parameters. Notation Continued ο π π, π ο We donβt want to confuse our parameters with summaries of the data such as π¦ πππ π ο Summaries of the data are called statistics Z-Scores and Normal Models ο If we model data with a Normal Model and standardize them using the corresponding π πππ π we still call the standardized value a zscore and we write: π¦βπ π§= π Z-Scores and Normal Models ο It is usually easier to standardize data first (using its mean and standard deviation) ο Then we need only model N(0,1) ο N(0,1) is called the standard normal model or standard normal distribution Normality Assumption ο In using the Normal Model to model our data, we must have a unimodal and symmetric distribution ο The Normality Assumption is that the data is unimodal and symmetric ο But it probably isnβt exactly thatβ¦ Nearly Normal Condition ο The shape of the dataβs distribution is unimodal and symmetric. ο Check this by making a histogram ο All models make assumptions β always point out the assumption you make for your model. ο Must also check the conditions in the data to make sure that those assumptions are reasonable. Normal Models ο Normal models tell us how extreme a value is by telling us how likely it is to find one that far from the mean. 68-95-99.7 Rule ο In a Normal Model about 68% of values fall within 1 SD of the mean ο About 95% of values fall within 2 SD of the mean ο About 99.7% of values fall within 3 SD of the mean Sample Problems ο Jean-Baptiste Grange of France skied the slalom in 88.46sec, approximately 1 SD faster than the mean. If a Normal Mode is useful in describing these slalom times, about how many of the 35 skiers finishing the event would you expect skied the slalom faster than Jean-Baptiste? ο We expect 68% of skiers to be within 1 SD of the mean. Of the remaining 32%, we expect half on the high end and half on the low end. ο 16% of 35 is 5.6, so conservatively, weβd expect about 5 skiers to do better than JeanBaptiste The Dutch ο The Dutch are among the tallest people in the world: The average Dutch man is 185cm tall, just over six feet. The average Dutch woman is just over 5β 7ββ tall. ο If the Normal Model is appropriate and the SD for men is about 8cm, what percentage of Dutch men will be over 2 meters (6β 6ββ) tall? The Dutch ο ο ο ο ο Mean = 184 cm SD = 8 cm 2 meters = 200cm 200cm = 2 SD above mean We expect 5% of men to be more than two standard deviations below or above the mean ο 2.5% are likely to be above 2 meters Driving ο It takes you 20 minutes, on average, to drive to school with a standard deviation of 2 minutes ο Suppose a Normal Model is appropriate for the distribution of driving times ο A) How often will you arrive at school in less than 22 minutes? ο Answer: 68% of the time weβll be within 1 SD, or two minutes, of the average 20 minutes. So 32% of the time weβll arrive in less than 18 minutes or in more than 22 minutes. Half of those times (16%) will be greater than 22 minutes, so 84% will be less than 22 minutes Driving ο It takes you 20 minutes, on average, to drive to school with a standard deviation of 2 minutes ο B) How often will it take you more than 24 minutes? ο Answer: 24 minutes is 2 ο Suppose a Normal Model is appropriate for the distribution of driving times SD above the mean. By the 95% rule, we know 2.5% of the times will be more than 24 minutes Driving ο It takes you 20 minutes, on ο C) Do you think the average, to drive to school with a standard deviation of 2 minutes distribution of your driving times is unimodal and symmetric? ο Suppose a Normal Model is ο Answer: βGoodβ traffic will appropriate for the distribution of driving times speed up your time by a bit but traffic incidents may occasionally increase the time it takes so times may be skewed to the right and there may be outliers. Driving ο It takes you 20 minutes, ο D) What does the shape of on average, to drive to school with a standard deviation of 2 minutes the distribution then say about the accuracy of your predictions? ο Suppose a Normal Model ο Answer: If this is the case is appropriate for the distribution of driving times the Normal Model is not appropriate and the percentages we predict would not be accurate. pg 129, # 1, 2, 3, 5, 7, 9, 24 (Handed in Tomorrow for Real) Working With Normal Models 1. Make a Picture 2. Make a Picture 3. Make a Picture How to Draw a Normal Curve: - Bell shaped, symmetric about mean: start at the middle and sketch the left and right - Only need to draw out to 3SD - The place where the bell shape changes from curving downward to curving back up β the inflection point β is located exactly one standard deviation from the mean