Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Continuous Distributions Continuous random variables • Are numerical variables whose values fall within a range or interval • Are measurements • Can be described by density curves Density curves • Is always on or above the horizontal axis • Has an area exactly equal to one underneath it • Often describes an overall distribution • Describe what proportions of the observations fall within each range of values Uniform Distribution • Is a continuous distribution that is evenly (or uniformly) distributed • Has a density curve in the shape of a rectangle • Probabilities are calculated by finding the area under the curve a b x 2 x2 b a 12 2 How do ayou the Where: & bfind are the area endpoints ofof thea rectangle? uniform distribution The Citrus Sugar Company packs sugar in bags labeled 5 pounds. However, the packaging isn’t perfect and the actual What shape does a uniform weights are uniformly distributed with a What is the height of this distribution have? mean of 4.98 pounds and a range of .12 rectangle? pounds. How long is this rectangle? a)Construct the uniform distribution above. 1/.12 4.92 4.98 5.04 • What is the probability that a randomly selected bag will weigh more than 4.97 pounds? P(X > 4.97) = .07(1/.12) = .5833 What is the length of the shaded region? 1/.12 4.92 4.98 5.04 • Find the probability that a randomly selected bag weighs between 4.93 and 5.03 pounds. What is the length of P(4.93<X<5.03) = .1(1/.12) = .8333 the shaded region? 1/.12 4.92 4.98 5.04 The time it takes for students to drive to school is evenly distributed with a minimum of 5 minutes and a range of 35 minutes. What is the height of the rectangle? a)Draw the distribution Where should the rectangle end? 1/35 5 40 b) What is the probability that it takes less than 20 minutes to drive to school? P(X < 20) = (15)(1/35) = .4286 1/35 5 40 c) What is the mean and standard deviation of this distribution? = (5 + 40)/2 = 22.5 2 = (40 - 5)2/12 = 102.083 = 10.104 Normal Distributions • • • • • Symmetrical bell-shaped (unimodal) density curve How is this done Above the horizontal axis mathematically? N(, ) The transition points occur at + Probability is calculated by finding the area under the curve • As increases, the curve flattens & spreads out • As decreases, the curve gets taller and thinner Normal distributions occur frequently. • • • • • • • Length of newborn child Height Weight ACT or SAT scores Intelligence Number of typing errors Chemical processes A 6 B Do these two normal curves have the same mean? If so, what is it? YES Which normal curve has a standard deviation of 3? B Which normal curve has a standard deviation of 1? A Empirical Rule • Approximately 68% of the observations fall within of • Approximately 95% of the observations fall within 2 of • Approximately 99.7% of the observations fall within 3 of Suppose that the height of male students at CHS is normally distributed with a mean of 71 inches and standard deviation of 2.5 inches. What is the probability that the height of a randomly selected male student is more than 73.5 inches? 1 - .68 = .32 P(X > 73.5) = 0.16 68% 71 Standard Normal Density Curves Always has = 0 & = 1 To standardize: x z Must have this memorized! Strategies for finding probabilities or proportions in normal distributions 1. State the probability statement 2. Draw a picture 3. Calculate the z-score 4. Look up the probability (proportion) in the table The lifetime of a certain type of battery is normally distributed with a mean of 200 hours and a standardDraw deviation of 15 & shade Write the hours. What proportion of these the curve probability batteries can be expected to last less statement than 220 hours? P(X < 220) = .9082 Look up z220 200 score in z 1.33 table 15 Calculate z-score The lifetime of a certain type of battery is normally distributed with a mean of 200 hours and a standard deviation of 15 hours. What proportion of these batteries can be expected to last more than 220 hours? P(X>220) = 1 - .9082 = .0918 220 200 z 1.33 15 The lifetime of a certain type of battery is normally distributed with a mean of 200 hours and a standard deviation of 15 Look up in table 0.95 hours. How long must a battery last to be in the top 5%? to find z- score P(X > ?) = .05 x 200 1.645 15 x 224.675 .95 .05 1.645 The heights of the female students at CHS are normally distributed with a What is the zmean of 65 inches. What is the for the standard deviation of this score distribution 63? if 18.5% of the female students are shorter than 63 inches? P(X < 63) = .185 63 65 .9 2 2.22 .9 -0.9 63 The heights of female teachers at CHS are normally distributed with mean of 65.5 inches and standard deviation of 2.25 inches. The heights of male teachers are normally distributed with mean of 70 inches and standard deviation of 2.5 inches. •Describe the distribution of differences of heights (male – female) teachers. Normal distribution with = 4.5 & = 3.3634 • What is the probability that a randomly selected male teacher is shorter than a randomly selected female teacher? P(X<0) = .0901 0 4.5 z 1.34 3.3634 4.5 Will my calculator do any of this normal stuff? • Normalpdf – use for graphing ONLY – Y=normalpdf(x,µ,σ) • Normalcdf – will find probability of area from lower bound to upper bound (if starting from left, lower bound = -1x1099 ) Normalcdf(lower,upper,µ,σ) • Invnorm (inverse normal) – will find z-score from probability number – Z-score = Invnorm(probability) Ways to Assess Normality • Use graphs (dotplots, boxplots, or histograms) • Use the Empirical Rule • Normal probability (quartile) plot on calculator Normal Probability (Quartile) plots • The observation (x) is plotted against known normal z-scores • If the points on the quartile plot lie close to a straight line, then the data is normally distributed • Deviations from a straight line on the quartile plot indicate nonnormal data • Points far away from the plot indicate outliers • Vertical stacks of points (repeated observations of the same number) is called granularity Are these approximately normally distributed? 50 48 54 47 51 52 46 53 52 51 48 48 54 55 57 45 53 50 47 49 50 56 53 52 Both the histogram & boxplot are approximately symmetrical, so these data are approximately normal. Are these approximately normally distributed? 50 48 54 47 51 52 46 53 What 52 51 48 48 54 55 57is this 45 53 50 47 49 50 56 called? 53 52 To create a Normal Probability plot, enter the data into L1 and then choose the 6th graph type in The normal probability STATPLOT. L1 is x-axis plot is approximately and y-axis is theoretical zlinear, so these data are scores for normal approximately normal. distribution