Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chapter Three Averages and Variation Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Measures of Central Tendency • Mode • Median • Mean Copyright © Houghton Mifflin Company. All rights reserved. 3|2 The Mode • the value that occurs most frequently in a data set Copyright © Houghton Mifflin Company. All rights reserved. 3|3 Find the mode: 6, 7, 2, 3, 4, 6, 2, 6 The mode is 6. Copyright © Houghton Mifflin Company. All rights reserved. 3|4 Find the mode: 6, 7, 2, 3, 4, 5, 9, 8 There is no mode for this data. Copyright © Houghton Mifflin Company. All rights reserved. 3|5 The Median • the central value of an ordered distribution Copyright © Houghton Mifflin Company. All rights reserved. 3|6 To find the median of raw data: • Order the data from smallest to largest. • For an odd number of values pick the middle value. or • For an even number of values compute the average of the middle two values Copyright © Houghton Mifflin Company. All rights reserved. 3|7 Find the median: Data: 5, 2, 7, 1, 4, 3, 2 Rearrange: 1, 2, 2, 3, 4, 5, 7 The median is 3. Copyright © Houghton Mifflin Company. All rights reserved. 3|8 Find the Median: Data: 31, 57, 12, 22, 43, 50 Rearrange: 12, 22, 31, 43, 50, 57 The median is the average of the middle two values = 31 43 2 37 Copyright © Houghton Mifflin Company. All rights reserved. 3|9 Finding the median for a large data set For an ordered data set of n values: Position of the middle value = n1 2 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 10 The Mean • An average that uses the exact value of each entry • Sometimes called the arithmetic mean Copyright © Houghton Mifflin Company. All rights reserved. 3 | 11 The Mean The mean of a collection of data is found by: • summing all the entries • dividing by the number of entries mean sum of all entries number of entries Copyright © Houghton Mifflin Company. All rights reserved. 3 | 12 Find the Mean: 6, 7, 2, 3, 4, 5, 2, 8 6 7 2 3 4 5 2 8 37 mean 4.625 4.6 8 8 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 13 Sigma Notation • The symbol S means “sum the following.” • S is the Greek letter (capital) sigma. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 14 Notations for mean Sample mean x Population mean Greek letter (mu) Copyright © Houghton Mifflin Company. All rights reserved. 3 | 15 Number of entries in a set of data • If the data represents a sample, the number of entries = n. • If the data represents an entire population, the number of entries = N. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 16 Sample mean x x n Copyright © Houghton Mifflin Company. All rights reserved. 3 | 17 Population mean x N Copyright © Houghton Mifflin Company. All rights reserved. 3 | 18 Resistant Measure • a measure that is not influenced by extremely high or low data values Copyright © Houghton Mifflin Company. All rights reserved. 3 | 19 Which is less resistant? • Mean • Median Copyright © Houghton Mifflin Company. All rights reserved. • The mean is less resistant. It can be made arbitrarily large by increasing the size of one value. 3 | 20 Trimmed Mean • a measure of center that is more resistant than the mean but is still sensitive to specific data values Copyright © Houghton Mifflin Company. All rights reserved. 3 | 21 To calculate a (5 or 10%) trimmed mean • Order the data from smallest to largest. • Delete the bottom 5 or 10% of the data. • Delete the same percent from the top of the data. • Compute the mean of the remaining 80 or 90% of the data. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 22 Compute a 10% trimmed mean: 15, 17, 18, 20, 20, 25, 30, 32, 36, 60 • Delete the top and bottom 10% • New data list: 17, 18, 20, 20, 25, 30, 32, 36 10% trimmed mean = Copyright © Houghton Mifflin Company. All rights reserved. x 198 24 .8 n 8 3 | 23 Weighted Average • An average where more importance or weight is assigned to some of the numbers Copyright © Houghton Mifflin Company. All rights reserved. 3 | 24 Weighted Average If x is a data value and w is the weight assigned to that value Weighted average = Copyright © Houghton Mifflin Company. All rights reserved. Sxw Sw 3 | 25 Calculating a Weighted Average In a pageant, the interview is worth 30% and appearance is worth 70%. Find the weighted average for a contestant with an interview score of 90 and an appearance score of 80. 0.30(90) 0.70(80) Weighted average 0.30 0.70 27 56 83 1.00 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 26 Measures of Variation • Range • Standard Deviation • Variance Copyright © Houghton Mifflin Company. All rights reserved. 3 | 27 The Range • the difference between the largest and smallest values of a distribution Copyright © Houghton Mifflin Company. All rights reserved. 3 | 28 Find the range: 10, 13, 17, 17, 18 The range = largest minus smallest = 18 minus 10 = 8 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 29 The Standard Deviation • a measure of the average variation of the data entries from the mean Copyright © Houghton Mifflin Company. All rights reserved. 3 | 30 Standard deviation of a sample s (x x) n 1 2 mean of the sample n = sample size Copyright © Houghton Mifflin Company. All rights reserved. 3 | 31 To calculate standard deviation of a sample • • • • • • Calculate the mean of the sample. Find the difference between each entry (x) and the mean. These differences will add up to zero. Square the deviations from the mean. Sum the squares of the deviations from the mean. Divide the sum by (n 1) to get the variance. Take the square root of the variance to get the standard deviation. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 32 The Variance • the square of the standard deviation Copyright © Houghton Mifflin Company. All rights reserved. 3 | 33 Variance of a Sample ( x x ) 2 s n 1 Copyright © Houghton Mifflin Company. All rights reserved. 2 3 | 34 Find the standard deviation and variance x 30 26 22 78 xx 4 0 -4 Mean = 26 Copyright © Houghton Mifflin Company. All rights reserved. (x - x) Sum = 0 2 16 0 16 ___ 32 3 | 35 The variance s 2 ( x x) n1 Copyright © Houghton Mifflin Company. All rights reserved. 2 = 32 2 =16 3 | 36 The standard deviation s= 16 4 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 37 Find the mean, the standard deviation and variance mean = 5 x xx (x - x) 4 1 1 5 0 0 5 0 0 7 2 4 4 1 1 25 Copyright © Houghton Mifflin Company. All rights reserved. 2 6 3 | 38 The mean, the standard deviation and variance Mean = 5 S tan dard deviation Variance 1 .5 1 .22 6 1 .5 4 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 39 Computation Formulas for Sample Variance and Standard Deviation: Sx x 2 2 Sample variance s 2 n 1 n x 2 x 2 Sample standard devaition s Copyright © Houghton Mifflin Company. All rights reserved. n1 n 3 | 40 To find S x2 • Square the x values, then add. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 41 To find ( S x ) 2 Sum the x values, then square. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 42 Use the computing formulas to find s and s2 x x2 4 16 5 25 5 25 7 49 4 25 16 131 Copyright © Houghton Mifflin Company. All rights reserved. s2 131 625 51 5 1.5 s 1.5 1.22 3 | 43 Population Mean population x mean N where N number of data values in the population Copyright © Houghton Mifflin Company. All rights reserved. 3 | 44 Population Standard Deviation x x 2 N where N number of data values in the population Copyright © Houghton Mifflin Company. All rights reserved. 3 | 45 Coefficient Of Variation: • A measurement of the relative variability (or consistency) of data. s CV 100 or 100 x Copyright © Houghton Mifflin Company. All rights reserved. 3 | 46 CV is used to compare variability or consistency A sample of newborn infants had a mean weight of 6.2 pounds with a standard deviation of 1 pound. A sample of three-month-old children had a mean weight of 10.5 pounds with a standard deviation of 1.5 pound. Which (newborns or 3-month-olds) are more variable in weight? Copyright © Houghton Mifflin Company. All rights reserved. 3 | 47 To compare variability, compare Coefficient of Variation • For newborns: CV = 16% Higher CV: more variable • For 3-month-olds: CV = 14% Copyright © Houghton Mifflin Company. All rights reserved. Lower CV: more consistent 3 | 48 Use Coefficient of Variation • To compare two groups of data, to answer: • Which is more consistent? • Which is more variable? Copyright © Houghton Mifflin Company. All rights reserved. 3 | 49 CHEBYSHEV'S THEOREM For any set of data and for any number k, greater than one, the proportion of the data that lies within k standard deviations of the mean is at least: 1 1 k Copyright © Houghton Mifflin Company. All rights reserved. 2 3 | 50 Results of Chebyshev’s Theorem Copyright © Houghton Mifflin Company. All rights reserved. 3 | 51 Using Chebyshev’s Theorem • A mathematics class completes an examination and it is found that the class mean is 77 and the standard deviation is 6. • According to Chebyshev's Theorem, between what two values would at least 75% of the grades be? Copyright © Houghton Mifflin Company. All rights reserved. 3 | 52 Mean = 77 Standard deviation = 6 At least 75% of the grades would be in the interval: x 2 s to x 2 s 77 – 2(6) to 77 + 2(6) 77 – 12 to 77 + 12 65 to 89 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 53 Percentiles • For any whole number P (between 1 and 99), the Pth percentile of a distribution is a value such that P% of the data fall at or below it. • The percent falling at or above the Pth percentile will be (100 – P)%. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 54 A histogram showing the 60th Percentile Copyright © Houghton Mifflin Company. All rights reserved. 3 | 55 Percentiles Copyright © Houghton Mifflin Company. All rights reserved. 3 | 56 Quartiles • Percentiles that divide the data into fourths • Q1 = 25th percentile • Q2 = the median • Q3 = 75th percentile Copyright © Houghton Mifflin Company. All rights reserved. 3 | 57 Quartiles Copyright © Houghton Mifflin Company. All rights reserved. 3 | 58 Computing Quartiles • Order the data from smallest to largest. • Find the median, the second quartile. • Find the median of the data falling below Q2. This is the first quartile. • Find the median of the data falling above Q2. This is the third quartile. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 59 Inter-quartile range IQR Q3 Q1 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 60 Find the quartiles: 12 23 41 15 24 45 16 25 51 16 30 17 32 18 33 22 33 22 34 The data has been ordered. The median is 24. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 61 Find the quartiles: 12 23 41 15 24 45 16 25 51 16 30 17 32 18 33 22 33 22 34 The data has been ordered. The median is 24. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 62 Find the quartiles: 12 23 41 15 24 45 16 25 51 16 30 17 32 18 33 22 33 22 34 For the data below the median, the median is 17. 17 is the first quartile. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 63 Find the quartiles: 12 23 41 15 24 45 16 25 51 16 30 17 32 18 33 22 33 22 34 For the data above the median, the median is 33. 33 is the third quartile. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 64 Find the interquartile range: 12 23 41 15 24 45 16 25 51 16 30 17 32 18 33 22 33 22 34 IQR = Q3 – Q1 = 33 – 17 = 16 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 65 Five-Number Summary of Data • • • • • Lowest value First quartile Median Third quartile Highest value Copyright © Houghton Mifflin Company. All rights reserved. 3 | 66 Box-and-Whisker Plot • a graphical presentation of the fivenumber summary of data Copyright © Houghton Mifflin Company. All rights reserved. 3 | 67 Box-and-Whisker Plot Copyright © Houghton Mifflin Company. All rights reserved. 3 | 68 Making a Box-and-Whisker Plot • Draw a vertical scale including the lowest and highest values. • To the right of the scale, draw a box from Q1 to Q3. • Draw a solid line through the box at the median. • Draw lines (whiskers) from Q1 to the lowest and from Q3 to the highest values. Copyright © Houghton Mifflin Company. All rights reserved. 3 | 69 Construct a Box-and-Whisker Plot: 12 23 41 15 24 45 16 25 51 16 30 17 32 18 33 22 33 Lowest = 12 Q1 = 17 Median = 24 Q3 = 33 22 34 Highest = 51 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 70 Box-and-Whisker Plot Highest = 51 Q3 = 33 Median = 24 Q1 = 17 Lowest = 12 Copyright © Houghton Mifflin Company. All rights reserved. 3 | 71