Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chapter 3 Averages and Variation Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze Measures of Central Tendency • Average – a measure of the center value or central tendency of a distribution of values. • Three types of average: – Mode – Median – Mean Copyright © Cengage Learning. All rights reserved. 3|2 Mode The mode is the most frequently occurring value in a data set. Example: Sixteen students are asked how many college math classes they have completed. {0, 3, 2, 2, 1, 1, 0, 5, 1, 1, 0, 2, 2, 7, 1, 3} The mode is 1. Copyright © Cengage Learning. All rights reserved. 3|3 Median Finding the median: 1). Order the data from smallest to largest. 2). For an odd number of data values: Median = Middle data value 3). For an even number of data values: Sum of middle two values Median 2 Copyright © Cengage Learning. All rights reserved. 3|4 Median Find the median of the following data set. { 4, 6, 6, 7, 9, 12, 18, 19} a). 6 b). 7 Copyright © Cengage Learning. All rights reserved. c). 8 d). 9 3|5 Median Find the median of the following data set. {4, 6, 6, 7, 9, 12, 18, 19} a). 6 b). 7 Copyright © Cengage Learning. All rights reserved. c). 8 d). 9 3|6 Mean Sample mean Population mean x x x n Copyright © Cengage Learning. All rights reserved. N 3|7 Mean Sample mean Population mean x x x N n Find the mean of the following data set. {3, 8, 5, 4, 8, 4, 10} a). 8 b). 6.5 Copyright © Cengage Learning. All rights reserved. c). 6 d). 7 3|8 Mean Sample mean Population mean x x x N n Find the mean of the following data set. {3, 8, 5, 4, 8, 4, 10} a). 8 b). 6.5 Copyright © Cengage Learning. All rights reserved. c). 6 d). 7 3|9 Trimmed Mean • Order the data and remove k% of the data values from the bottom and top. • 5% and 10% trimmed means are common. • Then compute the mean with the remaining data values. Copyright © Cengage Learning. All rights reserved. 3 | 10 Resistant Measures of Central Tendency • A resistant measure will not be affected by extreme values in the data set. • The mean is not resistant to extreme values. • The median is resistant to extreme values. • A trimmed mean is also resistant. Copyright © Cengage Learning. All rights reserved. 3 | 11 Critical Thinking • Four levels of data – nominal, ordinal, interval, ratio (Chapter 1) • Mode – can be used with all four levels. • Median – may be used with ordinal, interval, of ratio level. • Mean – may be used with interval or ratio level. Copyright © Cengage Learning. All rights reserved. 3 | 12 Critical Thinking • Mound-shaped data – values of mean, median and mode are nearly equal. Copyright © Cengage Learning. All rights reserved. 3 | 13 Critical Thinking • Skewed-left data – mean < median < mode. Copyright © Cengage Learning. All rights reserved. 3 | 14 Critical Thinking • Skewed-right data – mean > median > mode. Copyright © Cengage Learning. All rights reserved. 3 | 15 Weighted Average • At times, we may need to assign more importance (weight) to some of the data values. xw Weighted Average w • x is a data value. • w is the weight assigned to that value. Copyright © Cengage Learning. All rights reserved. 3 | 16 Measures of Variation Three measures of variation: range variance standard deviation • Range = Largest value – smallest value Only two data values are used in the computation, so much of the information in the data is lost. Copyright © Cengage Learning. All rights reserved. 3 | 17 Sample Variance and Standard Deviation Sample Variance Sample Standard Deviation n 2 ( x x ) i i1 2 s n 1 s s 2 Find the standard deviation of the data set. {2,4,6} a). 2 b). 3 Copyright © Cengage Learning. All rights reserved. c). 4 d). 3.67 3 | 18 Sample Variance and Standard Deviation Sample Variance Sample Standard Deviation n 2 ( x x ) i i1 2 s n 1 s s 2 Find the standard deviation of the data set. {2,4,6} a). 2 b). 3 Copyright © Cengage Learning. All rights reserved. c). 4 d). 3.67 3 | 19 Population Variance and Standard Deviation Population Variance N (x ) i 2 Population Standard Deviation 2 i 1 N Copyright © Cengage Learning. All rights reserved. 2 3 | 20 The Coefficient of Variation For Samples s CV 100 x Copyright © Cengage Learning. All rights reserved. For Populations CV 100 3 | 21 Chebyshev’s Theorem Copyright © Cengage Learning. All rights reserved. 3 | 22 Chebyshev’s Theorem Copyright © Cengage Learning. All rights reserved. 3 | 23 Critical Thinking • Standard deviation or variance, along with the mean, gives a better picture of the data distribution than the mean alone. • Chebyshev’s theorem works for all kinds of data distribution. • Data values beyond 2.5 standard deviations from the mean may be considered as outliers. Copyright © Cengage Learning. All rights reserved. 3 | 24 Percentiles and Quartiles • For whole numbers P, 1 ≤ P ≤ 99, the Pth percentile of a distribution is a value such that P% of the data fall below it, and (100-P)% of the data fall at or above it. • Q1 = 25th Percentile • Q2 = 50th Percentile = The Median • Q3 = 75th Percentile Copyright © Cengage Learning. All rights reserved. 3 | 25 Quartiles and Interquartile Range (IQR) Copyright © Cengage Learning. All rights reserved. 3 | 26 Computing Quartiles Copyright © Cengage Learning. All rights reserved. 3 | 27 Five Number Summary • A listing of the following statistics: – Minimum, Q1, Median, Q3, Maximum • Box-and-Whisder plot – represents the fivenumber summary graphically. Copyright © Cengage Learning. All rights reserved. 3 | 28 Box-and-Whisker Plot Construction Copyright © Cengage Learning. All rights reserved. 3 | 29 Critical Thinking • Box-and-whisker plots display the spread of data about the median. • If the median is centered and the whiskers are about the same length, then the data distribution is symmetric around the median. • Fences – may be placed on either side of the box. Values lie beyond the fences are outliers. (See problem 10) Copyright © Cengage Learning. All rights reserved. 3 | 30 Critical Thinking Which of the following box-and-whiskers plots suggests a symmetric data distribution? (a) (b) Copyright © Cengage Learning. All rights reserved. (c) (d) 3 | 31 Critical Thinking Which of the following box-and-whiskers plots suggests a symmetric data distribution? (a) (b) Copyright © Cengage Learning. All rights reserved. (c) (d) 3 | 32