Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Student’s Solutions Manual and Study Guide: Chapter 3 Page 1 Chapter 3 Descriptive Statistics LEARNING OBJECTIVES The focus of Chapter 3 is on the use of statistical techniques to describe data, thereby enabling you to: 1. 2. 3. 4. 5. Apply various measures of central tendency—including the mean, median, and mode—to a set of ungrouped data. Apply various measures of variability—including the range, interquartile range, mean absolute deviation, variance, and standard deviation (using the empirical rule and Chebyshev’s theorem)—to a set of ungrouped data. Compute the mean, median, mode, standard deviation, and variance of grouped data. Describe a data distribution statistically and graphically using skewness, kurtosis, and box-and-whisker plots. Use computer packages to compute various measures of central tendency, variation, and shape on a set of data, as well as to describe the data distribution graphically. Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 2 CHAPTER OUTLINE 3.1 Measures of Central Tendency: Ungrouped Data Mean Median Mode Percentiles Quartiles 3.2 Measures of Variability - Ungrouped Data Range Interquartile Range Mean Absolute Deviation, Variance, and Standard Deviation Mean Absolute Deviation Variance Standard Deviation Meaning of Standard Deviation Empirical Rule Chebyshev’s Theorem Population Versus Sample Variance and Standard Deviation Computational Formulas for Variance and Standard Deviation Z Scores Coefficient of Variation 3.3 Measures of Central Tendency and Variability - Grouped Data Measures of Central Tendency Mean Median Mode Measures of Variability 3.4 Measures of Shape Skewness Skewness and the Relationship of the Mean, Median, and Mode Coefficient of Skewness Kurtosis Box and Whisker Plot 3.5 Descriptive Statistics on the Computer Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 3 KEY TERMS Arithmetic Mean Bimodal Box and Whisker Plot Chebyshev’s Theorem Coefficient of Skewness Coefficient of Variation (CV) Deviation from the Mean Empirical Rule Interquartile Range Kurtosis Leptokurtic Mean Absolute Deviation (MAD) Measures of Central Tendency Measures of Shape Measures of Variability Median Mesokurtic Mode Multimodal Percentiles Platykurtic Quartiles Range Skewness Standard Deviation Sum of Squares of x Variance z Score Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 4 STUDY QUESTIONS 1. Statistical measures used to yield information about the centre or middle part of a group of numbers are called _______________. 2. The "average" is the _______________. 3. The value occurring most often in a group of numbers is called _______________. 4. In a set of 110 numbers arranged in order, the median is located at the _______________ position. 5. If a set of data has an odd number of values arranged in ascending order, the median is the ________________ value. Consider the data: 5, 4, 6, 6, 4, 5, 3, 2, 6, 4, 6, 3, 5 Answer questions 6-8 using this data. 6. The mode is _______________. 7. The median is _______________. 8. The mean is _______________. 9. If a set of values is a population, then the mean is denoted by _______________. 10. In computing a mean for grouped data, the _______________ is used to represent all data in a given class interval. 11. The mean for the data given below is _______________. Class Interval 50 - under 53 53 - under 56 56 - under 59 59 - under 62 62 - under 65 Frequency 14 17 29 31 18 12. Measures of variability describe the _______________ of a set of data. Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 5 Use the following population data for Questions 13-17: 27 65 28 61 34 91 61 37 58 31 43 47 44 20 48 50 49 43 19 52 13. The range of the data is _______________. 14. The value of Q1 is _____________, Q2 is ____________, and Q3 is ____________. 15. The interquartile range is _____________. 16. The value of the 34th percentile is ________________. 17. The value of the Pearsonian coefficient of skewness for these data is ________________. 18. The Mean Absolute Deviation is computed by averaging the _______________ of deviations around the mean. 19. Subtracting each value of a set of data from the mean produces _______________ from the mean. 20. The sum of the deviations from the mean is always _______________. 21. The variance is the _______________ of the standard deviation. 22. The population variance is computed by using _______________ in the denominator. Whereas, the sample variance is computed by using _______________ in the denominator. 23. If the sample standard deviation is 9, then the sample variance is _______________. Consider the data below and answer questions 24-26 using the data: 2, 3, 6, 12 24. The mean absolute deviation for this data is _______________. 25. The sample variance for this data is _______________. 26. The population standard deviation for this data is _______________. 27. In estimating what proportion of values fall within so many standard deviations of the mean, a researcher should use _______________ if the shape of the distribution of numbers is unknown. Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 28. Page 6 Suppose a distribution of numbers is mound shaped with a mean of 150 and a variance of 225. Approximately _______________ percent of the values fall between 120 and 180. Between _______________ and _______________ fall 99.7% of these values. 29. The shape of a distribution of numbers is unknown. The distribution has a mean of 275 and a standard deviation of 12. The value of k for 299 is _______________. At least _______________ percent of the values fall between 251 and 299. 30. Suppose data are normally distributed with a mean of 36 and a standard deviation of 4.8. The z score for 30 is _______________. The z score for 40 is _______________. 31. A normal distribution of values has a mean of 74 and a standard deviation of 21. The coefficient of variation for this distribution is _______________? Consider the data below and use the data to answer questions 32-35. Class Interval 2- 4 4- 6 6- 8 8-10 10-12 12-14 Frequency 5 12 14 15 8 4 32. The sample variance for the data above is _______________. 33. The population standard deviation for the data above is _______________. 34. The mode of the data is _________________________. 35. The median of the data is _________________________. 36. If a unimodal distribution has a mean of 50, a median of 48, and a mode of 47, the distribution is skewed _______________. 37. If the value of Sk is positive, then it may be said that the distribution is ________________________ skewed. 38. The peakedness of a distribution is called _________________________. 39. If a distribution is flat and spread out, then it is referred to as _______________________; if it is "normal" in shape, then it is referred to as _________________________; if it is high and thin, then it is referred to as __________________________. Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 7 40. In a box plot, the inner fences are computed by ____________________ and _____________________. The outer fences are computed by ___________________ and _____________________. 41. Data values that lie outside the mainstream of values in a distribution are referred to as __________________. Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 8 ANSWERS TO STUDY QUESTIONS 1. Measures of Central Tendency 22. N, n – 1 2. Mean 23. 81 3. Mode 24. 3.25 4. 55.5th 25. 20.25 5. Middle 26. 3.897 6. 6 27. Chebyshev’s Theorem 7. 5 28. 95, 105, and 195 8. 4.54 29. 2, 75 9. 30. –1.25, 0.83 10. Class Midpoint 31. 28.38% 11. 58.11 32. 7.54 12. Spread or Dispersion 33. 2.72 13. 72 34. 9 14. Q1 = 32.5, Q2 = 45.5, Q3 = 55 35. 7.7143 15. IQR = 22.5 36. Right 16. P34 = 37 37. Positively 17. Sk = -0.018 38. Kurtosis 18. Absolute Value 39. Platykurtic, Mesokurtic, Leptokurtic 40. Q1 - 1.5 IQR and Q3 + 1.5 IQR 19. Deviations Q1 - 3.0 IQR and Q3 + 3.0 IQR 20. Zero 41. Outliers 21. Square Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 9 SOLUTIONS TO PROBLEMS IN CHAPTER 3 3.1 Mean 17.3 44.5 µ = x/N = (333.6)/8 = 41.7 31.6 40.0 52.8 x = x/n = (333.6)/8 = 41.7 38.8 30.1 78.5 (It is not stated in the problem whether the x = 333.6 data represent as population or a sample). 3.3 Median for values in 3.1 Arrange in ascending order: 17.3 , 30.1 , 31.6 , 38.8 , 40 , 44.5 , 52.8 , 78.5 There are 8 terms. Since there is an even number of terms, the median is the average of the two middle values, 38.8 and 40. The median = 38.8 40 78.8 39.4 2 2 n1 th 8 1 term = = 4.5th term. 2 2 The median is located halfway between the 4th and 5th terms or the average of 38.8 and 40. So, the median is 39.4. Using the formula, the median is located at the Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 3.5 Page 10 Mode 2, 2, 3, 3, 4, 4, 4, 4, 5, 6, 7, 8, 8, 8, 9 4 is the most frequently occurring value The mode = 4 3.7 Rearranging the data in ascending order: 80, 94, 97, 105, 107, 112, 116, 116, 118, 119, 120, 127, 128, 138, 138, 139, 142, 143, 144, 145, 150, 162, 171, 172 n = 24 For P20: i 20 (24) 4.8 100 Thus, P20 is located at the 4 + 1 = 5th term and For P47: i P20 = 107 47 (24) 11.28 100 Thus, P47 is located at the 11 + 1 = 12th term and For P83: i P47 = 127 83 (24) 19.92 100 Thus, P83 is located at the 19 + 1 = 20th term and P83 = 145 Q1 = P25 For P25: i 25 (24) 6 100 Thus, Q1 is located at the 6.5th term and Q1 = (112 + 116)/ 2 = Q2 = Median 24 1 th 12.5 term 2 th The median is located at the: Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition 114 Student’s Solutions Manual and Study Guide: Chapter 3 Page 11 Thus, Q2 = (127 + 128)/ 2 = 127.5 Q3 = P75 For P75: i 75 (24) 18 100 Thus, Q3 is located at the 18.5th term and 3.9 Q3 = (143 + 144)/ 2 = 143.5 Rearranging the data into ascending order: 1,738,160 1,804,523 2,337,237 2,443,040 2,639,735 2,886,343 3,161,955 3,581,445 6,118,221 6,793,714 6,867,465 8,157,058 12 1 The median is located at the = 6.5th term 2 Thus, Q2 = (2,886,343 + 3,161,955)/ 2 = 3,024,149 th Q3 = P75 For P75: i 75 (12) 9 100 Thus, Q3 is located at the 9.5th term and Q3 = (6,118,221 + 6,793,714)/ 2 = 6,455,967.5 For P20: i 20 (12) 2.4 100 P20 is located at the 2 + 1 = 3rd term, For P60: i 60 (12) 7.2 100 P60 is located at the 7 + 1 = 8th term, For P80: i P20 = 2,337,237 P60 = 3,581,445 80 (12) 9.6 100 P80 is located at the 9 + 1 = 10th term, P80 = 6,793,714 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 i For P93: Page 12 93 (12) 11.16 100 P93 is located at the 11 + 1 = 12th term, 3.11 x 6 2 4 9 1 3 5 x = 30 6-4.2857 = x-µ = P93 = 8,157,058 x -µ 1.7143 2.2857 0.2857 4.7143 3.2857 1.2857 0.7143 14.2857 (x-µ)2 2.9388 5.2244 0.0816 22.2246 10.7958 1.6530 0.5102 (x -µ)2 = 43.4284 x 30 4.2857 N 7 a.) Range = 9 - 1 = 8 x 14.2857 2.0408 N 7 ( x ) 2 43.4284 c.) 2 = = 6.2041 N 7 b.) M.A.D. = d.) = ( x ) 2 6.2041 = 2.4908 N e.) Arranging the data in order: Q1 = P25 1, 2, 3, 4, 5, 6, 9 i = 25 (7) = 1.75 100 Q1 is located at the 2nd term, Q1 = 2 Q3 = P75: i = 75 (7) = 5.25 100 Q3 is located at the 6th term, Q3 = 6 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 13 IQR = Q3 - Q1 = 6 - 2 = 4 f.) 3.13 z = 6 4.2857 = 0.69 2.4908 z = 2 4.2857 = -0.92 2.4908 z = 4 4.2857 = -0.11 2.4908 z = 9 4.2857 = 1.89 2.4908 z = 1 4.2857 = -1.32 2.4908 z = 3 4.2857 = -0.52 2.4908 z = 5 4.2857 = 0.29 2.4908 a.) x 12 23 19 26 24 23 x = 127 = = (x-µ) 12-21.167= -9.167 1.833 -2.167 4.833 2.833 1.833 (x -µ) = -0.002 (x -µ)2 84.034 3.360 4.696 23.358 8.026 3.360 2 (x -µ) = 126.834 x 127 = 21.167 N 6 ( x ) 2 126.834 21.139 = 4.598 N 6 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition ORIGINAL FORMULA Student’s Solutions Manual and Study Guide: Chapter 3 Page 14 b.) x 12 23 19 26 24 23 x = 127 x2 144 529 361 676 576 529 2 x = 2815 = (x) 2 N N x 2 = 4.598 (127) 2 6 6 2815 2815 2688.17 126.83 21.138 6 6 SHORT-CUT FORMULA The short-cut formula is faster, but the original formula gives insight into the meaning of a standard deviation. 3.15 2 = 58,631.295 = 242.139 x = 6886 x2 = 3,901,664 n = 16 µ = 430.375 3.17 a) 1 1 1 3 1 .75 2 2 4 4 .75 b) 1 1 1 1 .84 2 2.5 6.25 .84 c) 1 1 1 1 .609 2 1.6 2.56 .609 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 d) 3.19 1 1 1 1 .902 2 3.2 10.24 Page 15 .902 x xx 7 5 10 12 9 8 14 3 11 13 8 6 106 1.833 3.833 1.167 3.167 0.167 0.833 5.167 5.833 2.167 4.167 0.833 2.833 32.000 ( x x) 2 3.361 14.694 1.361 10.028 0.028 0.694 26.694 34.028 4.694 17.361 0.694 8.028 121.665 x 106 = 8.833 n 12 x x 32 a) MAD = = 2.667 n 12 x b) s2 = ( x x) 2 121.665 = 11.06 n 1 11 c) s = s 2 11.06 = 3.326 d) Rearranging terms in order: 3 5 6 7 8 8 9 10 11 12 13 14 Q1 = P25: i = (.25)(12) = 3 Q1 = the average of the 3rd and 4th terms: Q1 = (6 + 7)/2 = 6.5 Q3 = P75: i = (.75)(12) = 9 Q3 = the average of the 9th and 10th terms: IQR = Q3 - Q1 = 11.5 – 6.5 = 5 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Q3 = (11 + 12)/2 = 11.5 Student’s Solutions Manual and Study Guide: Chapter 3 e.) z = f.) CV = 3.21 Page 16 6 8.833 = - 0.85 3.326 (3.326)(100) = 37.65% 8.833 = 12 µ = 125 68% of the values fall within: µ ± 1 = 125 ± 1(12) = 125 ± 12 between 113 and 137 95% of the values fall within: µ ± 2 = 125 ± 2(12) = 125 ± 24 between 101 and 149 99.7% of the values fall within: µ ± 3 = 125 ± 3(12) = 125 ± 36 between 89 and 161 3.23 1 = .80 k2 1 1 - .80 = k2 1- .20 = k2 = 5 and 1 k2 and .20k2 = 1 k = 2.236 2.236 standard deviations Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 3.25 Page 17 =4 µ = 29 Between 21 and 37 days: x1 21 29 8 = -2 Standard Deviations 4 4 x2 37 29 8 = 2 Standard Deviations 8 4 Since the distribution is normal, the empirical rule states that 95% of the values fall within µ ± 2 . Exceed 37 days: Since 95% fall between 21 and 37 days, 5% fall outside this range. Since the normal distribution is symmetrical, 2½% fall below 21 and above 37. Thus, 2½% lie above the value of 37. Exceed 41 days: x 41 29 12 = 3 Standard deviations 4 4 The empirical rule states that 99.7% of the values fall within µ ± 3 = 29 ± 3(4) = 29 ± 12. That is, 99.7% of the values will fall between 17 and 41 days. 0.3% will fall outside this range and half of this or .15% will lie above 41. Less than 25: µ = 29 x = 4 25 29 4 = -1 Standard Deviation 4 4 According to the empirical rule, µ ± 1 contains 68% of the values. 29 ± 1(4) = 29 ± 4 Therefore, between 25 and 33 days, 68% of the values lie and 32% lie outside this range with ½(32%) = 16% less than 25. Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 3.27 Class 0- 2 2- 4 4- 6 6- 8 8 - 10 10 - 12 12 - 14 Mean: = Page 18 f 39 27 16 15 10 8 6 f = 121 M 1 3 5 7 9 11 13 fM 39 81 80 105 90 88 78 fM = 561 fM 561 = 4.64 f 121 Median: N f 121 The observation 60.5, 50% of N = 121, is in the class corresponding to 2–under 4. Therefore, l 2 , F 39 , f 27 and w 2 . N 121 39 F w 2 2 2 2 21.5 2 3.59 median grouped l 2 f 27 27 Mode: The modal class is 0 – 2. The midpoint of the modal class = the mode = 1 3.29 Class f M fM 20-30 30-40 40-50 50-60 60-70 70-80 Total 7 11 18 13 6 4 59 25 35 45 55 65 75 175 385 810 715 390 300 2775 = fM 2775 = 47.034 f 59 M-µ -22.0339 -12.0339 (M - µ)2 485.4927 144.8147 f(M - µ)2 3398.449 1592.962 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 - 2.0339 7.9661 17.9661 27.9661 2 = 4.1367 74.462 63.4588 824.964 322.7808 1936.685 782.1028 3128.411 Total 10,955.933 f ( M ) 2 10,955.93 = 185.694 f 59 = 3.31 Page 19 185.694 Class 18 - 24 24 - 30 30 - 36 36 - 42 42 - 48 48 - 54 54 - 60 60 - 66 66 - 72 f 17 22 26 35 33 30 32 21 15 f= 231 a.) Mean: x = 13.627 M 21 27 33 39 45 51 57 63 69 fM 357 594 858 1,365 1,485 1,530 1,824 1,323 1,035 fM= 10,371 fM2 7,497 16,038 28,314 53,235 66,825 78,030 103,968 83,349 71,415 2 fM = 508,671 fM fM 10,371 = 44.896 n f 231 b.) Median: N f 231 The observation 115.5, 50% of N = 231, is in the class corresponding to 42 –under 48. Therefore, l 42 , F 17 22 26 35 100 , f 33 and w 6 . N 231 100 F w 42 2 6 42 15.5 6 44.818 median grouped l 2 33 f 33 c.) Mode. The Modal Class = 36-42. The mode is the class midpoint = 39 (fM ) 2 (10,371) 2 508,671 n 231 43,053.5065 = 187.189 n 1 230 230 fM 2 d.) s2 = Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 e.) s = Page 20 187.189 = 13.682 3.33 f M fM fM2 0 - 50,000 18 25,000 450,000 11,250,000,000 50,000 - 100,000 4 75,000 300,000 22,500,000,000 100,000 - 150,000 7 125,000 875,000 109,375,000,000 150,000 - 200,000 7 175,000 1,225,000 214,375,000,000 200,000 - 250,000 2 225,000 450,000 101,250,000,000 250,000 - 300,000 0 275,000 0 0 300,000 - 350,000 1 325,000 325,000 105,625,000,000 350,000 - 400,000 0 375,000 0 0 400,000 - 450,000 1 f 40 425,000 425,000 f M 4,050,000 180,625,000,000 f M 2 745,000,000,000 Class a.) Mean: fM 4,050,000 = = 101,250 f 40 b.) Median: N f 40 The observation 20, 50% of N = 40, is in the class corresponding to 50,000 – under 100,000. Therefore, l 50,000 , F 18 , f 4 and w 50,000 . N 40 18 F w 50,000 2 50,000 median grouped l 2 f 4 2 50,000 50,000 75,000 4 c.) Mode. The Modal Class = 0 - 50,000 The mode is the midpoint of this class = 25,000. d.) Variance: Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 21 (fM ) 2 (4,050,000) 2 fM 745,000,000,000 N 40 = = 8,373,437,500 N 40 2 2 e.) Standard Deviation: = 3.35 2 8,373,437,500 = 91,506.49 mean = $35 median = $33 mode = $21 The stock prices are skewed to the right. While many of the stock prices are at the cheaper end, a few extreme prices at the higher end pull the mean. 3.37 3( M d ) 3(551 319) = 0.726 959 Because the value of Sk is positive, the distribution is positively skewed. Sk = 3.39 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 n = 24 Q1 = P25: 25 ( 24) = 6 i = 100 Thus, Q1 is located at the 6.5th term and Q1 = (497 + 503)/ 2 = 500 (n 1) th 12.5th term 2 Median = Q2 = (558 +559)/ 2 = 558.5 Median: Q3 = P75: i = 75 ( 24) = 18 100 Thus, Q3 is located at the 18.5th term and Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Page 22 Student’s Solutions Manual and Study Guide: Chapter 3 Page 23 Q3 = (588 + 590)/ 2 = 589 IQR = 589 - 500 = 89 Inner Fences: Q1 - 1.5 IQR = 500 - 1.5 (89) = 366.5 and Q3 + 1.5 IQR = 589 + 1.5 (89) = 722.5 Outer Fences: Q1 - 3.0 IQR = 500 - 3 (89) = 233 and Q3 + 3.0 IQR = 589 + 3 (89) = 856 The distribution is negatively skewed. There are no mild or extreme outliers. 3.41 Arranging the values in an ordered array: 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 3, 3, 3, 3, 3, 3, 4, 4, 5, 6, 8 Mean: x x 75 = 2.5 n 30 Mode = 2 (There are eleven 2’s) Median: There are n = 30 terms. n 1 30 1 31 The median is located at = 15.5th position. 2 2 2 th Median is the average of the 15th and 16th value. However, since these are both 2, the median is 2. Range = 8 - 1 = 7 Q1 = P25: 25 (30) = 7.5 100 i = Q1 is the 8th term = 1 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Q3 = P75: i = Page 24 75 (30) = 22.5 100 Q3 is the 23rd term = 3 IQR = Q3 - Q1 = 3 - 1 = 2 3.43 Mean: µ = x 126,904 = 6,345.2 N 20 The median is located at the n 1 th value = 21/2 = 10.5th value 2 The median is the average of 5,414 and 5,563 = 5,488.5 P30: i = (.30)(20) = 6 P30 is located at the average of the 6th and 7th terms P30 = (4,507+4,541)/2 = 4,524 P60: i = (.60)(20) = 12 P60 is located at the average of the 12th and 13th terms P60 = (6,101+6,498)/2 = 6,299.5 P90: i = (.90)(20) = 18 P90 is located at the average of the 18th and 19th terms P90 = (9,863+11,019)/2 = 10,441 Q1 = P25: i = (.25)(20) = 5 Q1 is located at the average of the 5th and 6th terms Q1 = (4,464+4,507)/2 = 4,485.5 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Q3 = P75: Page 25 i = (.75)(20) = 15 Q3 is located at the average of the 15th and 16th terms Q3 = (6,796+8,687)/2 = 7,741.5 Range = 11,388 - 3,619 = 7,769 IQR = Q3 - Q1 = 7,741.5 - 4,485.5 = 3,256 3.45 x2 x 12.5 156.25 9.7 94.09 6.4 40.96 5.3 28.09 4.4 19.36 4.1 16.81 3.9 15.21 3.6 12.96 3.5 12.25 3.2 10.24 2.9 8.41 x 414.63 x = 59.5 N = 11 a.) 2 x = 59.5 x 2 414.63 x = 59.5/11 = 5.41 N (11 1) th Median: 6th term 2 Median = Q2 = 4.1 Since there is no extreme value, the median has no advantage over the mean. Mean: µ= Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 b.) Page 26 Range = 12.5 – 2.9 = 9.6 25 (11) = 2.75 Q1 = P25: i = 100 Thus, Q1 is located at the 3rd term and Q1 = 3.5. Q3 = P75: i = 75 (11) = 8.25 100 Thus, Q3 is located at the 9th term and Q3 = 6.4 IQR = Q3 - Q1 = 6.4 – 3.5 = 2.9 c.) Variance: 2 = (x) 2 (59.5) 2 414.63 N 11 = 8.435 N 11 x 2 Standard Deviation: = d.) 2 8.435 = 2.904 Saudi Aramco: x 12.5 5.41 = 2.44 2.904 Daily capacity for Saudi Aramco is 2.44 standard deviations over the mean daily capacity. z = ExxonMobil: x 5.3 5.41 = -0.04 2.904 Daily capacity for ExxonMobil is 0.04 standard deviations under the mean daily capacity. z = e.) Skewness: Sk = 3( M d ) 3(5.41 4.1) = 1.35 2.904 Because the value of Sk is positive, the distribution is positively skewed. Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 27 3.47 Class f 15-20 9 20-25 16 25-30 27 30-35 44 35-40 42 40-45 23 45-50 7 50-55 2 f = 170 M fM fM2 17.5 157.5 2756.25 22.5 360.0 8100.00 27.5 742.5 20418.75 32.5 1430.0 46475.00 37.5 1575.0 59062.50 42.5 977.5 41543.75 47.5 332.5 15793.75 52.5 105.0 5512.50 2 fM = 5680.0 fM = 199662.50 fM 5680 = 33.412 f 170 a.) Mean: = Median: N f 170 The observation 85, 50% of N = 170, is in the class corresponding to 30 – under 35. Therefore, l 30 , F 9 16 27 52 , f 44 and w 5 . N 170 52 F w 30 2 5 30 33 5 33.75 median grouped l 2 f 44 44 Mode: The Modal Class is 30-35. The class midpoint is the mode = 32.5. b.) Variance: (fM ) 2 (5680.0) 2 199,662.50 n 170 s2 = = 58.483 n 1 169 Standard Deviation: fM 2 s = 3.49 s 2 58.483 = 7.647 CVX = x 3.45 (100%) (100%) = 10.78% x 32 CVY = y 5.40 (100%) (100%) = 6.43% y 84 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 28 Stock X has a greater relative variability. 3.51 µ = 419, = 27 a.) 68%: µ + 1 419 + 27 95%: µ + 2 419 + 2(27) 365 to 473 99.7%: µ + 3 419 + 3(27) 338 to 500 392 to 446 b.) Use Chebyshev’s theorem: Each of the points, 359 and 479 is a distance of 60 from the mean, µ = 419. k = (distance from the mean)/ = 60/27 = 2.22 Proportion = 1 - 1/k2 = 1 - 1/(2.22)2 = .797 = 79.7% 400 419 = -0.704. This worker is in the lower half of 27 workers but within one standard deviation of the mean. c.) Since x = 400, z = 3.53 Mean Median Mode $30,468 $26,721 $25,000 Since these three measures are not equal, the distribution is skewed. The distribution is skewed to the right because the mean is greater than the median. Often, the median is preferred in reporting income data because it yields information about the middle of the data while ignoring extremes. 3.55 Paris: Since 1 - 1/k2 = .53, solving for k: k = 1.459 The distance from µ = 349 to x = 381 and to x = 317 is 32 1.459 = 32 Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 29 = 21.93 Moscow: Since 1 - 1/k2 = .83, solving for k: k = 2.425 The distance from µ = 415 to x = 459 and to x = 371 is 44 2.425 = 44 = 18.14 3.57 First of all, the graph indicates that the data are very skewed to the right. This is supported by the output which yields a skewness value of 3.6214. This is a very high positive skewness number telling us that there are a few advertisers in the Hispanic market who are spending extremely large numbers on advertising while most of the other companies are spending moderate to less amounts. In fact, looking at the graph, there is a very high modal group of advertisers centering around the $ 5 million mark. The output reveals that the mean amount being spent by Hispanic advertisers is $ 7.8560 million while the median is $ 5.75 million. The gap between the mean and the median supports the notion that the data are positively skewed. The standard deviation of the advertising data is 5.8860. If we apply the empirical rule to these data using the mean and the standard deviation, we can see that there are less than one standard deviation between the mean and the minimum value reinforcing the notion that the data are not normally distributed (because there should be at least three standard deviations of data on either side of the mean if the data are approximately normally distributed) and are likely to be positively skewed. The “N” indicates that there were 50 companies in this data analysis. The minimum amount spent on advertising cultivating the Hispanic market by one of these companies is $ 3.25 million and the maximum spent is $ 40.00 million. Also, presented in the data are the first and third quartiles. Observe the gap between the third quartile and the maximum amount. The “range” of this top 25 percent is over $31 million (8.625 – 40.00) and this is where the skewness is occurring. 3.59 The N indicates that there are 25 companies in this database. The mean advertising expenditure is $ 772,702 thousands while the median is $ 613,823 thousands. The gap between the mean and the median indicates that there is positive skewness. While no skewness figure is given in the MINITAB output, the boxplot shows extreme outliers on the right or positive side of the data Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 30 supporting the conclusion that there is positive skewness. Since the median is located on the left side of the “box”, the data are likely to be skewed to the right. The standard deviation is $ 436,067 thousand. Using this figure, it is possible to see that there is less than one standard deviation between the mean and the minimum ($ 445,958). If we apply the empirical rule to these data using the mean and the standard deviation, we can see that the data are not normally distributed (because there should be at least three standard deviations of data on either side of the mean if the data are approximately normally distributed) and are likely to be positively skewed. The values of the first quartile ($ 484,600 thousand) and the third quartile ($788,256 thousand) are also given and can be used to compute the interquartile deviation (middle 50% of the data), construct the boxplot, and indicate advertising expenditures for both the bottom and top 25 percent of companies. Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition Student’s Solutions Manual and Study Guide: Chapter 3 Page 31 Legal Notice Copyright Copyright © 2014 by John Wiley & Sons Canada, Ltd. or related companies. All rights reserved. The data contained in these files are protected by copyright. This manual is furnished under licence and may be used only in accordance with the terms of such licence. The material provided herein may not be downloaded, reproduced, stored in a retrieval system, modified, made available on a network, used to create derivative works, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise without the prior written permission of John Wiley & Sons Canada, Ltd. (MMXIII xii FI) Black, Chakrapani, Castillo: Business Statistics, Second Canadian Edition