Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Section 2.4 Measures of Variation © 2012 Pearson Education, Inc. All rights reserved. 1 of 149 Section 2.4 Objectives • Determine the range of a data set • Determine the variance and standard deviation of a population and of a sample • Use the Empirical Rule and Chebychev’s Theorem to interpret standard deviation • Approximate the sample standard deviation for grouped data © 2012 Pearson Education, Inc. All rights reserved. 2 of 149 Range Range • The difference between the maximum and minimum data entries in the set. • The data must be quantitative. • Range = (Max. data entry) – (Min. data entry) © 2012 Pearson Education, Inc. All rights reserved. 3 of 149 Example: Finding the Range A corporation hired 10 graduates. The starting salaries for each graduate are shown. Find the range of the starting salaries. Starting salaries (1000s of dollars) 41 38 39 45 47 41 44 41 37 42 © 2012 Pearson Education, Inc. All rights reserved. 4 of 149 Solution: Finding the Range • Ordering the data helps to find the least and greatest salaries. 37 38 39 41 41 41 42 44 45 47 minimum • Range = (Max. salary) – (Min. salary) = 47 – 37 = 10 maximum The range of starting salaries is 10 or $10,000. © 2012 Pearson Education, Inc. All rights reserved. 5 of 149 Deviation, Variance, and Standard Deviation Deviation • The difference between the data entry, x, and the mean of the data set. • Population data set: • Deviation of x = x – μ • Sample data set: • Deviation of x x x © 2012 Pearson Education, Inc. All rights reserved. 6 of 149 Example: Finding the Deviation A corporation hired 10 graduates. The starting salaries for each graduate are shown. Find the deviation of the starting salaries. Starting salaries (1000s of dollars) 41 38 39 45 47 41 44 41 37 42 Solution: • First determine the mean starting salary. x 415 41.5 N 10 © 2012 Pearson Education, Inc. All rights reserved. 7 of 149 Solution: Finding the Deviation • Determine the deviation for each data entry. © 2012 Pearson Education, Inc. All rights reserved. Deviation ($1000s) Salary ($1000s), x x–μ 41 41 – 41.5 = –0.5 38 38 – 41.5 = –3.5 39 39 – 41.5 = –2.5 45 45 – 41.5 = 3.5 47 47 – 41.5 = 5.5 41 41 – 41.5 = –0.5 44 44 – 41.5 = 2.5 41 41 – 41.5 = –0.5 37 37 – 41.5 = –4.5 42 Σx = 415 42 – 41.5 = 0.5 Σ(x – μ) = 0 8 of 149 Deviation, Variance, and Standard Deviation Population Variance • ( x ) N 2 2 Sum of squares, SSx Population Standard Deviation • 2 ( x ) 2 N © 2012 Pearson Education, Inc. All rights reserved. 9 of 149 Finding the Population Variance & Standard Deviation In Words 1. Find the mean of the population data set. 2. Find the deviation of each entry. In Symbols x N x–μ 3. Square each deviation. (x – μ)2 4. Add to get the sum of squares. SSx = Σ(x – μ)2 © 2012 Pearson Education, Inc. All rights reserved. 10 of 149 Finding the Population Variance & Standard Deviation In Words 5. Divide by N to get the population variance. 6. Find the square root of the variance to get the population standard deviation. © 2012 Pearson Education, Inc. All rights reserved. In Symbols 2 ( x ) 2 N ( x ) 2 N 11 of 149 Example: Finding the Population Standard Deviation A corporation hired 10 graduates. The starting salaries for each graduate are shown. Find the population variance and standard deviation of the starting salaries. Starting salaries (1000s of dollars) 41 38 39 45 47 41 44 41 37 42 Recall μ = 41.5. © 2012 Pearson Education, Inc. All rights reserved. 12 of 149 Solution: Finding the Population Standard Deviation • Determine SSx • N = 10 Deviation: x – μ Squares: (x – μ)2 41 41 – 41.5 = –0.5 (–0.5)2 = 0.25 38 38 – 41.5 = –3.5 (–3.5)2 = 12.25 39 39 – 41.5 = –2.5 (–2.5)2 = 6.25 45 45 – 41.5 = 3.5 (3.5)2 = 12.25 47 47 – 41.5 = 5.5 (5.5)2 = 30.25 41 41 – 41.5 = –0.5 (–0.5)2 = 0.25 44 44 – 41.5 = 2.5 (2.5)2 = 6.25 41 41 – 41.5 = –0.5 (–0.5)2 = 0.25 37 37 – 41.5 = –4.5 (–4.5)2 = 20.25 42 42 – 41.5 = 0.5 (0.5)2 = 0.25 Σ(x – μ) = 0 SSx = 88.5 Salary, x © 2012 Pearson Education, Inc. All rights reserved. 13 of 149 Solution: Finding the Population Standard Deviation Population Variance ( x ) 88.5 8.9 • N 10 2 2 Population Standard Deviation • 2 8.85 3.0 The population standard deviation is about 3.0, or $3000. © 2012 Pearson Education, Inc. All rights reserved. 14 of 149 Deviation, Variance, and Standard Deviation Sample Variance • ( x x ) s n 1 2 2 Sample Standard Deviation • 2 ( x x ) s s2 n 1 © 2012 Pearson Education, Inc. All rights reserved. 15 of 149 Finding the Sample Variance & Standard Deviation In Words In Symbols x n 1. Find the mean of the sample data set. x 2. Find the deviation of each entry. xx 3. Square each deviation. ( x x )2 4. Add to get the sum of squares. SS x ( x x ) 2 © 2012 Pearson Education, Inc. All rights reserved. 16 of 149 Finding the Sample Variance & Standard Deviation In Words 5. Divide by n – 1 to get the sample variance. 6. Find the square root of the variance to get the sample standard deviation. © 2012 Pearson Education, Inc. All rights reserved. In Symbols 2 ( x x ) s2 n 1 ( x x ) 2 s n 1 17 of 149 Example: Finding the Sample Standard Deviation The starting salaries are for the Chicago branches of a corporation. The corporation has several other branches, and you plan to use the starting salaries of the Chicago branches to estimate the starting salaries for the larger population. Find the sample standard deviation of the starting salaries. Starting salaries (1000s of dollars) 41 38 39 45 47 41 44 41 37 42 © 2012 Pearson Education, Inc. All rights reserved. 18 of 149 Solution: Finding the Sample Standard Deviation • Determine SSx • n = 10 Deviation: x – μ Squares: (x – μ)2 41 41 – 41.5 = –0.5 (–0.5)2 = 0.25 38 38 – 41.5 = –3.5 (–3.5)2 = 12.25 39 39 – 41.5 = –2.5 (–2.5)2 = 6.25 45 45 – 41.5 = 3.5 (3.5)2 = 12.25 47 47 – 41.5 = 5.5 (5.5)2 = 30.25 41 41 – 41.5 = –0.5 (–0.5)2 = 0.25 44 44 – 41.5 = 2.5 (2.5)2 = 6.25 41 41 – 41.5 = –0.5 (–0.5)2 = 0.25 37 37 – 41.5 = –4.5 (–4.5)2 = 20.25 42 42 – 41.5 = 0.5 (0.5)2 = 0.25 Σ(x – μ) = 0 SSx = 88.5 Salary, x © 2012 Pearson Education, Inc. All rights reserved. 19 of 149 Solution: Finding the Sample Standard Deviation Sample Variance ( x x ) 88.5 9.8 • s n 1 10 1 2 2 Sample Standard Deviation 88.5 3.1 • s s 9 2 The sample standard deviation is about 3.1, or $3100. © 2012 Pearson Education, Inc. All rights reserved. 20 of 149 Example: Using Technology to Find the Standard Deviation Sample office rental rates (in dollars per square foot per year) for Miami’s central business district are shown in the table. Use a calculator or a computer to find the mean rental rate and the sample standard deviation. (Adapted from: Cushman & Wakefield Inc.) © 2012 Pearson Education, Inc. All rights reserved. Office Rental Rates 35.00 33.50 37.00 23.75 26.50 31.25 36.50 40.00 32.00 39.25 37.50 34.75 37.75 37.25 36.75 27.00 35.75 26.00 37.00 29.00 40.50 24.50 33.00 38.00 21 of 149 Solution: Using Technology to Find the Standard Deviation Sample Mean Sample Standard Deviation © 2012 Pearson Education, Inc. All rights reserved. 22 of 149 Interpreting Standard Deviation • Standard deviation is a measure of the typical amount an entry deviates from the mean. • The more the entries are spread out, the greater the standard deviation. © 2012 Pearson Education, Inc. All rights reserved. 23 of 149 Interpreting Standard Deviation: Empirical Rule (68 – 95 – 99.7 Rule) For data with a (symmetric) bell-shaped distribution, the standard deviation has the following characteristics: • About 68% of the data lie within one standard deviation of the mean. • About 95% of the data lie within two standard deviations of the mean. • About 99.7% of the data lie within three standard deviations of the mean. © 2012 Pearson Education, Inc. All rights reserved. 24 of 149 Interpreting Standard Deviation: Empirical Rule (68 – 95 – 99.7 Rule) 99.7% within 3 standard deviations 95% within 2 standard deviations 68% within 1 standard deviation 34% 2.35% x 3s 34% 13.5% x 2s 13.5% x s © 2012 Pearson Education, Inc. All rights reserved. x xs 2.35% x 2s x 3s 25 of 149 Example: Using the Empirical Rule In a survey conducted by the National Center for Health Statistics, the sample mean height of women in the United States (ages 20-29) was 64.3 inches, with a sample standard deviation of 2.62 inches. Estimate the percent of the women whose heights are between 59.06 inches and 64.3 inches. © 2012 Pearson Education, Inc. All rights reserved. 26 of 149 Solution: Using the Empirical Rule • Because the distribution is bell-shaped, you can use the Empirical Rule. 34% + 13.5% = 47.5% of women are between 59.06 and 64.3 inches tall. © 2012 Pearson Education, Inc. All rights reserved. 27 of 149 Chebychev’s Theorem • The portion of any data set lying within k standard deviations (k > 1) of the mean is at least: 1 1 2 k • k = 2: In any data set, at least 1 3 1 2 or 75% 2 4 of the data lie within 2 standard deviations of the mean. • k = 3: In any data set, at least 1 8 1 2 or 88.9% 3 9 of the data lie within 3 standard deviations of the mean. © 2012 Pearson Education, Inc. All rights reserved. 28 of 149 Example: Using Chebychev’s Theorem The age distribution for Florida is shown in the histogram. Apply Chebychev’s Theorem to the data using k = 2. What can you conclude? © 2012 Pearson Education, Inc. All rights reserved. 29 of 149 Solution: Using Chebychev’s Theorem k = 2: μ – 2σ = 39.2 – 2(24.8) = – 10.4 (use 0 since age can’t be negative) μ + 2σ = 39.2 + 2(24.8) = 88.8 At least 75% of the population of Florida is between 0 and 88.8 years old. © 2012 Pearson Education, Inc. All rights reserved. 30 of 149 Standard Deviation for Grouped Data Sample standard deviation for a frequency distribution • • ( x x ) 2 f s n 1 where n = Σf (the number of entries in the data set) When a frequency distribution has classes, estimate the sample mean and the sample standard deviation by using the midpoint of each class. © 2012 Pearson Education, Inc. All rights reserved. 31 of 149 Example: Finding the Standard Deviation for Grouped Data You collect a random sample of the number of children per household in a region. Find the sample mean and the sample standard deviation of the data set. © 2012 Pearson Education, Inc. All rights reserved. Number of Children in 50 Households 1 3 1 1 1 1 2 2 1 0 1 1 0 0 0 1 5 0 3 6 3 0 3 1 1 1 1 6 0 1 3 6 6 1 2 2 3 0 1 1 4 1 1 2 2 0 3 0 2 4 32 of 149 Solution: Finding the Standard Deviation for Grouped Data • First construct a frequency distribution. • Find the mean of the frequency distribution. xf 91 x 1.8 n 50 The sample mean is about 1.8 children. x f xf 0 10 0(10) = 0 1 19 1(19) = 19 2 7 2(7) = 14 3 7 3(7) =21 4 2 4(2) = 8 5 1 5(1) = 5 6 4 6(4) = 24 Σf = 50 Σ(xf )= 91 © 2012 Pearson Education, Inc. All rights reserved. 33 of 149 Solution: Finding the Standard Deviation for Grouped Data • Determine the sum of squares. x f xx ( x x )2 0 10 0 – 1.8 = –1.8 (–1.8)2 = 3.24 3.24(10) = 32.40 1 19 1 – 1.8 = –0.8 (–0.8)2 = 0.64 0.64(19) = 12.16 2 7 2 – 1.8 = 0.2 (0.2)2 = 0.04 0.04(7) = 0.28 3 7 3 – 1.8 = 1.2 (1.2)2 = 1.44 1.44(7) = 10.08 4 2 4 – 1.8 = 2.2 (2.2)2 = 4.84 4.84(2) = 9.68 5 1 5 – 1.8 = 3.2 (3.2)2 = 10.24 10.24(1) = 10.24 6 4 6 – 1.8 = 4.2 (4.2)2 = 17.64 17.64(4) = 70.56 ( x x )2 f ( x x )2 f 145.40 © 2012 Pearson Education, Inc. All rights reserved. 34 of 149 Solution: Finding the Standard Deviation for Grouped Data • Find the sample standard deviation. x 2 x ( x x )2 ( x x ) f 145.40 s 1.7 n 1 50 1 ( x x )2 f The standard deviation is about 1.7 children. © 2012 Pearson Education, Inc. All rights reserved. 35 of 149 Section 2.4 Summary • Determined the range of a data set • Determined the variance and standard deviation of a population and of a sample • Used the Empirical Rule and Chebychev’s Theorem to interpret standard deviation • Approximated the sample standard deviation for grouped data © 2012 Pearson Education, Inc. All rights reserved. 36 of 149