Download Appendix B Concepts in Statistics Line Plots Stem-and

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

History of statistics wikipedia , lookup

Time series wikipedia , lookup

Regression toward the mean wikipedia , lookup

Misuse of statistics wikipedia , lookup

Transcript
Larson/Hostetler/Edwards - Precalculus 6th Edition.
Section B.1
Appendix B
B.1
What you should learn
Why you should learn it
Line plots, stem-and-leaf plots,
and histograms are quick methods for determining which
elements in a set of data occur
with the greatest frequency. For
instance, in Exercise 9 on page
B4, you are asked to construct
a frequency distribution and a
histogram of employees’ monthly
salary contributions to a retirement plan.
Test Scores
93, 70, 76, 58, 86, 93, 82, 78,
83, 86, 64, 78, 76, 66, 83, 83,
96, 74, 69, 76, 64, 74, 79, 76,
88, 76, 81, 82, 74, 70
B1
Concepts in Statistics
Line Plots
Statistics is the branch of mathematics that studies techniques for collecting,
organizing, and interpreting data. In this section, you will study several ways to
organize data. The first is a line plot, which uses a portion of a real number line
to order numbers. Line plots are especially useful for ordering small sets of
numbers (about 50 or less) by hand.
Example 1 Leaves
8
4469
0044466666889
122333668
336
Constructing a Line Plot
Use a line plot to organize the following test scores. Which number occurs with
the greatest frequency?
93, 70, 76, 67, 86, 93, 82, 78, 83, 86, 64, 78, 76, 66, 83,
83, 96, 74, 69, 76, 64, 74, 79, 76, 88, 76, 81, 82, 74, 70
Solution
Begin by scanning the data to find the smallest and largest numbers. For this data,
the smallest number is 64 and the largest is 96. Next, draw a portion of a real
number line that includes the interval 64, 96. To create the line plot, start with
the first number, 93, and enter an above 93 on the number line. Continue
recording ’s for each number in the list until you obtain the line plot shown in
Figure B.1. From the line plot, you can see that 76 had the greatest frequency.
×
×
× ×× ××
65
70
×
×
× ×
×
× × ×
××
× × ×× ×××
75
80
×
× ×
85
×
×
90
×
95
100
Test scores
FIGURE
Stems
5
6
7
8
9
Representing Data
Representing Data
• How to use line plots to order
and analyze data
• How to use stem-and-leaf plots
to organize and compare data
• How to use histograms to
represent frequency
distributions
B.1
Stem-and-Leaf Plots
Another type of plot that can be used to organize sets of numbers by hand is a
stem-and-leaf plot. A set of test scores and the corresponding stem-and-leaf plot
are shown at the left.
Larson/Hostetler/Edwards - Precalculus 6th Edition.
B2
Appendix B
Concepts in Statistics
Note that the leaves represent the units digits of the numbers and the stems
represent the tens digits. Stem-and-leaf plots can also be used to compare two sets
of data, as shown in the following example.
Example 2 AK
AL
AR
AZ
CA
CO
CT
DC
DE
FL
GA
HI
IA
ID
IL
IN
KS
KY
LA
MA
MD
ME
MI
MN
MO
MS
5.7
13.0
14.0
13.0
10.6
9.7
13.8
12.2
13.0
17.6
9.6
13.3
14.9
11.3
12.1
12.4
13.3
12.5
11.6
13.5
11.3
14.4
12.3
12.1
13.5
12.1
MT
NC
ND
NE
NH
NJ
NM
NV
NY
OH
OK
OR
PA
RI
SC
SD
TN
TX
UT
VA
VT
WA
WI
WV
WY
13.4
12.0
14.7
13.6
12.0
13.2
11.7
11.0
12.9
13.3
13.2
12.8
15.6
14.5
12.1
14.3
12.4
9.9
8.5
11.2
12.7
11.2
13.1
15.3
11.7
Use a stem-and-leaf plot to compare the test scores given on the previous page
with the following test scores. Which set of test scores is better?
90, 81, 70, 62, 64, 73, 81, 92, 73, 81, 92, 93, 83, 75, 76,
83, 94, 96, 86, 77, 77, 86, 96, 86, 77, 86, 87, 87, 79, 88
Solution
Begin by ordering the second set of scores.
62, 64, 70, 73, 73, 75, 76, 77, 77, 77, 79, 81, 81, 81, 83,
83, 86, 86, 86, 86, 87, 87, 88, 90, 92, 92, 93, 94, 96, 96
Now that the data has been ordered, you can construct a double stem-and-leaf plot
by letting the leaves to the right of the stems represent the units digits for the first
group of test scores and letting the leaves to the left of the stems represent the
units digits for the second group of test scores.
Leaves (2nd Group)
42
977765330
877666633111
6643220
Leaves
7
Stems
5
6
7
8
9
Leaves (1st Group)
8
4469
0044466666889
122333668
336
By comparing the two sets of leaves, you can see that the second group of test
scores is better than the first group.
Example 3 Stems
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
Comparing Two Sets of Data
Using a Stem-and-Leaf Plot
The table at the left above shows the percent of the population of each state and
the District of Columbia that was at least 65 years old in 2000. Use a stem-and-leaf
plot to organize the data. (Source: U.S. Census Bureau)
Solution
5
679
Begin by ordering the numbers, as shown below.
6
5.7, 8.5, 9.6, 9.7, 9.9, 10.6, 11.0, 11.2, 11.2, 11.3, 11.3, 11.6, 11.7, 12.0,
0223367
12.0, 12.1, 12.1, 12.1, 12.1, 12.2, 12.3, 12.4, 12.4, 12.5, 12.7, 12.8, 12.9,
00111123445789
13.0, 13.0, 13.0, 13.1, 13.2, 13.2, 13.3, 13.3, 13.3, 13.4, 13.5, 13.5, 13.6,
000122333455678
13.7, 13.8, 14.0, 14.3, 14.4, 14.5, 14.7, 14.9, 15.3, 15.6, 17.6
034579
Next construct the stem-and-leaf plot using the leaves to represent the digits to
36
the right of the decimal points, as shown at the left. From the stem-and-leaf plot,
you can see that Alaska has the lowest percent and Florida has the highest
6
percent.
Larson/Hostetler/Edwards - Precalculus 6th Edition.
Section B.1
Representing Data
B3
Histograms and Frequency Distributions
7, 9
|
9, 11
||||
11, 13 | | | | | | | | | | | | | | | | |
13, 15 | | | | | | | | | | | | | | | | |
Number of states
With data such as that given in Example 3, it is useful to group the numbers into
intervals and plot the frequency of the data in each interval. For instance, the
frequency distribution and histogram shown in Figure B.2 represent the data
given in Example 3.
Frequency Distribution
Histogram
Interval
Tally
|
5, 7
20
15
10
5
15, 17 | |
5
17, 19 |
7
9 11 13 15 17 19
Percent of population
65 or older
FIGURE
B.2
A histogram has a portion of a real number line as its horizontal axis. A
histogram is similar to a bar graph, except that the rectangles (bars) in a bar graph
can be either horizontal or vertical and the labels of the bars are not necessarily
numbers.
Another difference between a bar graph and a histogram is that the bars in a
bar graph are usually separated by spaces, whereas the bars in a histogram are not
separated by spaces.
Number of sales
representatives
Interval
100–109
110–119
120–129
130–139
140–149
150–159
160–169
170–179
180–189
190–199
Tally
|||| |||
|
|||
|||
|||| ||
||||
|||| |||
|||| |
||
||||
Example 4 A company has 48 sales representatives who sold the following numbers of units
during the first quarter of 2003. Construct a grouped frequency distribution for
this data.
107
105
150
109
171
153
8
7
6
5
4
3
2
1
160
Units sold
B.3
162
193
153
171
163
107
184
167
164
150
118
124
170
149
167
138
142
162
177
195
171
100
107
192
102
127
163
164
144
134
145
193
141
147
100
187
141
191
129
153
132
177
Solution
100 120 140
FIGURE
Constructing a Histogram
180
200
To begin constructing a grouped frequency distribution, you must first decide on
the number of groups. There are several ways to group this data. However,
because the smallest number is 100 and the largest is 195, it seems that 10 groups
of 10 each would be appropriate. The first group would be 100–109, the second
group would be 110–119, and so on. By tallying the data into the 10 groups, you
obtain the distribution shown at the left above. A histogram for the distribution is
shown in Figure B.3.
Larson/Hostetler/Edwards - Precalculus 6th Edition.
B4
Appendix B
Concepts in Statistics
B.1 Exercises
1. Gasoline Prices The line plot shows a sample of
prices of unleaded regular gasoline from 25 different
cities.
×
×
×
×
×
× × ×
×
×
×
×
×
× × ×
× × × ×
×
×
×
×
×
0.989 1.009 1.029 1.049 1.069 1.089 1.109 1.129 1.149 1.169 1.189
(a) What price occurred with the greatest frequency?
(b) What is the range of prices?
2. Agriculture The line plot shows the weights (to the
nearest hundred pounds) of 30 head of cattle sold by a
rancher.
×
×
600
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
800
×
×
×
×
×
×
×
×
×
×
1000
1200
×
×
1400
(a) What weight occurred with the greatest frequency?
(b) What is the range of weights?
Quiz and Exam Scores
In Exercises 3–6, use the following scores from a math class of 30 students. The scores are
for two 25-point quizzes and two 100-point exams.
Quiz #1 20, 15, 14, 20, 16, 19, 10, 21, 24, 15, 15, 14,
15, 21, 19, 15, 20, 18, 18, 22, 18, 16, 18, 19, 21, 19, 16,
20, 14, 12
Quiz #2 22, 22, 23, 22, 21, 24, 22, 19, 21, 23, 23, 25,
24, 22, 22, 23, 23, 23, 23, 22, 24, 23, 22, 24, 21, 24, 16,
21, 16, 14
Exam #1 77, 100, 77, 70, 83, 89, 87, 85, 81, 84, 81, 78,
89, 78, 88, 85, 90, 92, 75, 81, 85, 100, 98, 81, 78, 75, 85,
89, 82, 75
Exam #2 76, 78, 73, 59, 70, 81, 71, 66, 66, 73, 68, 67,
63, 67, 77, 84, 87, 71, 78, 78, 90, 80, 77, 70, 80, 64, 74,
68, 68, 68
3. Construct a line plot for each quiz. For each quiz,
which score occurred with the greatest frequency?
4. Construct a line plot for each exam. For each exam,
which score occurred with the greatest frequency?
5. Construct a stem-and-leaf plot for Exam #1.
6. Construct a double stem-and-leaf plot to compare the
scores for Exam #1 and Exam #2. Which set of scores
is higher?
7. Insurance The table shows the total numbers of
persons (in thousands) without health insurance
coverage in the 50 states and the District of Columbia
in 2000. Use a stem-and-leaf plot to organize the data.
(Source: U.S. Census Bureau)
AK
CO
GA
IN
MD
MS
NH
OH
SC
VA
WY
125
563
1135
701
501
364
85
1255
448
886
70
AL
CT
HI
KS
ME
MT
NJ
OK
SD
VT
600
263
117
301
145
162
1049
636
82
67
AR
DC
IA
KY
MI
NC
NM
OR
TN
WA
364
73
248
513
982
980
427
465
577
780
AZ
DE
ID
LA
MN
ND
NV
PA
TX
WI
793
82
196
810
430
69
311
905
4425
386
CA
FL
IL
MA
MO
NE
NY
RI
UT
WV
6281
2620
1659
595
586
164
2802
55
296
254
8. Meteorology The data shows the seasonal snowfall
(in inches) at Lincoln, Nebraska for the years 1972
through 2001 (the amounts are listed in order by year).
How would you organize this data? Explain your
reasoning. (Source: University of Nebraska–Lincoln)
29.2, 33.6, 42.1, 21.1, 21.8, 31.0, 34.4, 23.3, 13.0,
32.3, 38.0, 47.5, 21.5, 18.9, 15.7, 13.0, 19.1, 18.7,
25.8, 23.8, 32.1, 21.3, 21.8, 30.7, 29.0, 44.6, 24.4,
11.9, 37.9, 29.5
9. Retirement Contributions The employees of a
company must contribute 7% of their monthly salaries
to a company-sponsored retirement plan. The
contributed amounts (in dollars) for the company’s 35
employees are as follows.
100, 200, 130, 136, 161, 156, 209, 126, 135, 98, 114,
117, 168, 133, 140, 124, 172, 127, 143, 157,
124, 152, 104, 126, 155, 92, 194, 115, 120, 136, 148,
112, 116, 146, 96
(a) Construct a frequency distribution using groups
of 20. The first group should be 90–109.
(b) Construct a histogram for this frequency distribution.
Larson/Hostetler/Edwards - Precalculus 6th Edition.
Section B.2
B.2
Measures of Central Tendency and Dispersion
B5
Measures of Central Tendency and Dispersion
What you should learn
• How to find and interpret the
mean, median, and mode of a
set of data
• How to determine the measure
of central tendency that best
represents a set of data
• How to find the standard
deviation of a set of data
• How to create and use
box-and-whisker plots
Why you should learn it
Measures of central tendency
and dispersion provide a convenient way to describe and compare sets of data. For instance, in
Exercise 36 on page B13, the
mean and standard deviation are
used to analyze the price of gold
for the years 1981 through 2000.
Mean, Median, and Mode
In many real-life situations, it is helpful to describe data by a single number that is
most representative of the entire collection of numbers. Such a number is called a
measure of central tendency. The most commonly used measures are as follows.
1. The mean, or average, of n numbers is the sum of the numbers divided by n.
2. The median of n numbers is the middle number when the numbers are written in order. If n is even, the median is the average of the two middle numbers.
3. The mode of n numbers is the number that occurs most frequently. If two
numbers tie for most frequent occurrence, the collection has two modes and is
called bimodal.
Example 1 Comparing Measures of Central Tendency
On an interview for a job, the interviewer tells you that the average annual income
of the company’s 25 employees is $60,849. The actual annual incomes of the 25
employees are shown below. What are the mean, median, and mode of the
incomes? Was the person telling you the truth?
$17,305,
$478,320,
$45,678,
$18,980,
$17,408,
$25,676,
$28,906,
$12,500,
$24,540,
$33,450,
$12,500,
$33,855,
$37,450,
$20,432,
$28,956,
$34,983,
$36,540,
$250,921,
$36,853,
$16,430,
$32,654,
$98,213,
$48,980,
$94,024,
$35,671
Solution
The mean of the incomes is
17,305 478,320 45,678 18,980 . . . 35,671
25
1,521,225
$60,849.
25
Mean To find the median, order the incomes as follows.
$12,500,
$12,500,
$16,430,
$17,305,
$18,980,
$20,432,
$24,540,
$25,676,
$28,956,
$32,654,
$33,450,
$33,855,
$35,671,
$36,540,
$36,853,
$37,450,
$48,980,
$94,024,
$98,213,
$250,921,
$17,408,
$28,906,
$34,983,
$45,678,
$478,320
From this list, you can see that the median (the middle number) is $33,450. From
the same list, you can see that $12,500 is the only income that occurs more than
once. So, the mode is $12,500. Technically, the person was telling the truth
because the average is (generally) defined to be the mean. However, of the three
measures of central tendency Mean: $60,849 Median: $33,450 Mode: $12,500
it seems clear that the median is most representative. The mean is inflated by the
two highest salaries.
Larson/Hostetler/Edwards - Precalculus 6th Edition.
B6
Appendix B
Concepts in Statistics
Choosing a Measure of Central Tendency
Which of the three measures of central tendency is the most representative? The
answer is that it depends on the distribution of the data and the way in which you
plan to use the data.
For instance, in Example 1, the mean salary of $60,849 does not seem very
representative to a potential employee. To a city income tax collector who wants
to estimate 1% of the total income of the 25 employees, however, the mean is
precisely the right measure.
Example 2 Choosing a Measure of Central Tendency
Which measure of central tendency is the most representative of the data shown
in each frequency distribution?
a. Number
1
2
3
4
5
6
7
8
9
Tally
7
20
15
11
8
3
2
0
15
b. Number
1
2
3
4
5
6
7
8
9
Tally
9
8
7
6
5
6
7
8
9
c. Number
1
2
3
4
5
6
7
8
9
Tally
6
1
2
3
5
5
4
3
0
Solution
a. For this data, the mean is 4.23, the median is 3, and the mode is 2. Of these,
the mode is probably the most representative.
b. For this data, the mean and median are each 5 and the modes are 1 and 9 (the
distribution is bimodal). Of these, the mean or median is the most representative.
c. For this data, the mean is 4.59, the median is 5, and the mode is 1. Of these,
the mean or median is the most representative.
Variance and Standard Deviation
Very different sets of numbers can have the same mean. You will now study two
measures of dispersion, which give you an idea of how much the numbers in a
set differ from the mean of the set. These two measures are called the variance of
the set and the standard deviation of the set.
Definitions of Variance and Standard Deviation
Consider a set of numbers x1, x2, . . . , xn with a mean of x. The variance
of the set is
x x 2 x2 x2 . . . xn x2
v 1
n
and the standard deviation of the set is v ( is the lowercase Greek
letter sigma).
Larson/Hostetler/Edwards - Precalculus 6th Edition.
Section B.2
Measures of Central Tendency and Dispersion
B7
The standard deviation of a set is a measure of how much a typical number
in the set differs from the mean. The greater the standard deviation, the more the
numbers in the set vary from the mean. For instance, each of the following sets
has a mean of 5.
5, 5, 5, 5,
4, 4, 6, 6,
and
3, 3, 7, 7
The standard deviations of the sets are 0, 1, and 2.
1 5 52 5 52 5 52 5 52
4
0
2 4 5
2
4 52 6 52 6 52
4
1
3 3 5
2
3 52 7 52 7 52
4
2
Example 3 Estimations of Standard Deviation
Consider the three sets of data represented by the bar graphs in Figure B.4. Which
set has the smallest standard deviation? Which has the largest?
Se t A
Set B
Set C
5
5
5
4
4
4
3
3
3
2
2
2
1
1
1
1 2 3 4 5 6 7
FIGURE
1 2 3 4 5 6 7
1 2 3 4 5 6 7
B.4
Solution
Of the three sets, the numbers in set A are grouped most closely to the center and
the numbers in set C are the most dispersed. So, set A has the smallest standard
deviation and set C has the largest standard deviation.
Larson/Hostetler/Edwards - Precalculus 6th Edition.
B8
Appendix B
Concepts in Statistics
Example 4 Finding Standard Deviation
Find the standard deviation of each set shown in Example 3.
Solution
Because of the symmetry of each bar graph, you can conclude that each has a
mean of x 4. The standard deviation of set A is
(32 222 312 502 312 222 32
17
1.53.
The standard deviation of set B is
23
2
222 212 202 212 222 232
14
2.
The standard deviation of set C is
53
2
422 312 202 312 422 532
26
2.22.
These values confirm the results of Example 3. That is, set A has the smallest
standard deviation and set C has the largest.
The following alternative formula provides a more efficient way to compute
the standard deviation.
Alternative Formula for Standard Deviation
The standard deviation of x1, x2, . . . , xn is
x
2
1
x22 . . . xn2
x 2.
n
Because of messy computations, this formula is difficult to verify. Conceptually,
however, the process is straightforward. It consists of showing that the expressions
x1 x2 x2 x2 . . . xn x2
n
and
x
2
1
x22 . . . x n2
x2
n
are equivalent. Try verifying this equivalence for the set x1, x2, x3 with
x x1 x2 x33.
Larson/Hostetler/Edwards - Precalculus 6th Edition.
Section B.2
17
109
83
61
395
67
35
12
6
203
154
22
115
42
198
111
131
105
122
79
49
37
145
134
118
96
MT
NC
ND
NE
NH
NJ
NM
NV
NY
OH
OK
OR
PA
RI
SC
SD
TN
TX
UT
VA
VT
WA
WI
WV
WY
53
114
41
85
28
81
36
22
218
167
109
59
210
11
64
48
121
408
42
89
14
86
123
58
23
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
Measures of Central Tendency and Dispersion
B9
Using the Alternative Formula
Use the alternative formula for standard deviation to find the standard deviation
of the following set of numbers.
5, 6, 6, 7, 7, 8, 8, 8, 9, 10
Solution
Begin by finding the mean of the set, which is 7.4. So, the standard deviation is
5 26 27 10 38 9
568
54.76
10
2
2
2
2
2
102
7.42
2.04
1.43.
You can use the statistical features of a graphing utility to check this result.
A well-known theorem in statistics, called Chebychev’s Theorem, states that
at least
1
1 2
k
of the numbers in a distribution must lie within k standard deviations of the mean.
So, 75% of the numbers in a set must lie within two standard deviations of the
mean, and at least 88.9% of the numbers must lie within three standard deviations
of the mean. For most distributions, these percentages are low. For instance, in all
three distributions shown in Example 3, 100% of the numbers lie within two standard deviations of the mean.
Describing a Distribution
The table at the left above shows the number of hospitals (in thousands) in each
state and the District of Columbia in 1999. Find the mean and standard deviation
of the numbers. What percent of the numbers lie within two standard deviations
of the mean? (Source: Health Forum)
Solution
Begin by entering the numbers into a graphing utility that has a standard deviation program. After running the program, you should obtain
x 97.18
Number of hospitals
(in thousands)
FIGURE
Example 5 Example 6 0 - 49
50 - 99
100 - 149
150 - 199
200 - 249
250 - 299
300 - 349
350 - 399
400 - 499
Number of states
AK
AL
AR
AZ
CA
CO
CT
DC
DE
FL
GA
HI
IA
ID
IL
IN
KS
KY
LA
MA
MD
ME
MI
MN
MO
MS
B.5
and
81.99.
The interval that contains all numbers that lie within two standard deviations of
the mean is
97.18 281.99, 97.18 281.99
or
66.80, 261.16.
From the histogram in Figure B.5, you can see that all but two of the numbers
(96%) lie in this interval—all but the numbers that correspond to the number of
hospitals (in thousands) in California and Texas.
Larson/Hostetler/Edwards - Precalculus 6th Edition.
B10
Appendix B
Concepts in Statistics
Box-and-Whisker Plots
Standard deviation is the measure of dispersion that is associated with the mean.
Quartiles measure dispersion associated with the median.
Definition of Quartiles
Consider an ordered set of numbers whose median is m. The lower quartile
is the median of the numbers that occur before m. The upper quartile is the
median of the numbers that occur after m.
Example 7 Finding Quartiles of a Set
Find the lower and upper quartiles for the set.
34, 14, 24, 16, 12, 18, 20, 24, 16, 26, 13, 27
Solution
Begin by ordering the set.
12, 13, 14, 16, 16, 18, 20, 24, 24, 26, 27, 34
1st 25%
2nd 25%
3rd 25%
4th 25%
The median of the entire set is 19. The median of the six numbers that are less
than 19 is 15. So, the lower quartile is 15. The median of the six numbers that are
greater than 19 is 25. So, the upper quartile is 25.
Quartiles are represented graphically by a box-and-whisker plot, as shown
in Figure B.6. In the plot, notice that five numbers are listed: the smallest number,
the lower quartile, the median, the upper quartile, and the largest number. Also
notice that the numbers are spaced proportionally, as though they were on a real
number line.
12
FIGURE
15
19
25
34
B.6
The next example shows how to find quartiles when the number of elements
in a set is not divisible by 4.
Larson/Hostetler/Edwards - Precalculus 6th Edition.
Section B.2
Example 8 B11
Measures of Central Tendency and Dispersion
Sketching Box-and-Whisker Plots
Sketch a box-and-whisker plot for each set.
a. 27, 28, 30, 42, 45, 50, 50, 61, 62, 64, 66
b. 82, 82, 83, 85, 87, 89, 90, 94, 95, 95, 96, 98, 99
c. 11, 13, 13, 15, 17, 18, 20, 24, 24, 27
Solution
a. This set has 11 numbers. The median is 50 (the sixth number). The lower
quartile is 30 (the median of the first five numbers). The upper quartile is 62
(the median of the last five numbers). See Figure B.7.
27
30
FIGURE
B.7
50
62
66
b. This set has 13 numbers. The median is 90 (the seventh number). The lower
quartile is 84 (the median of the first six numbers). The upper quartile is 95.5
(the median of the last six numbers). See Figure B.8.
82
FIGURE
84
90
95.5
99
B.8
c. This set has 10 numbers. The median is 17.5 (the average of the fifth and sixth
numbers). The lower quartile is 13 (the median of the first five numbers). The
upper quartile is 24 (the median of the last five numbers). See Figure B.9.
11
FIGURE
13
17.5
24
27
B.9
B.2 Exercises
In Exercises 1–6, find the mean, median, and mode of the
set of measurements.
1.
2.
3.
4.
5.
6.
5, 12, 7, 14, 8, 9, 7
30, 37, 32, 39, 33, 34, 32
5, 12, 7, 24, 8, 9, 7
20, 37, 32, 39, 33, 34, 32
5, 12, 7, 14, 9, 7
30, 37, 32, 39, 34, 32
7. Reasoning Compare your answers for Exercises 1
and 3 with those for Exercises 2 and 4. Which of the
measures of central tendency is sensitive to extreme
measurements? Explain your reasoning.
8. Reasoning
(a) Add 6 to each measurement in Exercise 1 and
calculate the mean, median, and mode of the
revised measurements. How are the measures of
central tendency changed?
(b) If a constant k is added to each measurement in a
set of data, how will the measures of central
tendency change?
Larson/Hostetler/Edwards - Precalculus 6th Edition.
B12
Appendix B
Concepts in Statistics
9. Electric Bills A person had the following monthly
bills for electricity. What are the mean and median of
the collection of bills?
January
$67.92
February
$59.84
March
$52.00
April
$52.50
May
$57.99
June
$65.35
July
$81.76
August
$74.98
September
$87.82
October
$83.18
November
$65.35
December
$57.00
10. Car Rental A car rental company kept the following record of the numbers of miles a rental car was
driven. What are the mean, median, and mode of this
data?
Monday
410
Tuesday
260
Wednesday 320
Thursday
320
Friday
460
Saturday
150
11. Six-Child Families A study was done on families
having six children. The table shows the numbers of
families in the study with the indicated numbers of
girls. Determine the mean, median, and mode of this
set of data.
Number of girls
0
1
2
3
4
5
6
Frequency
1
24
45
54
50
19
7
12. Sports A baseball fan examined the records of a
favorite baseball player’s performance during his last
50 games. The numbers of games in which the player had 0, 1, 2, 3, and 4 hits are recorded in the table.
Number of hits
0
1
2
3
4
Frequency
14
26
7
2
1
(a) Determine the average number of hits per game.
(b) Determine the player’s batting average if he had
200 at-bats during the 50-game series.
13. Think About It Construct a collection of numbers
that has the following properties. If this is not possible, explain why it is not.
Mean 6, median 4, mode 4
14. Think About It Construct a collection of numbers
that has the following properties. If this is not possible, explain why it is not.
15. Test Scores A professor records the following
scores for a 100-point exam.
99, 64, 80, 77, 59, 72, 87, 79, 92, 88,
90, 42, 20, 89, 42, 100, 98, 84, 78, 91
Which measure of central tendency best describes
these test scores?
16. Shoe Sales A salesman sold eight pairs of men’s
black dress shoes. The sizes of the eight pairs were
as follows: 1012, 8, 12, 1012, 10, 912, 11, and 1012.
Which measure (or measures) of central tendency
best describes the typical shoe size for this data?
In Exercises 17–24, find the mean x , variance v, and
standard deviation of the set.
17.
18.
19.
20.
21.
22.
23.
24.
4, 10, 8, 2
3, 15, 6, 9, 2
0, 1, 1, 2, 2, 2, 3, 3, 4
2, 2, 2, 2, 2, 2
1, 2, 3, 4, 5, 6, 7
1, 1, 1, 5, 5, 5
49, 62, 40, 29, 32, 70
1.5, 0.4, 2.1, 0.7, 0.8
In Exercises 25–30, use the alternative formula to find the
standard deviation of the set.
25.
26.
27.
28.
29.
30.
2, 4, 6, 6, 13, 5
10, 25, 50, 26, 15, 33, 29, 4
246, 336, 473, 167, 219, 359
6.0, 9.1, 4.4, 8.7, 10.4
8.1, 6.9, 3.7, 4.2, 6.1
9.0, 7.5, 3.3, 7.4, 6.0
In Exercises 31 and 32, line plots of sets of data are given.
Determine the mean and standard deviation of each set.
31. (a)
(b)
×
×
×
8
10
×
×
×
16
18
(c)
×
8
Mean 6, median 6, mode 4
12
20
×
×
10
(d)
×
4
6
×
×
×
×
×
14
16
×
×
×
22
24
×
12
14
×
×
×
×
8
10
16
×
12
Larson/Hostetler/Edwards - Precalculus 6th Edition.
Section B.2
×
×
×
32. (a)
×
×
12
(b)
14
×
×
×
×
×
12
(c)
×
×
14
22
×
×
×
(d)
16
16
×
24
×
×
2
×
×
18
×
×
×
26
×
×
4
6
In Exercises 39–42, sketch a box-and-whisker plot for the
data without the aid of a graphing utility.
28
×
×
×
×
×
×
8
6
5
5
Frequency
Frequency
33. Reasoning Without calculating the standard deviation, explain why the set 4, 4, 20, 20 has a standard
deviation of 8.
34. Reasoning If the standard deviation of a set of
numbers is 0, what does this imply about the set?
35. Test Scores An instructor adds five points to each
student’s exam score. Will this change the mean or
standard deviation of the exam scores? Explain.
36. Price of Gold The following data represents the
average prices of gold (in dollars per fine ounce) for
the years 1981 to 2000. Use a computer or graphing
utility to find the mean, variance, and standard
deviation of the data. What percent of the data lies
within two standard deviations of the mean?
(Source: U.S. Bureau of Mines and U.S. Geological
Survey)
460,
376,
424,
361,
318,
368,
478,
438,
383,
385,
363,
345,
361,
385,
386,
389,
332,
295,
280,
280
37. Think About It The histograms represent the test
scores of two classes of a college course in mathematics. Which histogram has the smaller standard
deviation?
6
4
3
2
1
4
3
2
1
86
90
94
Score
98
B13
Measures of Central Tendency and Dispersion
38. Test Scores The scores of a mathematics exam
given to 600 science and engineering students at a
college had a mean and standard deviation of 235
and 28, respectively. Use Chebychev’s Theorem to
determine the intervals containing at least 43 and at
8
least 9 of the scores. How would the intervals change
if the standard deviation were 16?
18
×
×
×
×
×
×
×
×
×
×
×
×
×
84 88 92 96
Score
39.
40.
41.
42.
23, 15, 14, 23, 13, 14, 13, 20, 12
11, 10, 11, 14, 17, 16, 14, 11, 8, 14, 20
46, 48, 48, 50, 52, 47, 51, 47, 49, 53
25, 20, 22, 28, 24, 28, 25, 19, 27, 29, 28, 21
In Exercises 43–46, use a graphing utility to create a
box-and-whisker plot for the data.
43. 19, 12, 14, 9, 14, 15, 17, 13, 19, 11, 10, 19
44. 9, 5, 5, 5, 6, 5, 4, 12, 7, 10, 7, 11, 8, 9, 9
45. 20.1, 43.4, 34.9, 23.9, 33.5, 24.1, 22.5, 42.4, 25.7,
17.4, 23.8, 33.3, 17.3, 36.4, 21.8
46. 78.4, 76.3, 107.5, 78.5, 93.2, 90.3, 77.8, 37.1, 97.1,
75.5, 58.8, 65.6
47. Product Lifetime A company has redesigned a
product in an attempt to increase the lifetime of the
product. The two sets of data list the lifetimes (in
months) of 20 units with the original design and 20
units with the new design. Create a box-and-whisker
plot for each set of data, and then comment on the
differences between the plots.
Original Design
15.1
78.3
27.2
12.5
53.0
13.5
10.8
38.3
56.3
42.7
11.0
85.1
68.9
72.7
18.4
10.0
30.6
20.2
85.2
12.6
New Design
55.8
71.5
37.2
60.0
46.7
31.1
54.0
23.2
25.6
35.3
67.9
45.5
19.0
18.9
23.5
24.8
23.1
80.5
99.5
87.8