Download SP7-18 Degree of Overlap of Two Populations with the

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
Transcript
SP7-18
Degree of Overlap of Two Populations with the
Same Variability
Pages 194–196
Standards: 7.SP.B.3
Goals:
Students will evaluate the visual degree of overlap between two data sets, as displayed by box
plots or dot plots.
Prior Knowledge Required:
Can evaluate the mean and MAD of a data set
Can evaluate the median and IQR of a data set
Can draw dot plots and box plots
Can read and interpret dot plots and box plots
Understands the real-world interpretation of center and variability
Knows that larger samples are less variable
Vocabulary: average, box plot, center, data point, data set, degree of overlap, dot plots,
first quartile (Q1), frequency, interquartile range (IQR), mean, mean absolute deviation (MAD),
median, outlier, population, third quartile (Q3), samples, variability
Materials:
calculators
NOTE: Allow students to use calculators throughout this lesson.
Introduce the importance of comparing populations. Give examples of situations in which
people might be interested in comparing differences between populations, and discuss reasons
why the populations might be different. For example:
• Are basketball players generally taller than soccer players? (people who are taller might be
more likely to prefer basketball because it is easier to score a basket if you’re taller)
• Are words in a Grade 7 textbook generally longer than words in a Grade 2 textbook?
(Grade 7 students are able to read longer words)
• Do people in one country tend to live longer than people in another country? (some countries
have better health care systems)
• Does one company serve customers faster than another company? (one company might
have a more efficient system)
Allow students to suggest other possible comparisons.
Introduce the importance of overlap. For each example discussed above, emphasize the fact
that the comparisons are general. For example, not all basketball players are taller than all
soccer players. ASK: Will Grade 7 textbooks have some words that are shorter than some
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
S-49
words in a Grade 2 textbook? (yes: “a” and “an” will be in the Grade 7 textbook) SAY: There
will usually be some degree of overlap between the two populations. For example, Grade 7
textbooks will have some very short words and Grade 2 textbooks might have some longer
words, such as “centimeter.”
SAY: Sometimes two populations will have a lot of overlap and sometimes only a little. ASK: Which
of the two comparisons is more likely to have more overlap: comparing heights of 9-year-old
girls and 9-year-old boys, or comparing heights of 9-year-old girls and 39-year-old women?
(comparing heights of girls and boys) SAY: Statisticians want to know if there are general
differences between two populations, but they also want to know how much overlap there is
between them.
Determining the degree of overlap visually. SAY: You can sometimes tell just by looking at
two box plots or dot plots how much overlap there is between the two data sets. It’s easier to tell
if you plot the two data sets one on top of the other. Draw on the board:
B.
A.
Data Set 2:
Data Set 1:
0
1
2 3
4
5 6
7
8
9 10
0
1
2 3
4
5 6
7
8
9 10
ASK: Would you say that the data sets in A look as though they overlap a lot or a little? (a lot)
What about in B? (a little) Keep these dot plots on the board for use later in the lesson.
Using box plots to compare overlap in the middle half of the data. Tell students that when
sets are put in box plots, we can compare how much overlap there is in the middle half of the
data. Draw on the board:
Remind students that the boxes show the middle half of the data because 25% of the data is
before the first box, 25% is in the first box, 25% is in the second box, and 25% is after the
second box. SAY: To decide how much overlap there is in the boxes, imagine sliding the top
box straight down over the bottom box, and see which parts turn dark gray. ASK: Would most of
the boxes turn dark gray, or only a little? (most of them) SAY: When most of the boxes turn dark
gray, there is high overlap; when only a small part of the boxes turn dark gray, there is small
overlap; when none of the boxes turn dark gray, there is no overlap.
S-50
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
Exercises: Copy the box plots and color the part that overlaps. Describe the degree of overlap
between the two data sets as high overlap, small overlap, or no overlap.
a)
b)
c)
d)
e)
f)
Answers: a) high overlap, b) small overlap, c) small overlap, d) high overlap, e) no overlap,
f) small overlap
The variabilities and the difference between the means both affect the degree of overlap.
Tell students that in the previous examples, we compared data sets with the same variability,
and the farther apart their centers were, the less they overlapped. Tell students that there is
another factor that makes some pairs of data sets look as though they have high overlap and
others look as though they have small overlap. SAY: Suppose I have two sticks, and I place
them horizontally along a number line. Draw on the board:
0
1
2 3
4
5
6
7 8
9 10
SAY: Suppose I know that one of the sticks is centered at 4, and the other stick is centered at 6.
ASK: Can you tell me how much they overlap? (no) What else do you need to know about the
sticks? (how long they are) Continue drawing on the board:
0
1
2
3
4
5
6
7
8
9 10
0
1
2
3
4
5
6
7
8
9 10
SAY: Both pairs of sticks are centered at 4 and 6, but the first pair is longer. The first pair of
sticks overlap a lot, whereas the second pair of sticks don’t overlap at all. Keep these diagrams
on the board for use later in the lesson.
Tell students that when discussing overlap of data sets, they can use measures of center and
variability, with the variability representing the “length” of the stick. When using box plots, the
center is the median and the variability is the IQR. Refer students to the dot plots on the board.
ASK: When you compare overlap of dot plots, would you use the mean or median for the
center? (mean) Would you use the MAD or IQR as the measure of variability? (MAD)
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
S-51
NOTE: For part b) of the exercises below, you might give each student and each dot plot a
number and assign students to one of the four dot plots so that all four are represented.
Exercises: Look at the dot plots for Data Set 1 and Data Set 2 on the board.
a) Calculate the mean of all four dot plots.
b) Choose one of the four dot plots and calculate the MAD.
Answers: a) means for A: 3 and 4, B: 3 and 7; b) MADs are all 4/3
When students finish, take up the answers and point out that all four MADs are the same. ASK: How
could you have predicted that before checking? (the shapes are exactly the same, so how far
each data set is on average from the mean will be the same for all four dot plots) Did you have
to do any calculations to find the means, or was there an easier way? (the distributions are all
symmetric, so they are all centered at their mean, which you can read from the dot plot)
(MP.1, MP.7) ASK: When the dot plots have a high overlap, which is larger, the MADs or the
difference between the means? (MADs) When the dot plots have a small overlap, which is
larger, the MADs or the difference between the means? (difference between the means)
Refer students again to the sticks on the board. Point out that when the sticks are longer than
the difference between the centers, there is a high overlap. When the sticks are shorter than the
difference between the centers, there is no overlap.
(MP.2) Exercises: The dot plots below show the resting heart rates of babies less than 1 year
old and of teenagers.
Babies:
Teenagers:
60
70
80
90
100
110
120
130
140
150
160
a) Describe the degree of overlap between the two dot plots.
b) Calculate the means of both data sets.
c) What is the difference between the means?
d) Without doing calculations, do you think both data sets have the same MAD? How do
you know?
e) Calculate the MAD of both data sets
f) Which is larger, the difference between the means or the MAD? Does this fit with your answer
to part a)? Explain.
Answers: a) small overlap; b) 80 and 120; c) 40; d) yes, because the shapes of the distributions
are identical; e) 10; f) the difference between the means is larger than the variability, which
means there is a small overlap
S-52
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
(MP.2) Comparing populations with different variability and substantial overlap. Tell
students that, so far in this lesson, we have only compared data with the same variability. But
we can also compare data sets with different variability.
Draw on the board:
Time Sara Takes to Get to School (in Minutes)
Walking:
Taking the Bus:
10
15
20
25
30
SAY: Sometimes Sara walks to school and sometimes she takes the bus. One day when Sara
walked to school, she ran into someone she knew and stopped to chat for 10 minutes. Have a
volunteer point out which data point on the board represents this situation. (the outlier) Tell
students that Sara needs to get to school on time for an important test and is deciding whether
to take the bus or walk. ASK: Should we include the outlier when calculating the average time it
takes to get to school by walking? (no) Why not? (she has control over whether she stops to
chat) SAY: There might be other factors, such as having to wait at traffic lights, that she doesn’t
have control over, but we don’t have to count the outlier when comparing the data sets for this
purpose. Erase the outlier from the board before having students complete the exercises below.
Exercises: Look at the dot plots for Time Sara takes to get to school.
a) Find the mean of both data sets.
b) Which method gets Sara to school faster on average, taking the bus or walking?
c) Find the MAD of both data sets.
d) Which method is more reliable, the bus or walking? Use your answer to part c) to justify
your answer.
e) If you were Sara and needed to get to school on time for an important test, would you walk
or take the bus? Why?
f) How would you describe the overlap between the two sets by looking at the dot plots—no
overlap, small overlap, or high overlap?
Answers: a) walking: 20, taking the bus: 19; b) taking the bus; c) walking: 0.6, taking the
bus: 1.6; d) walking because the MAD is much lower; e) I would walk because I can control
how long it takes to get to school; f) answers will vary: both “small overlap” and “high overlap”
are acceptable answers at this stage for the following reasons: there is small overlap because
there is a lot of data in the “taking the bus” dot plot that does not overlap with the range of the
“walking” dot plot, there is high overlap because the “walking” dot plot is completely contained
in the “taking the bus” dot plot
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
S-53
Tell students that you want to find the median of both sets because you want to draw box
plots. SAY: There are 10 points, and 10 is even, so the median is halfway between the middle
two values. Have volunteers circle the middle two values on each data set, and have other
volunteers tell you the median walking time and the median taking the bus time (20 minutes,
18.5 minutes). Draw a line between the two middle values for each data set.
The plots should look like this:
Time Sara Takes to Get to School (in Minutes)
Walking:
Taking the Bus:
10
15
20
25
30
Before they complete the exercises below, remind students that Q1 is the median of the lower
half and Q3 is the median of the upper half of the data.
Exercises:
a) Find Q1 and Q3 for both data sets.
b) Draw box plots for both data sets.
c) How would you describe the degree of overlap from looking at the box plots?
Answers:
a) walking: Q1 = 19, Q3 = 21, taking the bus: Q1 = 18, Q3 = 21
b)
Walking:
Taking the bus:
15
16
17
18
19
20
21
22
23
c) high overlap, because 2/3 of the boxes would turn dark gray when you merge them
Comparing populations with different variability and small overlap. Draw on the board:
Heights (inches)
Students:
Teachers:
40
S-54
50
60
70
80
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
Tell students that at a K–8 school, ten randomly chosen students and ten randomly chosen
teachers recorded their heights. ASK: By looking at the dot plots, and without doing any
calculations, which would you say is more variable, the heights of the students or the heights
of the teachers? (the students) Why does that make sense? (students are still growing) Have
volunteers estimate, without doing any calculations, the mean of each data set. (student height
seems to be centered close to 50, and teacher height seems to be centered close to 70)
ASK: In these dot plots, is it rare for a student’s height to be 5 inches away from the mean?
(no, it happens quite a lot) Is it rare for a teacher’s height to be 5 inches away from the mean?
(yes, it only happens once) SAY: That is one way to describe the fact that teacher height is less
variable than student height. The teachers are more likely to be close to their mean height than
students are. ASK: Are there any students close to the teacher mean? (yes) Are there any
teachers close to the student mean? (no) SAY: The student population varies a lot more in
height than the teacher population does, so it’s not surprising to find some students near the
mean for teachers. It would be more surprising to find teachers near the mean for students.
Exercises: Here are two sentences from two different math textbooks:
Fill in the blank for the expanded form.
In the product of a number and a variable, the multiplication sign is usually dropped.
a) One sentence came from a Grade 2 textbook and the other from a Grade 7 textbook. Which
is which? How do you know?
b) Write a data set for each grade consisting of the lengths of the words from each sentence.
c) Draw the dot plots.
Answers: a) the second one is from Grade 7, Grade 2 students don’t multiply numbers and
variables; b) Grade 2 word lengths: 4, 2, 3, 5, 3, 3, 8, 4; Grade 7 word lengths: 2, 3, 7, 2, 1, 6, 3,
1, 8, 3, 14, 4, 2, 7, 7
c)
Grade 2:
Grade 7:
0
1
2
3
4
5
6
7
8
9
10 11
12 13 14
15 16
Tell students that you want them to find the mean and the MAD of both sets of data from the
previous exercises. Remind students that they can use frequency plots to help add the data
values. Below each column of the Grade 2 dot plot, write the total value of the dots in that
column. (see answers below)
2
9
8
5
8
SAY: Instead of adding all eight values, now we can just add these five values. This doesn’t
save much time for the Grade 2 data, but it will for the Grade 7 data.
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
S-55
Exercises: Complete the totals row for Grade 7. Find the mean of both data sets, to one
decimal place.
Answers:
Grade 2 mean: (2 + 9 + 8 + 5 + 8) ÷ 8 = 32 ÷ 8 = 4
Grade 7 mean: (2 + 6 + 9 + 4 + 6 + 21 + 8 + 14) ÷ 15 = 70 ÷ 15 ≈ 4.7
Remind students that the MAD is the average distance from the mean. Use the Grade 2 data
set from the previous exercises to write on the board:
Grade 2 data set:
2
3
3
3
4
4
5
8
Distance from the mean:
2
1
1
1
0
0
1
4
ASK: What is the sum of all the distances from the mean? (9) How many data values are there?
(8) SAY: So the average of the distances or MAD is 10 ÷ 8 = 1.25. Write this on the board.
Exercises:
a) Find the MAD of the Grade 7 data set, to one decimal place.
b) Which grade has the higher mean? The higher MAD?
Answers: a) 2.8, b) Grade 7 has the higher mean and the higher MAD
Extensions
(MP.7) 1. Here are two dot plots:
1
2
3
4
5
6
7
8
9
10
a) Do the dot plots have any data points in common?
b) Draw the box plots.
c) Describe the degree of overlap in the box plots.
Answers:
a) no
b)
1
2
3
4
5
6
7
8
9
10
c) high overlap
When students finish, emphasize that the degree of overlap between two sets is not about how
many data points they have in common, but an overall picture of how the values overlap.
S-56
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
(MP.1) 2. Teach students that they can use frequencies to find the MAD the same way they use
frequencies to find the mean, because the MAD is the mean of the deviations. Have students
find the mean and the MAD of the following set by inputting the frequencies into the table:
2, 2, 2, 3, 3, 3, 3, 4, 5, 5, 5, 6, 6, 6, 6, 6, 6, 6, 6, 7, 7, 7, 7, 7, 30
Data Value
Frequency
Totals
Distance from Mean
Totals of Distances
2
3
4
5
6
7
30
Answers:
Mean: (6 + 12 + 4 + 15 + 48 + 35 + 30) ÷ 25 = 6
MAD: (12 + 12 + 2 + 3 + 0 + 5 + 24) ÷ 25 = 2.32
Data Value
Frequency
Totals
Distance from Mean
Totals of Distances
2
3
6
4
12
3
4
12
3
12
4
1
4
2
2
5
3
15
1
3
6
8
48
0
0
7
5
35
1
5
30
1
30
24
24
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
S-57
SP7-19
Using Mean and MAD to Compare Populations
Pages 197–199
Standards: 7.SP.B.3, 7.SP.B.4
Goals:
Students will learn in general terms how confident they can be that two populations are different
based on looking at samples.
Students will express the difference between the means as a multiple of the MAD.
Prior Knowledge Required:
Can evaluate the mean and MAD of a data set
Can evaluate the median and IQR of a data set
Can draw dot plots and box plots
Can read and interpret dot plots and box plots
Understands the real-world interpretation of center and variability
Can determine the degree of visual overlap
Understands the relationship between visual overlap and comparing the difference between
the center to the variability
Vocabulary: average, box plot, data set, degree of overlap, dot plot, first quartile (Q1),
frequency, interquartile range (IQR), mean, mean absolute deviation (MAD), median,
third quartile (Q3), variability
Materials:
calculators
BLM 300 Random Numbers from 0 to 9 (p. S-75)
NOTE: Allow students to use calculators for all exercises in this lesson.
Introduce the difference between the means as a multiple of the MAD. Draw on the board:
Math:
Mean = 88.5
MAD = 6.2
Science:
Mean = 43
MAD = 8
0
10
20
30
40
50
60
70
80
90
100
SAY: Rob wrote 10 math tests and 10 science tests. These are his marks. ASK: Is Rob better at
math or science? (math) How confident are you about that? (very confident) Why are you so
confident? (because the dot plots are very different) Do these sets overlap? (no) What is the
difference between the means? (45.5) Which is larger: the largest MAD or the difference
S-58
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
between the means? (the difference between the means) SAY: Not only is the difference between
the means larger than the MAD, but in fact it is almost six times as large! Write on the board:
difference between the means 45.5
=
» 5.7
MAD
8
SAY: When the difference between the means is a large multiple of the MAD, the graphs will
have very small overlap. In this lesson, we will investigate why.
Review simulating real-world situations. Ask students to imagine that there is a bag of
100 marbles and 30 marbles are red. This could represent many real-life situations, such as
30% of the population supporting a specific candidate for an election, or 30% of the population
texting for longer than a given amount of time. ASK: How can we simulate picking a red marble
using a random number generator? (choose a random number from 1 to 100 and find the
probability that the number is between 1 and 30; or choose a random number from 1 to 10
and find the probability that the number is 1, 2, or 3) If the second option doesn’t come up,
PROMPT: Are there smaller numbers I can use instead of 30 and 100 that will have the
same probability? (3 and 10) Write on the board:
30
3
=
100 10
Tell students that instead of picking numbers from 1 to 10, you will pick numbers from 0 to 9.
SAY: The only benefit is that it is easier to search in a computer file for 0 to 9 than 1 to 10,
because the number of occurrences of 1 gets confused with the 10s. Write on the board:
0, 1, or 2 = Red marble
3, 4, 5, 6, 7, 8, or 9 = Not a red marble
SAY: 30 Grade 7 students take turns selecting a sample of 10 marbles, recording how
many marbles were red, then putting their marbles back. ASK: How can we simulate picking
10 marbles from the bag? (pick 10 random numbers from 0 to 9) Give students BLM 300
Random Numbers from 0 to 9. Tell students that each row represents a sample of 10 picks
from the bag. Have students record beside each row how many from each sample represents
a red marble (i.e., is between 0 and 2). Have students check with a partner that they have the
same data. (3, 3, 5, 3, 4, 2, 5, 0, 2, 5, 6, 2, 6, 4, 1, 4, 2, 1, 3, 2, 1, 4, 5, 3, 3, 4, 5, 1, 2, 1)
Exercises: Use the data from each row of BLM 300 Random Numbers from 0 to 9.
a) Copy and complete the frequency chart for the number of red marbles.
Number of Red Marbles
0
1
2
3
4
5
6
Frequency
b) Add the frequencies from part a). Make sure they add to 30.
c) Make a dot plot showing how many times each number of red marbles occurred.
d) Find the mean to one decimal place, then calculate the MAD of the data.
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
S-59
Selected answers:
a) Number of Red Marbles
Frequency
0
1
2
3
4
5
6
1
5
6
6
5
5
2
c)
0
1
2
3
4
5
6
d) mean = 3.1, MAD = 1.4
Comparing samples of size 10. Tell students that samples of size 10 vary a lot because they
are so small. Then tell students that another 30 samples of size 10 were picked from another
bag of marbles. We’ll call the original bag from the exercises “Bag 1” and we’ll call this new bag
“Bag 2.” Draw on the board (the dot plot for Bag 1 is the answer to part c) of the exercises above):
Bag 1:
Bag 2:
0
1
2
3
4
5
6
Exercises: Look at the dot plots for Bag 1 and Bag 2.
a) Find the mean and the MAD from Bag 2 to one decimal place.
b) Is the variability for both sets about the same?
c) Looking at the dot plots, does there appear to be a high overlap in both sets?
Answers: a) mean = 2.5, MAD = 1.1, b) yes, c) yes
Comparing samples of different sizes. When students finish, tell them that instead of plotting
the number of red marbles, they could instead plot the proportion of red marbles. SAY: So, if
4 out of the 10 marbles picked were red, then the proportion of red marbles in the sample is 0.4.
This is useful if you want to compare samples of size 10 to samples of size 30, for example. The
sample of size 30 will likely have more red marbles than the sample of size 10, but it is the
proportion of red marbles that we want to compare. Students will need to understand this
concept in order to complete Question 2 on AP Book 7.2 p. 198.
Using the means and the largest MAD to discuss overlap. Tell students that there is high
overlap when the difference between the means is smaller than the largest MAD. Refer students
to the dot plots for Bag 1 and Bag 2. ASK: What is the difference between the two means? (0.6)
S-60
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
What are the two MADs? (1.1 and 1.4) From our description of high overlap, do these dot plots
have high overlap? (yes)
Tell students that even though the means are different, when there is high overlap, it might be
just a coincidence that the means are different. You can’t be very confident that the two bags
are different at all; the difference might just be due to the fact that all samples vary. Tell students
that in fact, Bag 2 also has exactly 30 red marbles, so it’s just like taking another 30 samples of
size 10 from the same bag.
Tell students you took another 30 samples of size 10 from yet another bag. Draw on the board
(the dot plot for Bag 1 is the same as above):
Bag 1:
Bag 3:
0
1
2
3
4
5
6
7
8
9
(MP.2) ASK: Does the overlap look more or less than the other pair? (less) Do you think it’s
more likely or less likely that the two bags have the same amount of red marbles? (less likely)
Tell students that one way of describing how likely it is for two sets of samples to be from similar
populations is by describing how much they overlap. SAY: We can describe the degree of
overlap by describing how the difference between the means compares to the largest MAD.
Have students calculate the mean and MAD from Bag 3, to one decimal place. (mean = 5.4,
MAD = 1.3) Summarize the means and MADs for Bags 1 and 3 on the board, as shown below:
Bag 1: mean = 3.1, MAD = 1.4
Bag 3: mean = 5.4, MAD = 1.3
ASK: What is the difference between the means? (2.3) What is the largest MAD? (1.4) Is the
difference between the means larger or smaller than the largest MAD? (larger) So is there high
overlap? (no) Is that consistent with your prediction when you looked at the dot plots? (yes)
SAY: Because the overlap isn’t high, we can be reasonably sure that the two bags have
different amounts of red marbles. Tell students that, in fact, Bag 3 has 60 red marbles.
So, although the sample we chose had smaller than average amounts of red marbles, it was
still big enough to show the difference between the bags with 30 red marbles.
NOTE: To provide an easier version of the exercises below, you can provide students with the
means and the MADs of both data sets.
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
S-61
(MP.3, MP.4) Exercises: Calculate the means and the MADs of both data sets to one decimal
place. Is there enough evidence to believe that the populations are actually different? Explain
how you know.
a) Did 12-year-old boys in 1995 watch more TV than 12-year-old boys in 2007?
Hours of TV Watched per Day
1995:
2007:
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
b) Do boys play more video games than girls?
Minutes per Day Playing Video Games
Boys:
Girls:
0
10
20
30
40
50
60
70
80
90
100
c) Do females live longer than males?
Life Expectancies in 15 Countries (in Years)
Females:
Males:
70
75
80
85
d) Do girls spend more time on homework than boys?
Time Spent on Homework Each Week (in Hours)
Girls:
Boys:
0
S-62
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
Answers:
a) 1995 mean = 2.5, 1995 MAD = 0.9; 2007 mean = 2, 2007 MAD = 1; the difference between
the means is 0.5, which is less the largest MAD, so there is not enough evidence
b) Boys mean = 52, Boys MAD = 24; Girls mean = 21.5, Girls MAD ≈ 16.7; the difference
between the means is 30.5, which is greater than the largest MAD, so there is enough evidence
c) Female mean ≈ 79.1, Female MAD ≈ 2.1; Male mean ≈ 73.7, Male MAD ≈ 2.2; the difference
between the means is 5.4, which is greater than both MADs, so there is enough evidence
d) Girls mean = 2.5, Girls MAD = 0.7; Boys mean = 2, Boys MAD = 0.7; the difference between
the means is 0.5, which is less than both MADs, so there is not enough evidence
Writing the difference between the means as a multiple of the largest MAD. Tell students
that they can be more precise than just saying whether the difference between the means is
larger or smaller than the largest MAD. They can actually write the difference between the
means as a multiple of the largest MAD. SAY: It’s important to use the larger MAD because
the difference needs to be larger than both MADs to be confident that the difference between
the means is meaningful. Refer back to the example of the three bags of marbles. Write on
the board:
Bag 1: mean = 3.1, MAD = 1.4
Bag 2: mean = 2.5 and MAD = 1.1
Bag 3: mean = 5.4, MAD = 1.3
Mean of Bag 1 - Mean of Bag 2 0.6
=
» 0.43
largest MAD
1.4
Mean of Bag 3 - Mean of Bag 1 2.3
=
» 1.64
largest MAD
1.4
SAY: The difference between the means in Bags 1 and 2 is only half the largest MAD, so there
is a lot of overlap. The difference between the means in Bags 1 and 3 is more than the largest
MAD, so the difference in the means is meaningful. The larger that multiplier is, the more
meaningful the difference is.
Have students compare Bags 2 and 3 by finding the difference between the means as a multiple
of the largest MAD. (2.9 ÷ 1.3 ≈ 2.23) ASK: Is the difference between Bags 2 and 3 more or less
meaningful than the difference between Bags 1 and 3? (more meaningful) How do you know?
(because 2.23 is larger than 1.64)
Exercises:
a) For each situation in the exercises above, calculate the difference between the means as a
multiple of the largest MAD, to one decimal place.
b) For which situation are you most certain that the means are really different?
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability
S-63
Answers: a) part a) 0.5 ÷ 0.9 ≈ 0.6, part b) 30.5 ÷ 24 ≈ 1.3, part c) 5.4 ÷ 2.2 ≈ 2.5,
part d) 0.5 ÷ 0.7 ≈ 0.7; b) part c): life expectancies of females is generally greater than
life expectancies of males
Using box plots to decide whether the difference in medians is meaningful. Remind
students that they can also use box plots to compare degree of overlap. SAY: The more overlap
there is between samples, the harder it is to say for sure that the populations are really different.
The samples might be different just because random samples vary.
Exercises: In which case would you describe the degree of overlap as higher? In which case
would you be more confident that the populations are actually different?
a) A.
B.
b) A.
B.
Answers: a) more overlap in A, more confident B that the populations are actually different in B;
b) more overlap in B, more confident that the populations are actually different in A
Extension
0
5
10
15
20
a) Visually, how would you describe the overlap?
b) What are the medians?
c) What are the IQRs?
d) Which is larger: the difference between the medians or the largest IQR?
Answers: a) small overlap, b) both 10, c) both 6, d) the largest IQR
(MP.4) When students finish, point out that this shows a limitation in using the difference
between the medians to describe the degree of overlap; there might be very small overlap even
when the medians are exactly the same. Point out that if there is no difference between the
medians, you don’t need to decide if the difference is meaningful.
S-64
Teacher’s Guide for AP Book 7.2 — Unit 7 Statistics and Probability