Download CHAPTER 4 RANDOM VARIABLES AND DISTRIBUTIONS

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Every baby is born either male or female. So every
child is the outcome of a human binomial experiment.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_CO.indd 130
8/31/11 6:49 PM
In a family with one child, what is the chance that the child is a
girl? In a family with two children, what is the chance that both
of them are girls? What about the chance of having three girls,
or four, or five? Random variables can help describe situations
like these.
In This Chapter
You will learn how random variables and probability distributions can describe
problem situations. You’ll also learn how to use expected value to make decisions.
Specific discrete and continuous distributions, such as binomial and normal
distributions, will be covered as well as techniques for using the standard normal
distribution to compare outcomes.
Topic List
► Chapter 4 Introduction
► Creating Probability Distributions
► Interpreting Probability Distributions
► Expected Value
► Binomial Distributions
► Continuous Random Variables
► The Normal Distribution
► Standardizing Data
► Comparing Scores
► The Standard Normal Curve
► Finding Standard Scores
► Chapter 4 Wrap-Up
RANDOM VARIABLES AND DISTRIBUTIONS
131
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_CO.indd 131
8/31/11 6:49 PM
Chapter 4 Introduction
Though individual events can be random, they can
yield patterns when repeated multiple times.
Probability Distributions
The relative frequency histogram shows the proportions for the number of
boys in a family with 3 children, given the predicted probability of 50%.
According to this graph, there is a 37.5% chance of having exactly 1 boy out
of 3 children. You can also see that there is a 25% chance that all 3 children
will have the same gender (just add the heights of the first and last bar).
40
35
30
Probability (%)
It iiss of
oft
often
ten predicted that the probability that a newborn baby will be a
boy (or girl, as the case may be) is 50%. Due to the natural gender ratio,
however, the actual probability that a newborn will be a boy is not exactly
50%.
% In the United States,, for example,
p , the chances are slightly
g y higher
g
that
a newborn will be a boy rather than a girl.
Number of Boys in Family
(Predicted)
25
20
15
10
5
0
The graph for the actual number of boys in a family with 3 children,
however, tells a slightly different story. The relative frequency histogram
estimates the actual proportions considering the natural gender ratio.
0
1
2
Number of boys
3
Number of Boys in Family
(Actual)
Notice that, compared to the first graph, the chances are greater that a
family of 3 will have mostly boys (as indicated by the higher bars for 2 and
3 boys). Although there are slightly more all-boy families than would be
predicted by pure probability, the histograms are quite similar.
40
35
Probability (%)
30
25
20
15
10
5
0
0
132
CHAPTER 4
1
2
Number of boys
3
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_IN_RG.indd 132
9/17/11 2:24 AM
Chapter 4 Introduction
The
Th
he Normal Distribution
When
W
Wh
hen a ddata set has a normal distribution, a histogram that represents the
data
at set is symmetric and has a bell shape.
The Normal Distribution
The
h symmetry of this curve demonstrates that more data are near the mean
th
tha
an iin th
the outlying
tl i regions
i
than
regions.
The normal distribution has properties that are very useful in statistics.
When data are normally distributed, about 68% of the data values are
within a range of 1 standard deviation above and below the mean.
Mean
For example, suppose the average length of a full-term newborn baby
is 20.1 inches with a standard deviation of 0.4 inches. Because lengths
of babies are normally distributed, we can assume that roughly 68% of
newborn babies are between 19.7 and 20.5 inches in length. These values
are obtained by subtracting and adding 0.4 from 20.1.
68% of newborn
babies
19.3
19.7
20.1
20.5
20.9
Other properties of the normal distribution will be explored later in the chapter.
Applying It
In this chapter, you will learn how to create and
interpret probability and binomial distributions
using both tables and histograms. You will work
with real data to learn about expected value and
continuous random variables. You will also learn
about normal distributions and how the standard
normal curve is used to interpret data and make
predictions.
CHAPTER 4 INTRODUCTION
133
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_IN_RG.indd 133
9/17/11 2:24 AM
Preparing for the Chapter
Review the following skills to prepare for the concepts
in Chapter 4.
►
Find probabilities of events.
►
Interpret histograms.
►
Determine areas of figures in the coordinate plane.
x−a
► Solve linear equations in the form c = _____ .
b
► Simplify the expression 1 − x for different values of x.
Problem Set
A stack of 22 cards are numbered 1 to 22. If a card is randomly
selected, find the probability of the event.
1.
The number 4 is selected.
2.
An even number is selected.
3.
An odd number less than 8 is selected.
4.
A prime number or an even number is selected.
5.
A number less than 11 is selected.
6.
A number less than 17 or greater than 20 is selected.
The histogram shows the number of children in 32 selected families.
How many families have 5 children?
8.
How many families have 2 or more children?
9.
How many families have from 1 to 3 children?
10.
What percent of the families have from 2 to
4 children?
Number of Children per Family
12
10
Families
7.
8
6
4
2
0
1
134
CHAPTER 4
2
3
4
Children
5
6
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_IN_PS.indd 134
9/24/11 1:54 AM
Chapter 4 Introduction
Determine the area of the shaded figure.
11.
13.
x
6
7
6
5
4
3
2
1
5
4
3
2
1
y
0
y
0
12.
x
1 2 3 4 5 6 7 8 9
1 2 3 4 5 6 7
14.
x
6
5
4
3
2
1
0
x
5
4
3
2
1
y
1 2 3 4 5 6 7 8 9
0
y
1 2 3 4 5 6 7 8 9 10 11
Determine the value of x.
15.
x − 30
1.1 = ______
3
17.
x − 120
−1.15 = _______
4.8
16.
x − 1.22
2.05 = ________
0.1
18.
x − 450
−2.5 = _______
12
Find the value of the expression 1 − x for the given value of x.
19.
3
x = __
8
21.
2
x = __
9
23.
x = 0.381
20.
x = 0.92
22.
x = 0.3072
24.
x = 0.0011
CHAPTER 4 INTRODUCTION
135
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_IN_PS.indd 135
9/17/11 2:29 AM
Creating Probability
Distributions
Probabilities of all outcomes of a discrete random
variable can be summarized in a table.
Creating Probability Distribution Tables
A study of a new treatment for high cholesterol undergoes clinical testing
with the results summarized in this frequency table.
X
0–9
10–19
20–29
30–39
40–49
f
81
73
35
47
14
In this table, X stands for the number of points that the patient’s cholesterol level
decreased, and f stands for the number of patients. For instance, 73 patients had
their cholesterol decrease between 10 and 19 points after the new treatment.
A patient wants to know the probability that the new treatment will lower
his cholesterol from 20 to 29 points. Here, a probability distribution table
would be useful because it shows the probabilities for all of the outcomes.
REMEMBER
The letter X is used to represent
a discrete random variable. For
example, when a coin is tossed,
the variable X would represent
the two outcomes: heads and
tails. These outcomes are both
discrete and random.
HOW TO CREATE A PROBABILITY DISTRIBUTION TABLE
To create a probability distribution table from a frequency table
Step 1 Find the sum of the frequencies.
Step 2 Divide each frequency by the sum found in Step 1.
Step 3 Replace each frequency in the table with the ratio found in Step 2.
Step 4 Change the label f to P.
For the frequency table describing the clinical testing, the total number of
frequencies is 81 + 73 + 35 + 47 + 14 = 250. So 250 patients took part in
the study. Here is the complete probability distribution table.
X
0–9
10–19
20–29
30–39
40–49
THINK ABOUT IT
P
0.324
0.292
0.14
0.188
0.056
The sum of the probabilities in
a probability distribution table
always equals one.
According to this table, the probability that the new treatment will lower a
patient’s cholesterol between 20 and 29 points is 0.14 or 14%.
136
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T01_RG.indd 136
9/17/11 2:33 AM
In a different study, patients are separated into four groups according
to a spinner.
4
For the spinner, there are four different outcomes:
Group 1, Group 2, Group 3, and Group 4. There are
6 sections overall.
1
2
Notice that some outcomes are more likely than
others. For instance, there are two sections for groups
1 and 2, but only one section for groups 3 and 4.
2
1
3
The probability distribution table shows that the chances of being in Group 1
or Group 2 are higher than the chances of being in Group 3 or Group 4.
X
P
1
2
3
4
1
__
1
__
1
__
1
__
3
3
6
6
Properties of Probability Distributions
There are two properties that every probability distribution table has.
PROPERTIES OF PROBABILITY DISTRIBUTIONS
Every probability distribution for a discrete random variable has the
following properties:
• Each probability is a number between 0 and 1.
• The sum of all probabilities is equal to 1.
This probability distribution table violates the first property because each
probability is not a number between 0 and 1. So it is not a probability
distribution.
X
1
2
3
4
5
P
0.5
0.1
0.3
0.4
−0.3
THINK ABOUT IT
Every probability distribution for
a discrete random variable must
satisfy both properties.
This probability distribution table violates the second property because
the sum of all probabilities is not equal to 1. So it is not a probability
distribution.
X
5
10
15
20
25
30
P
0.34
0.5
0.101
0.32
0.34
0.122
CREATING PROBABILITY DISTRIBUTIONS
137
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T01_RG.indd 137
9/24/11 1:57 AM
Problem Set
A survey asked a group of people the size of their households.
The results are in the table.
Household Size
1
2
3
4
5
ƒ
4
12
6
2
1
1.
How many people were surveyed?
2.
How many people reported a household size of 4?
3.
Complete the probability distribution table.
1
Household Size
2
3
4
5
0.08
P
Match the frequency table with the probability distribution table.
A.
4.
138
X
P
0
B.
X
P
0.1
0
1
0.12
2
C.
X
P
0.1
0
1
0.2
0.32
2
3
0.16
4
0.3
X
ƒ
0
D.
X
P
0.2
0
0.2
1
0.24
1
0.2
0.3
2
0.1
2
0.2
3
0.1
3
0.26
3
0.2
4
0.3
4
0.2
4
0.2
X
ƒ
X
ƒ
X
ƒ
5
0
10
0
10
0
5
1
10
1
12
1
10
1
6
2
15
2
5
2
10
2
16
3
5
3
13
3
10
3
8
4
15
4
10
4
10
4
15
CHAPTER 4
5.
6.
7.
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T01_PS.indd 138
9/17/11 2:45 AM
Create a probability distribution table.
8.
6
10.
In a medical study, the heights of a group of
men were measured to the nearest inch. There
were 120 men with heights of 60–64 inches,
280 men with heights of 65–69 inches, 312 men
with heights of 70–74 inches, and 162 men with
heights of 75–79 inches.
11.
The Mosquitoes are playing the Gnats in a
three-game series. The probability is 0.25 that
the Gnats will win all 3 games, 0.4 that they will
win 2 of the games, 0.2 that they will win 1 of
the games, and 0.15 that they will not win any
of the games.
12.
What is the probability of rolling a 1, 2, 3, 4, 5,
or 6 when rolling a 6-sided number cube?
13.
There is a 40% chance it will rain tomorrow, a
30% chance it will snow, and a 30% chance it
will be partly cloudy.
5
4
3
2
1
0
Yellow
9.
Red
Orange
Blue
Green
10
9
8
7
6
5
4
3
2
1
0
Mon
Tue
Wed
Thu
Fri
Sat
Sun
Explain why the table is not a probability distribution.
14.
Result
P
0
15.
Length
P
0.25
5–14
0.25
1
0.45
15–24
-0.3
2
0.0
25–34
0.4
3
0.35
35–44
0.5
45–54
0.15
Find the value of n using the probability distribution table.
16.
Color
P
red
blue
1
__
1
__
3
8
gold
silver
n
1
__
6
17.
Number
of Guests
per Room
0
1
2
3
4
P
0.42
0.31
0.16
n
0.09
Solve.
18.
Challenge Create a probability distribution for rolling two 6-sided number cubes,
where X is the sum of the numbers on the number cube, and P is the probability of
obtaining the sum.
CREATING PROBABILITY DISTRIBUTIONS
139
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T01_PS.indd 139
9/24/11 1:58 AM
Interpreting Probability
Distributions
The likelihood of many events can be summarized
by a table or histogram.
Interpreting Probability Distribution Tables
A summer camp program director is planning activities for the summer.
The probability distribution table shows the probability P that the age of a
randomly chosen camper is X.
X
10
11
12
13
14
P
0.15
0.3
0.2
0.25
0.1
For the first activity, the program director selects a camper at random to
read announcements. What is P (X = 13): the probability that the camper
selected is 13 years old?
According to the probability distribution table, when X = 13, the probability
is P = 0.25. So, P(X = 13) = 0.25.
In other words, the probability of selecting a 13-year-old camper at random
is 0.25, or 25%.
Suppose the program director wants to determine P(X ≥ 13), the probability
that a randomly selected camper is at least 13 years old.
This can be answered by adding individual probabilities.
P (X ≥ 13) = P (X = 13) + P (X = 14)
= 0.25 + 0.1
= 0.35
There is a 35% probability of selecting a camper who is at least 13 years old.
140
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T02_RG.indd 140
9/24/11 2:00 AM
Interpreting Probability Distribution Histograms
The program director now has to decide which staff members to assign to
which activities. For this, she is using this probability distribution histogram.
P
Ages of Staff Members
0.5
REMEMBER
0.4
A histogram is a graph showing
the frequency distribution
of data.
0.3
0.2
0.1
0
6
12
18
24
30
36
42
48
54
X
In this histogram, X stands for the ages of individual staff members.
P is the probability that if a single staff member is chosen at random,
then the staff member will be that age.
For the first activity, the program director selects a staff member at random
to lead the songs at dinner. What is P (18 ≤ X < 24): the probability that the
staff member selected is at least 18 years old and less than 24 years old?
The first bar of the histogram represents 18 ≤ X < 24. The height of this
bar is at the value P = 0.2. So the probability of selecting a staff member
who is at least 18 years old and less than 24 years old is 0.2%, or 20%.
Probability distribution histograms can help determine more than just
single entries. Suppose the program director wants to determine P (X ≥ 30),
the probability that a randomly selected staff member is at least 30 years old.
This can be answered by adding individual probabilities.
P (X ≥ 30) = P (30 ≤ X < 36) + P (36 ≤ X < 42)
= 0.3 + 0.1
= 0.4
There is a 40% probability of selecting a staff member who is at least
30 years old.
INTERPRETING PROBABILITY DISTRIBUTIONS
141
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T02_RG.indd 141
9/24/11 2:00 AM
Problem Set
Use the probability distribution table to find the probability of the event.
X
0
1
2
3
4
5
P
0.3
0.05
0.1
0.15
0.15
0.25
1.
P (X = 3)
5.
P (X > 2 or X = 0)
2.
P (X = 2 or X = 0)
6.
P (X ≠ 3)
3.
P (1 < X ≤ 3)
7.
P (X < 4)
4.
P (0 < X < 3)
8.
P (X ≥ 4)
Use the probability histogram to find the probability of the event.
0.25
0.2
P
0.15
0.1
0.05
0
1
2
3
4
X
5
6
7
P (X = 6)
13.
P (X = 1 or X > 3)
10.
P (X = 1 or X = 3)
14.
P (X ≠ 6)
11.
P (1 < X ≤ 6)
15.
P (X > 4)
12.
P (0 < X < 4)
16.
P (X ≤ 5)
9.
The probability distribution histogram shows the weight distribution of
hamadryas baboons that live in a certain region. Find the probability of
the event.
Weights of Hamadryas Baboons
0.3
0.25
P
0.2
0.15
0.1
0.05
5
10 15
20 25 30
Weight (lb)
35
40
45
17.
P (5 ≤ X < 20)
20.
P (X < 30)
18.
P (25 ≤ X < 35)
21.
P (X < 15)
19.
P (X < 15 or X ≥ 25)
22.
P (X < 20 or X ≥ 30)
142
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T02_PS.indd 142
9/17/11 2:58 AM
Each week, Evan randomly picks a weekday (Monday through Friday)
to visit the library.
23.
Create a probability distribution table for this
situation. What is the probability that he does
not visit the library on Friday?
24.
Let X be the number of times he visits the library
on Friday for the next two weeks. Create a
probability distribution table for X. Find P (X > 0).
A message center receives calls from 6 p.m. until 6 a.m. Typically, 50%
of the calls come in between 6 p.m. and 9 p.m.; 25% of the calls come
in between 9 p.m. and midnight; 20% of the calls come in between
midnight and 3 a.m.; and 5% of the calls come in between 3 a.m.
and 6 a.m.
25.
What is the probability that a call will come in
between 6 p.m. and 9 p.m.?
26.
What is the probability that a call will come in
between 6 p.m. and midnight?
27.
What is the probability that a call will come in
between midnight and 6 a.m.?
28.
What is the probability that a call will come in
between 9 p.m. and 6 a.m.?
Use the Weights of Hamadryas Baboons probability histogram to find
the probability of the event.
29.
Challenge Two hamadryas baboons from the region are
randomly selected. What is the probability that both weigh less
than 30 pounds?
INTERPRETING PROBABILITY DISTRIBUTIONS
143
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T02_PS.indd 143
9/17/11 2:58 AM
Expected Value
Probability distributions can tell us what to expect
when all outcomes are considered.
Using Probability for Estimation
Each week the manager at a large restaurant orders eggs that are used
for breakfast items at the restaurant. The probability distribution table
shows the probability P that a customer will order a breakfast item with
X number of eggs.
The food manager expects about 800 customers
to have breakfast each week at the restaurant.
How many of these customers are expected to
order breakfast items with two eggs?
According to the probability distribution table,
the probability that a customer will order a
breakfast with two eggs is 0.42, or 42%.
X
P
0
0.29
1
0.18
2
0.42
3
0.11
REMEMBER
In a probability distribution
table, the sum of all probabilities
is equal to 1, and each individual
probability is a number between
0 and 1.
Out of 800 patrons, 42% are expected to order
breakfast items with two eggs. So about 0.42 · 800 = 336 customers
are expected to order breakfast items with two eggs.
Finding Expected Value
The prediction above is based on a single outcome—that a customer will
order a breakfast item that has two eggs.
What if the restaurant manager wants to know the average number of eggs
for all breakfast items served at the restaurant?
The manager needs to know the expected value—the expected number of
eggs in a typical breakfast served at the restaurant.
EXPECTED VALUE
Suppose each outcome of a random variable X is x1, x2, . . . ., xn and has
probability p1, p2, . . . . , pn, respectively. The average, or expected value,
of X is
E(X ) = x1 · p1 + x2 · p2 + . . . + xn · pn.
144
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T03_RG.indd 144
9/24/11 2:05 AM
The expected value is the sum of the products of the individual
probabilities and their respective probabilities.
E(X ) = x1 · p1 + x2 · p2 + x3 · p3 + x4 · p4
= 0 · (0.29) + 1 · (0.18) + 2 · (0.42) + 3 · (0.11)
= 0 + 0.18 + 0.84 + 0.33
= 1.35
The expected value of 1.35 means that the food manager can expect that
each breakfast ordered will have 1.35 eggs on average.
THINK ABOUT IT
With an expected value of
1.35 eggs per breakfast, the
manager can predict that about
1.35 · (800) = 1080 eggs will be
consumed in a week.
Solving Problems with Expected Value
The recommended amount of riboflavin per day is 1.7 milligrams. Suppose
there is a 30% chance that a person will take 0 milligrams of riboflavin
per day; a 40% chance that a person will take 1 milligram of riboflavin per
day; a 20% chance that a person will take 2 milligrams of riboflavin per
day; a 5% chance that a person will take 3 milligrams of riboflavin per day;
and a 5% chance that a person will take 4 milligrams of riboflavin per day.
What is the expected value of riboflavin intake per day?
BY THE WAY
Riboflavin is also known as
vitamin B2.
E(X ) = x1 · p1 + x2 · p2 + x3 · p3 + x4 · p4
= 0 · (0.3) + 1 · (0.4) + 2 · (0.2) + 3 · (0.05) + 4 · (0.05)
= 0 + 0.4 + 0.4 + 0.15 + 0.2
= 1.15
So the expected value of riboflavin intake for a single person is about
1.15 milligrams per day. This value is below the recommended amount
of 1.7 milligrams per day.
EXPECTED VALUE
145
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T03_RG.indd 145
9/24/11 2:05 AM
Problem Set
The table shows the distribution of household sizes in a certain city.
1.
2.
3.
If there were 6000 households in the city,
how many households would you expect to be
of size 1?
Household Size
P
1
0.16
If there were 4000 households in the city,
how many households would you expect to be
of size 4?
2
0.48
3
0.24
What is the expected value of household size in
this city?
4
0.08
5
0.04
Determine the expected value of X.
4.
5.
146
X
P
6.
X
P
0
8.
X
P
0.1
1998
0.42
1
0.1
1999
0.2
0
1
__
1
1
__
3
2
0.1
2000
0.15
2
1
__
3
0.1
2001
0.1
1
__
4
0.1
2002
0.13
3
5
0.5
X
P
X
P
0
0.5
1998
0.2
1
0.1
2000
0.2
X
3
6
6
P
7.
9.
0
1
__
6
1
__
3
2
0.1
2002
0.2
12
1
__
3
0.1
2004
0.2
4
0.1
2006
0.2
18
1
__
5
0.1
CHAPTER 4
3
6
6
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T03_PS.indd 146
9/17/11 3:47 AM
Solve.
10.
A clothing store has a section of jeans on sale.
There is a 30% chance of picking a pair with a
30-inch inseam; a 25% chance of picking a pair
with a 32-inch inseam; a 10% chance of picking
a pair with a 34-inch inseam; a 10% chance of
picking a pair with a 36-inch inseam; a 10% chance
of picking a pair with a 38-inch inseam; and a
15% chance of picking a pair with a 40-inch inseam.
Create a probability distribution table and find the
expected value of the inseam length of the jeans.
11.
Raina is entering a 64-player cribbage
tournament. The probability of her playing
1
1 ; exactly 2 matches is __
exactly 1 match is __
2
4;
1 ; exactly 4 matches is ___
1
exactly 3 matches is __
8
16 ;
1
exactly 5 matches is ___
32 ; and exactly 6 matches
1
is ___
32 . Create a probability distribution table for
this situation, and find the expected value of the
number of matches Raina will play.
The Columbus Crew of Major League Soccer had the following results last year.
Number of Goals
P
0
1
2
3
1
__
1
__
3
___
1
__
3
5
10
12.
What is the expected value of the number of
goals the Columbus Crew will score in a match?
13.
If the Columbus Crew played 30 matches,
convert the probability distribution table into a
frequency table.
6
14.
Determine the mean number of goals the
Columbus Crew scored in a match. Explain why
this is greater than, less than, or equal to the
expected value.
Use the probability distribution table.
X
P
6
m
8
1
__
n
1
__
15.
2
4
Challenge If the expected value of this distribution is 8.5, find the
value of m and n.
EXPECTED VALUE
147
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T03_PS.indd 147
9/17/11 3:47 AM
Binomial Distributions
A table or graph can be used to represent outcomes
of a variable that has a binomial distribution.
Determining Whether a Variable Has a Binomial
Distribution
A doctor knows that for a certain type of adult migraine headache,
Medicine Q works about 70% of the time. If the medicine is tried with two
different patients with this condition, the resulting probability distribution is
a binomial distribution.
BINOMIAL DISTRIBUTION
A variable has a binomial distribution if
• There are a fixed number of independent trials.
• The outcome of each trial is success or failure.
• The probability of success for each trial is the same.
This situation meets all of the conditions for a binomial distribution.
• There are a fixed number of independent trials; there are two trials, and
each patient taking Medicine Q is independent of the other.
• The outcome of each trail is a success or failure; the medicine will help
the migraine (success) or it will not (failure).
THINK ABOUT IT
Two or more coin tosses have
a binomial distribution. The
probability of success and failure
is the same—50%.
• The probability of success for each trial is the same; there is a 0.7
probability of success when a patient takes Medicine Q.
Creating Binomial Distributions
Suppose later in the day, the doctor is seeing two patients who have this
type of adult migraine headache. We can create a binomial distribution table
or graph to represent outcomes for these two patients.
The variable X can take on three different values: Medicine Q will work for
none of the patients, one of the patients, or both patients. So the variable X is
equal to 0, 1, or 2.
148
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T04_RG.indd 148
9/17/11 3:43 AM
When X = 0, it means that Medicine Q failed to work for both patients.
Since P (success) = 0.7, P (failure) = 0.3.
REMEMBER
If A and B are independent events,
then P(A and B) = P(A) · P(B).
P(X = 0) = P (failure) · P (failure)
= 0.3 · 0.3
= 0.09
When X = 1, it means that the medicine worked for one of the patients,
but not both.
P(X = 1) = P (success) · P (failure) + P (failure) · P (success)
= 0.7 · 0.3 + 0.3 · 0.7
= 0.21 + 0.21
= 0.42
When X = 2, it means that the medicine worked for both patients.
P(X = 0) = P (success) · P (success)
= 0.7 · 0.7
= 0.49
We can now use the probabilities that we calculated for each outcome to
create a binomial distribution table and a binomial distribution histogram.
Binomial Distribution Table
X
0
1
2
P
0.09
0.42
0.49
Binomial Distribution Histogram
P
0.55
0.5
0.45
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
0
1
2
X
Interpreting Binomial Distributions
The doctor would like to be successful with at least one of the two patients.
In the language of probability, the question becomes what is P (X ≥ 1)?
The answer to this question is found by adding the appropriate probabilities
in the table.
P(X ≥ 1) = P (X = 1) + P (X = 2)
= 0.42 + 0.49
= 0.91
There is a 91% probability that Medicine Q will be successful with at least
one of the patients.
BINOMIAL DISTRIBUTIONS
149
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T04_RG.indd 149
9/17/11 3:43 AM
Problem Set
Determine whether the variable X has a binomial distribution. If it has
a binomial distribution, then give P (success) and the number of trials.
If X does not have a binomial distribution, then explain why.
1.
When a coin is tossed, X is the number of tosses
it takes to get tails.
2.
Over the next 4 days, X is the number of days it
will rain.
3.
When a coin is tossed 4 times, X is the number
of tails.
4.
When a 6-sided number cube is rolled 3 times,
X is the number of 2s rolled.
5.
A bag contains a red, a yellow, and a blue marble.
When a marble is selected 2 times in a row
without replacement, X is the number of yellow
marbles.
6.
A bag contains a red, a yellow, and a blue marble.
When a marble is selected from the bag 3 times
in a row with replacement, X is the number of
yellow marbles.
Match the probability histogram with the description of the binomial
distribution. Assume that there are 2 trials.
A.
P (success) = 0.8
C.
P (success) = 0.2
B.
P (success) = 0.7
D.
P (success) = 0.3
7.
9.
0.7
0.5
0.6
0.4
0.5
0.3
0.4
P
P
0.6
0.2
0.3
0.1
0.2
0
0.1
0
8.
1
X
2
0
10.
1
X
2
0
1
X
2
0.7
0.5
0.6
0.4
0.5
0.3
0.4
P
P
0.6
0
0.2
0.3
0.1
0.2
0.1
0
0
150
CHAPTER 4
1
X
2
0
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T04_PS.indd 150
9/17/11 4:01 AM
Suppose the dial on the spinner is spun 2 times in a row. Create a
probability histogram for the variable X.
A
B
C
11.
X is the number of times the dial lands on
section A.
13.
X is the number of times the dial lands on
section A or C.
12.
X is the number of times the dial lands on
section C.
14.
X is the number of times the dial lands on
section B or A.
Create a probability distribution table for the variable X. Then use the
table to find the indicated probability.
15.
When a coin is tossed twice, X is the number of
tails. Find P (X < 2).
16.
A basketball player’s probability of making a
9
free throw is ___
10 . When the player shoots 2 free
throws in a row, X is the number of free throws
made. Find P (X = 2).
17.
When a coin is tossed 3 times, X is the number of
tails. Find P (X > 1).
18.
A basketball player’s probability of making a
9
free throw is ___
10 . When the player shoots 3 free
throws in a row, X is the number of free throws
made. Find P (X = 2).
19.
A basketball player’s probability of making a
free throw is 0.6. When the player shoots 2 free
throws in a row, X is the number of free throws
made. Find P (X = 1).
20.
A basketball player’s probability of making a
free throw is 0.6. When the player shoots 3 free
throws in a row, X is the number of free throws
made. Find P (1 ≤ X < 3).
Use the spinner to solve.
21.
D
A
C
B
Challenge Suppose the dial on the spinner is spun 3 times in a
row. Find the probability that the dial will land 2 or more times
on section B.
BINOMIAL DISTRIBUTIONS
151
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T04_PS.indd 151
9/17/11 4:01 AM
Continuous Random
Variables
Areas within probability distributions can be used
to compute probability.
Exploring Continuous Random Variables
An incubator is being used to hatch chickens. The incubator temperatures X,
in degrees Fahrenheit, are distributed according to this probability distribution.
When all values of a random variable X
are equally likely to occur, the resulting
probability distribution is called a
uniform probability distribution.
For the incubator, all of the values
of X, from 100°F to 104°F are
equally likely to occur. Because any
temperature between these values,
such as 101.5°F or 103.9°F, may occur,
X, in this case, is considered to be a
continuous random variable.
P
0.5
Q&A
Uniform Probability
Distribution
Q
How is a discrete random
variable different from a
continuous random variable?
A
A discrete random variable
has a limited number of
values, while a continuous
random variable has an
infinite number of values.
0.4
0.3
0.2
0.1
X
0
99 100 101 102 103 104 105
CONTINUOUS RANDOM VARIABLE
A variable X is a continuous random variable if it can take on any value
in the possible range of the random variable.
Using Area to Determine Probability
When X is a continuous random variable, the resulting probability
distribution is a geometric shape with a total area of 1. The probability that
a ≤ X ≤ b will be the area of the shape bordered by the values a and b.
For instance, suppose the temperature of the incubator is taken at 10 a.m.
What is the probability that the temperature is between 101°F and 103°F?
152
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T05_RG.indd 152
9/17/11 4:06 AM
To answer this question, you can
compute the area of the rectangular
region where 101 ≤ X ≤ 103, which is
shown here.
The area of the rectangular region is
the product of the base and the height:
2 · 0.25 = 0.5. There is a 50%
probability that the temperature will be
between 101°F and 103°F degrees at
10 a.m.
P
REMEMBER
0.5
The area of the graph of a
probability distribution is
equal to 1.
Rectangular
region
0.4
0.3
0.2
0.1
X
0
99 100 101 102 103 104 105
Using Continuous Nonuniform Probability Distributions
Here is another continuous probability distribution that is not uniform and is
in the shape of a triangle.
P
Nonuniform Probability Distribution
0.6
0.4
0.2
0
X
1
2
3
4
5
6
Suppose we want to know P(1 ≤ X ≤ 4), the probability that X is between
1 and 4.
P
0.6
0.4
0.2
Trapezoidal
region
0
X
1
2
3
4
5
6
The region bounded by X = 1 and X = 4 is a trapezoid with base lengths 0.1
and 0.4. The height of the trapezoid is 3.
1
Area = __
2 h ( base1 + base2)
1
= __
2 · 3 (0.1 + 0.4)
= 0.75
So the probability that X is between 3 and 4 is 0.75, or 75%.
CONTINUOUS RANDOM VARIABLES
153
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T05_RG.indd 153
9/17/11 4:06 AM
Problem Set
Use the uniform probability distribution to solve.
P
0
1
2
3
4
X
1.
What is the area of the distribution?
7.
Determine P (1.5 ≤ X ≤ 3).
2.
What is the height of the distribution?
8.
Determine P (1.5 < X ≤ 3).
3.
What is the area of the part of the region where
X ≤ 3?
9.
If P (X < a) = 0.5, what is the value of a?
4.
What is the area of the part of the region where
1.5 ≤ X ≤ 3?
10.
If P (X ≤ a) = 0.5, what is the value of a?
11.
2 , what is the value of a?
If P (X < a) = __
3
5.
Determine P (X ≤ 3).
6.
Determine P (X < 3).
12.
1 , what is the value of a?
If P (X ≤ a) = __
6
The weights X of an animal are distributed according to the probability
distribution shown.
X
147
148
149
150
151
152
153
13.
Suppose the animal is weighed at 11 a.m. What is probability that
the animal’s weight will be less than 150 pounds?
14.
Suppose the animal is weighed at 2 p.m. What is probability that the
animal’s weight will be less than 151 pounds?
15.
Suppose the animal is weighed in the morning. What is probability
1
that the animal’s weight will be within __
2 pound of 150 pounds?
16.
Suppose the animal is weighed in the afternoon. What is probability
3
that the animal’s weight will be within __
4 pound of 149 pounds?
154
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T05_PS.indd 154
9/17/11 4:17 AM
Use the probability distribution to solve.
P
0.3
0.2
0.1
0
1
2
3
4
5
6
7
8
X
17.
What is the area of the distribution?
23.
Determine P (1.5 < X ≤ 6.5).
18.
Determine P (X < 3).
24.
Determine P (X ≤ 8).
19.
Determine P (X ≤ 3).
25.
If P (X > a) = 0.5, what is the value of a?
20.
Determine P (X < 3 or X > 5).
26.
If P (X ≤ a) = 0.5, what is the value of a?
21.
Determine P (X ≤ 3 or X > 6).
27.
If P (X ≤ a) = 0.7, what is the value of a?
22.
Determine P (3 < X < 4).
28.
If P (X < a) = 0.7, what is the value of a?
A random number generator generates real numbers from 0 to 3,
according to this probability distribution.
29.
30.
What is the probability that a number between
1 and 2 is generated?
What is the probability that a number between
2 and 3 is generated?
P
0.6
0.5
31.
What is the probability that a number less than
2 is generated?
0.4
32.
What is the probability that a number between
0.5 and 2.5 is generated?
0.3
33.
What is the probability that a number less than
1 or greater than 1.5 is generated?
0.2
34.
Challenge Suppose the probability is 0.4 that
0.1
a number less than a is generated. What is the
value of a?
0
1
2
3
4
X
CONTINUOUS RANDOM VARIABLES
155
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T05_PS.indd 155
9/17/11 4:17 AM
The Normal Distribution
The bell-shaped curve is often used for data analysis
and probability.
Using the Normal Distribution
When a data set has a normal distribution, a histogram that represents the
data set is symmetric and has a bell shape.
The Normal Distribution
FACT
In a normal distribution, the
mean, median, and mode all have
approximately the same value.
Mean
When a curve is drawn through the tops of the bars, the curve is in the shape
of a bell.
Normal distributions have important properties that help us study data sets
that are otherwise too large to study.
For example, the mean is always at the center of the distribution, so about
half of the data set is above the mean and half of the data set is below
the mean.
The standard deviation of a normal distribution also gives us useful
information about the data set.
PROPERTIES OF A NORMAL DISTRIBUTION
If a data set has a normal distribution, then about
• 68% of the data is within one standard deviation of the mean.
• 95% of the data is within two standard deviations of the mean.
• 99.7% of the data is within three standard deviations of the mean.
156
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T06_RG.indd 156
9/17/11 4:22 AM
The properties involving standard deviation are illustrated in more detail
in this diagram.
The Normal Distribution
99.7%
95%
REMEMBER
The Greek letters μ and σ are
used to represent the mean and
standard deviation of a data set,
respectively.
68%
2.35% 13.5%
34%
13.5% 2.35%
34%
μ - 3σ μ - 2σ μ - 1σ
μ
μ + 1σ μ + 2σ μ + 3σ
For example, we know that about 68% of the data is within one standard
deviation of the mean. This means that about 34% is one standard deviation
above the mean, and 34% of the data is one standard deviation below
the mean.
Using Normal Distribution Properties
A pediatrician read an article that states that the length of the flu incubation
period is distributed normally with a mean length of 7 days and a standard
deviation of 2 days. Afterward, he wonders what the probability is of the flu
incubating for at least 9 days.
Assuming that flu incubation periods are normally distributed, a normal
curve can be drawn to represent this situation.
REMEMBER
In a normal distribution, about
50% of the data is above the
mean, and about 50% of the data
is below the mean.
Flu Incubation Period
50%
3
5
34%
7
Number of days
9
11
We know that 9 days is one standard deviation above the mean of 7 days.
So, 34% of the data is between 7 days and 9 days. Also, 50% of the data is
below the mean.
This means that 50% + 34% = 84% of flu cases have an incubation period
of at most 9 days. By subtracting 84% from 100%, we can obtain the
percent of cases that have an incubation period of at least 9 days.
Flu Incubation Period
16%
3
5
7
Number of days
9
11
Therefore, the probability is about 16% that a flu case will have an
incubation period of at least 9 days.
THE NORMAL DISTRIBUTION
157
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T06_RG.indd 157
9/17/11 4:22 AM
Problem Set
Suppose normal human body temperatures are normally distributed
with a mean of 37°C and a standard deviation of 0.2°C.
1.
What temperature is one standard deviation below the mean
temperature?
2.
What temperature is three standard deviations above the mean?
3.
What percent of humans have a temperature above 37°C?
4.
What percent of humans have a temperature below 37.2°C?
5.
What percent of humans have a temperature below 36.8°C?
6.
What percent of humans have a temperature between 36.8°C
and 37.2°C?
7.
What percent of humans have a temperature between 36.6°C
and 37.4°C?
8.
What percent of humans have a temperature between 37°C
and 37.4°C?
Suppose that the weights of 5400 registered female Labrador retrievers
in the United States are distributed normally with a mean of 62.5 pounds
and a standard deviation of 2.5 lb.
9.
What percent of the Labrador retrievers weigh less than 62.5 pounds?
10.
What percent of the Labrador retrievers weigh less than 65 pounds?
11.
What percent of the Labrador retrievers weigh between 60 pounds
and 65 pounds?
12.
What percent of the Labrador retrievers weigh between 60 lb and
62.5 pounds?
13.
Approximately how many of the Labrador retrievers weigh less
than 62.5 pounds?
14.
Approximately how many of the Labrador retrievers weigh less
than 65 pounds?
15.
Approximately how many of the Labrador retrievers weigh between
60 pounds and 65 pounds?
16.
Approximately how many of the Labrador retrievers weigh between
60 pounds and 62.5 pounds?
158
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T06_PS.indd 158
9/17/11 5:48 AM
Suppose that the number of petals on 2400 fully developed roses
is normally distributed with a mean of 32 petals and a standard
deviation of 2 petals.
17.
Approximately how many flowers have fewer than 32 petals?
18.
Approximately how many flowers have between 30 and 34 petals?
19.
Approximately how many flowers have between 28 and 32 petals?
20.
Approximately how many flowers have between 26 and 36 petals?
21.
Approximately how many flowers have 30 or more petals?
22.
Approximately how many flowers have fewer than 34 petals?
Suppose a factory produces 50,000 packages of mints each week.
Assume that the number of mints in a package is normally distributed
with a mean of 120 mints and a standard deviation of 3 mints.
23.
Approximately how many packages each week have between
120 and 129 mints?
24.
Approximately how many packages each week have between
111 and 117 mints?
25.
Approximately how many packages each week have more than
123 mints?
26.
Approximately how many packages each week have fewer than
126 mints?
Suppose a manufacturer makes nails that are normally distributed with
a length of 150 millimeters and a standard deviation of 1 millimeter.
27.
Challenge In a certain batch of nails, 3900 nails are between
28.
Challenge In a certain batch of nails, 25 nails are longer than
149 millimeters and 151 millimeters long. What is the approximate
number of nails in the entire batch? Round your answer to the
nearest whole number.
153 millimeters. What is the approximate number of nails in the
entire batch? Round your answer to the nearest whole number.
THE NORMAL DISTRIBUTION
159
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T06_PS.indd 159
9/17/11 5:48 AM
Standardizing Data
A standard score tells us how far a data value is from
the mean.
Converting Raw Scores into z-Scores
Antibiotics are strong medicines that are prescribed by a doctor and often
used to treat infections. Once a patient begins taking an antibiotic, the
length of time that it takes to cure the infection can vary.
Suppose that the time it takes a certain antibiotic to cure an infection is
known to be normally distributed with a mean of 5.2 days and a standard
deviation of 1.1 days.
To get an idea of how effective the new antibiotic is with certain patients,
actual results can be converted into z-scores. A z-score tells us the number
of standard deviations that a data value is from the mean of the data set.
When a data value (called a raw score) is greater than the mean, its
corresponding z-score will be positive. When a data value is less than the
mean, its corresponding z-score will be negative.
REMEMBER
z-scores are sometimes called
standard scores.
CONVERTING RAW SCORES INTO z-SCORES
A z-score is the number of standard deviations that a data value is from
the mean. Every data value in a data set has a corresponding z-score and
is obtained by the formula
x−μ
z = _____
σ .
In this formula, z is the z-score, x is the raw data value, μ is the mean of
the data set, and σ is the standard deviation.
Suppose the antibiotic cures an infection in a patient in 3.8 days. What is
the corresponding z-score?
Here, x = 3.8, μ = 5.2, and σ = 1.1. These values can be substituted into
the formula to find the corresponding z-score.
3.8 − 5.2
z = ________
1.1
≈ −1.3
A z-score of −1.3 means that the antibiotic cures the infection in a time that
is 1.3 standard deviations below the mean.
160
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T07_RG.indd 160
9/17/11 4:31 AM
Converting z-Scores into Raw Scores
Suppose a patient takes the antibiotic and is cured of the infection according
to a z-score of 2.3. From what we know about z-scores, this means that the
patient was cured in an amount of time that is about 2.3 standard deviations
greater than the mean cure time.
But suppose we want to know how many days it took the antibiotic to cure
the infection in this patient. In other words, what is the raw score if the
z-score is 2.3?
The z-score formula can be rearranged to create a formula for converting
z-scores back into raw scores.
REMEMBER
A raw score refers to the original
data value.
CONVERTING z-SCORES INTO RAW SCORES
A z-score can be converted back into its original value, called a raw score,
using the formula
x = z ∙ σ + μ.
In this formula, x is the raw score, z is the z-score, μ is the mean of the
data set, and σ is the standard deviation.
Now, we can convert the z-score of 2.3 back into a raw score. Using z = 2.3,
μ = 5.2, and σ = 1.1, we can find the value of the raw score, x.
x = 2.3 · (1.1) + 5.2
≈ 7.7
For this patient, the antibiotic cured the infection in about 8 days—more than
2 days longer than the average cure time for this antibiotic.
STANDARDIZING DATA
161
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T07_RG.indd 161
9/17/11 4:31 AM
Problem Set
Suppose a normal distribution has a mean of 20 and a standard
deviation of 4.
1.
A value of 24 is how many standard deviations away from the mean?
2.
A value of 26 is how many standard deviations away from the mean?
3.
What is the z-score of 22?
4.
What is the z-score of 13?
5.
What is the z-score of 30?
6.
What is the z-score of 0?
7.
What is the z-score of a value that is 1.43 standard deviations greater
than the mean?
8.
What is the z-score of a value that is 0.52 standard deviations less
than the mean?
Suppose the ages of cars driven by employees at a company are
normally distributed with a mean of 8 years and a standard deviation
of 3.2 years.
9.
What is the z-score of a car that is 12 years old?
10.
What is the z-score of a car that is 6 years old?
11.
What is the z-score of a car that is 9.1 years old?
12.
DeSean, a new employee, drives a brand new car. What is the
z-score of the car?
Suppose a manufacturer makes disposable peppercorn grinders. The
number of peppercorns in the grinders are normally distributed with a
mean of 322 peppercorns and a standard deviation of 5.3 peppercorns.
13.
Suppose a peppercorn grinder contains 315 peppercorns. What is the
z-score for this grinder?
14.
Suppose a peppercorn grinder contains 323 peppercorns. What is the
z-score for this grinder?
15.
Suppose the manufacturer will only sell peppercorn grinders with a
z-score between −2 and 2. What are the least and most peppercorns
a grinder can contain?
16.
Suppose the manufacturer will only sell peppercorn grinders with a
z-score between −0.9 and 0.9. What are the least and most
peppercorns a grinder can contain?
162
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T07_PS.indd 162
9/17/11 4:31 AM
Solve by completing the table. Round your answer to the
nearest hundredth.
17.
The healing time for a broken ankle is normally distributed with a
mean of 42 days and a standard deviation of 9.25 days.
Name
Ayla
Healing Time
35 days
Miranda
1.4
Stacy
18.
z-score
–2.2
With new technology, the healing time for a broken ankle now takes
a mean of 36 days with a standard deviation of 8.67 days.
Name
Alfredo
Josh
Sergio
Healing Time
z-score
35 days
1.4
–2.2
Suppose the velocities of golf swings for amateur golfers are normally
distributed with a mean of 95 miles per hour and a standard deviation
of 3.9 miles per hour.
19.
What is the difference in velocities between a golfer whose z-score
is 0 and another golfer whose z-score is 1?
20.
What is the difference in velocities between a golfer whose z-score
is 0 and another golfer whose z-score is −1?
21.
What is the difference in velocities between a golfer whose z-score
is −0.5 and another golfer whose z-score is 0.5?
22.
What is the difference in velocities between a golfer whose z-score
is −1 and another golfer whose z-score is −2?
23.
What is the difference in velocities between a golfer with a z-score
of a and another golfer whose z-score is a + 1?
Suppose a manufacturer makes nails that are normally distributed
with a mean length of x millimeters and a standard deviation of
y millimeters.
24.
Challenge In a certain batch of nails, an 83-millimeter nail has
a z-score of 1 and an 84.5-millimeter nail has a z-score of 2.
Determine x and y.
STANDARDIZING DATA
163
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T07_PS.indd 163
9/24/11 2:08 AM
Comparing Scores
Standard scores can be used to compare
normal distributions.
Comparing z-Scores from Two Distributions
Black tea has been studied for its medicinal properties and for the amount of
caffeine it contains (which can vary depending on the type of tea used and
the amount of time brewed).
The amount of caffeine in single servings of tea is normally distributed.
For a particular comparison, single servings of two different Ceylon tea
mixtures are compared.
Mixture A
BY THE WAY
Ceylon tea is a type of black tea
grown in Sri Lanka.
Mixture B
Mean: μ = 55 mg
Mean: μ = 52 mg
Standard Deviation: σ = 4.1 mg
Standard Deviation: σ = 3.3 mg
Suppose you are given a serving of Ceylon tea which contains 50 milligrams
of caffeine. Which mixture is the tea more likely to come from?
To determine whether the serving with 50 milligrams of caffeine is more
likely to come from Mixture A or Mixture B, you can determine the z-score
for each mixture.
Mixture A
Mixture B
50 − 55
z = _______
4.1
50 − 52
z = _______
3.3
≈ −1.22
≈ −0.61
For a normal distribution, any value that is closer to the mean will be a
more likely outcome than any value farther away from the mean. Since
−0.61 is closer to 0 than −1.22, Mixture B is more likely to produce a
serving of Ceylon tea with 50 milligrams of caffeine.
Paulo is doing his own research about the caffeine content of these two
Ceylon tea mixtures. The brew he makes with Mixture A has 57 milligrams
of caffeine in a single serving. The brew he makes with Mixture B has
49 milligrams of caffeine in a single serving. Which result is more unusual?
164
CHAPTER 4
REMEMBER
x−μ
z = _____
σ
THINK ABOUT IT
The z-score of the mean is 0.00
because it is zero standard
deviations away from the mean.
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T08_RG.indd 164
9/17/11 4:36 AM
To determine whether the serving with 57 milligrams of caffeine from
Mixture A is more unusual than the serving with 49 milligrams of caffeine
from Mixture B, you can determine the z-score for each mixture.
Mixture A
Mixture B
57 − 55
z = _______
4.1
49 − 52
z = _______
3.3
≈ 0.49
≈ −0.91
The z-scores for each mixture have different signs (one positive and one
negative). However, we are only interested in knowing which z-score is
farthest from 0.
Since Mixture A’s z-score of 0.49 is closer to 0 than Mixture B’s z-score of
−0.91, the serving with 49 milligrams of caffeine from Mixture B is more
unusual than the serving with 57 milligrams of caffeine from Mixture A.
Interpreting a z-Score in Two Distributions
Two normally distributed data sets can also be compared for the same
z-score. For instance, in looking at the two mixtures of Ceylon tea, which
mixture has more caffeine for a z-score of −3.00?
We know that the z-score is −3.00, and we know the mean and standard
deviation for each mixture. We can use the appropriate formula to determine
the actual amount of caffiene in a serving from each mixture.
Mixture A
Mixture B
x = −3 · (4.1) + 55
x = −3 · (3.3) + 52
= 42.7
REMEMBER
x=z·σ+μ
= 42.1
At a distance of three standard deviations below the mean, the two mixtures
have about the same amounts of caffeine. However, Mixture A has more
caffeine compared to Mixture B.
COMPARING SCORES
165
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T08_RG.indd 165
9/17/11 4:36 AM
Problem Set
The bar graph shows the z-score results of four students on
two different math tests. The students took Test 1, and then
a month later took Test 2.
Math Test z-Scores
1.75
1.5
1.25
1
z-Score
0.75
0.5
0.25
0
−0.25
−0.5
−0.75
Test 1
−1
Test 2
−1.25
Carla
Dave
Euan
Students
Felicia
1.
What are Carla’s z-scores for both tests?
4.
2.
How many standard deviations away from the
mean did Dave score on Test 2? Was the score
above the mean or below the mean? Explain.
Which student had the lowest score on Test 2?
Explain.
5.
Which student had the best overall performance
on both tests? Explain.
3.
Which student scored the farthest away from the
mean? On which test did this occur? Explain.
6.
Which student showed the most improvement
from Test 1 to Test 2? Explain.
Suppose the growth of Bermuda grass from germination to lawn care
length is normally distributed with a mean of 20 days with a standard
deviation of 3.3 days. The growth of buffalo grass from germination to
lawn care length is normally distributed with a mean of 21 days with a
standard deviation of 2.3 days.
7.
A gardener grew a lawn using Bermuda grass in
23 days. Determine the z-score.
8.
A gardener grew a lawn using buffalo grass in
15 days. Determine the z-score.
166
CHAPTER 4
9.
10.
On which lawn would a z-score of 3 give a
longer growing period? Explain.
On which lawn would a z-score of −2.5 give a
longer growing period? Explain.
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T08_PS.indd 166
9/17/11 4:39 AM
Suppose scores on a biology test are normally distributed with a
mean of 85 and a standard deviation of 4. Scores on a history test are
normally distributed with a mean of 85 and a standard deviation of 2.
11.
Ayla scored 90 on both tests. Determine her z-score for each test.
On which test did she have the higher z-score?
12.
Maya scored 80 on both tests. Determine her z-score for each test.
On which test did she have the higher z-score?
13.
Eric scored 89 on the biology test and 87 on the history test. On which
test did he have the better performance? Explain.
14.
Paulo scored 90 on the biology test and 89 on the history test. On which
test did he have the better performance? Explain.
15.
On which test would a z-score of 3 give a higher raw score? Explain.
16.
On which test would a z-score of −2.5 give a higher raw score? Explain.
17.
On which test would a z-score of 0.5 give a higher raw score? Explain.
18.
On which test would you expect a higher range of scores? Explain.
19.
On which test would a score of 87 be a better score? Explain.
20.
On which test would a score of 80 be a better score? Explain.
Suppose Company A produces packages of throat lozenges that are
normally distributed with a mean of 38.2 individual lozenges and a
standard deviation of 1.7 lozenges. Company B produces packages
of throat lozenges that are normally distributed with a mean of
36.9 individual lozenges and a standard deviation of 2.2 lozenges.
21.
Which company would be more likely to produce a package of 37 throat
lozenges? Explain using z-scores.
22.
Which company would be more likely to produce a package of 43 throat
lozenges? Explain using z-scores.
23.
Would it be more likely for Company A to produce a package of 35 throat
lozenges or Company B to produce a package of 40 throat lozenges?
Explain using z-scores.
24.
Challenge Suppose each company produces a package of x lozenges so
that the z-scores are identical. Find the value of x.
COMPARING SCORES
167
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T08_PS.indd 167
9/17/11 4:39 AM
The Standard Normal Curve
The standard normal curve is used to analyze data.
Standard Normal Curve Properties
When all of the data values of a normal distribution are turned into z-scores,
the resulting distribution is called the standard normal curve.
Each value on the horizontal axis of the
standard normal curve is a z-score, and the
mean is always 0.
Standard Normal Curve
For instance, a data value with a z-score
of 0.54 would correspond to 0.54 on the
horizontal axis of the standard normal curve.
–3
–2
–1 0
1
z-scores
2
3
PROPERTIES OF THE STANDARD NORMAL CURVE
The standard normal curve is a probability distribution with
the following properties:
• The mean is 0.
• The standard deviation is 1.
• The area under the curve is 1.
Using the z-Distribution Table
Because the standard normal curve is a probability distribution, it can be
used to find probabilities. These probabilities correspond to areas under the
standard normal curve.
Suppose a data value has a z-score of 0.54. What percent of the data values
are less than this value?
REMEMBER
If a z-score is positive, then it is
above the mean. If a z-score is
negative, then it is below the
mean.
?
0 0.54
z-scores
168
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T09_RG.indd 168
9/17/11 4:44 AM
To answer this question, we can use the Table of z-Scores for Normal
Distribution (pages A-1 and A-2). This table gives the areas under the
standard normal curve that are less than a given z-score.
z
0.0
0.1
0.2
0.3
0.4
0.5
0.6
.00
0.5000
0.5398
0.5793
0.6179
0.6554
0.6915
0.7257
.01
0.5040
0.5438
0.5832
0.6217
0.6591
0.6950
0.7291
.02
0.5080
0.5478
0.5871
0.6255
0.6628
0.6985
0.7324
.03
0.5120
0.5517
0.5910
0.6293
0.6664
0.7019
0.7357
.04
0.5160
0.5557
0.5948
0.6331
0.6700
0.7054
0.7389
BY THE WAY
.05
0.5199
0.5596
0.5987
0.6368
0.6736
0.7088
0.7422
Using the z-distribution table, look for
the area that corresponds to the row and
column of a z-score of 0.54. We can see that
P (z < 0.54) = 0.7054, which is about 71%.
Technology, such as calculators
and computers, can also be used
to find areas under the standard
normal curve.
Area = 0.7054
This means that about 71% of the data values
are less than a value with a z-score of 0.54.
0 0.54
z-scores
Using the Standard Normal Curve to Solve Problems
Suppose scores on a standardized test for future doctors are normally
distributed with a mean of 10.5 and a standard deviation of 2.2. If 150 future
doctors take the test, how many are likely to score between 12 and 15?
The score 12 has a z-score of about 0.68, and the score 15 has a z-score of
about 2.05. To help us answer the question, we need to know the area that
corresponds to P (0.68 < z < 2.05).
REMEMBER
z-scores are obtained using the
following formula:
x−μ
z = _____
σ
The area under the standard normal curve less than z = 0.68 is 0.7517, and
the area less than z = 2.05 is 0.9798.
So P (0.68 < z < 2.05) = 0.9798 − 0.7517 = 0.2281.
0.2281
0 0.68
z-scores
2.05
Multiply the area by the number of doctors taking the exam.
0.2281 · 150 ≈ 34
So about 34 of the 150 future doctors taking the exam are likely to score
between 12 and 15.
THE STANDARD NORMAL CURVE
169
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T09_RG.indd 169
9/17/11 4:44 AM
Problem Set
The expression represents the area under the standard normal curve
below a given value of z. Use the z-distribution table on pp. A-1 to A-2
or technology to find the indicated area, and include a sketch of the
standard normal curve with the appropriate area shaded.
1.
P (z < 0.00)
5.
P (z < −1.00)
2.
P (z < 1.00)
6.
P (z < −1.53)
3.
P (z < 1.57)
7.
P (z < −2.02)
4.
P (z < 2.87)
8.
P (z < −2.77)
The expression represents the area under the standard normal curve
above a given value of z. Use the z-distribution table on pp. A-1 to A-2
or technology to find the indicated area, and include a sketch of the
standard normal curve with the appropriate area shaded.
9.
P (z > 0.00)
13.
P (z > −1.00)
10.
P (z > 1.00)
14.
P (z > −1.53)
11.
P (z > 1.57)
15.
P (z > −2.05)
12.
P (z > 2.87)
16.
P (z > −2.87)
The expression represents the area under the standard normal curve
above a given value of z. Use the z-distribution table on pp. A-1 to A-2
or technology to find the indicated area, and include a sketch of the
standard normal curve with the appropriate area shaded.
17.
P (0.00 < z < 1.00)
21.
P (−1.33 < z < 1.59)
18.
P (0.00 < z < 2.00)
22.
P (−2.00 < z < 2.00)
19.
P (1.00 < z < 1.00)
23.
P (−1.61 < z < −0.09)
20.
P (1.06 < z < 2.63)
24.
P (−2.53 < z < −0.55)
170
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T09_PS.indd 170
9/24/11 2:09 AM
The scores on a standardized test are normally distributed with a mean
of 500 and a standard deviation of 100. Use the z-distribution table on
pp. A-1 to A-2 or technology to solve.
25.
Sofia scored 632 on the test. What percent of students scored
below Sofia?
26.
Jake scored 500 on the test. What percent of students scored
below Jake?
27.
Benita scored 432 on the test. What percent of students scored
below Benita?
28.
Kendall scored 380 on the test. What percent of students scored
below Kendall?
29.
Suppose 6000 students took the test. Approximately how many
students scored below 632?
30.
Suppose 6000 students took the test. Approximately how many
students scored above 490?
31.
Suppose 6000 students took the test. Approximately how many
students scored between 450 and 550?
32.
Suppose 6000 students took the test. Approximately how many
students scored between 500 and 600?
Lengths of newborn girls are normally distributed with a mean of
49.2 centimeters and a standard deviation of 1.8 centimeters. Use the
z-distribution table on pp. A-1 to A-2 or technology to solve.
33.
What percent of newborn girls will be 50 centimeters or longer?
34.
What percent of newborn girls will be 49 centimeters or longer?
35.
What percent of newborn girls will be 52 centimeters or longer?
36.
What percent of newborn girls will be 45 centimeters or shorter?
37.
Consider a group of 2000 newborn girls. How many girls will be
50.5 centimeters or longer?
38.
Consider a group of 2000 newborn girls. How many girls will be
51 centimeters or shorter?
39.
Consider a group of 2000 newborn girls. How many girls will be
between 49 centimeters and 51 centimeters?
40.
Consider a group of 2000 newborn girls. How many girls will be
between 44 centimeters and 45 centimeters?
41.
Challenge Consider a group of 2000 newborn girls. If 500 of the
newborn girls are between 47 centimeters and x centimeters long,
what is the value of x?
THE STANDARD NORMAL CURVE
171
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T09_PS.indd 171
9/17/11 4:50 AM
Finding Standard Scores
Areas under the standard normal curve can be used
to find standard scores and percentiles.
Using Area To Find a z-Score
While the straightforward way of reading the Table of z-Scores Normal
Distribution is to look up the z-score and then find the corresponding area
under the standard normal curve, we can also read the table in the other
direction.
REMEMBER
z-scores are sometimes called
standard scores.
Suppose that about 20% of the data in a data set is below a certain value.
What z-score corresponds to this value?
Standard Normal Curve
20%
z=?
z-scores
Since 20% = 0.2, we want to find an area in the body of the z-distribution
table that is closest to 0.2.
z
–1.4
–1.3
–1.2
–1.1
–1.0
–0.9
–0.8
–0.7
.00
0.0808
0.0968
0.1151
0.1357
0.1587
0.1841
0.2119
0.2420
.01
0.0793
0.0951
0.1131
0.1335
0.1562
0.1814
0.2090
0.2389
.02
0.0778
0.0934
0.1112
0.1314
0.1539
0.1788
0.2061
0.2358
.03
0.0764
0.0918
0.1093
0.1292
0.1515
0.1762
0.2033
0.2327
.04
0.0749
0.0901
0.1075
0.1271
0.1492
0.1736
0.2005
0.2296
.05
0.0735
0.0885
0.1056
0.1251
0.1469
0.1711
0.1977
0.2266
The area 0.2005 is closest to 20%. Use the row and column of the area to
locate the corresponding z-score. This corresponds to a z-score of −0.84.
172
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T10_RG.indd 172
9/17/11 4:55 AM
Determining Area Above a z-Score
Suppose that 15% of the data in a data set is above a certain value.
What z-score corresponds to this value?
85%
15%
z-scores
z=?
Since the z-distribution table only gives the areas that are below a
particular z-score, we need to find the percent of data below the value:
100% − 15% = 85%. Since 85% = 0.85, we want to find an area in the
body of the z-distribution table that is closest to 0.85.
z
0.6
0.7
0.8
0.9
1.0
1.1
.00
0.7257
0.7580
0.7881
0.8159
0.8413
0.8643
.01
0.7291
0.7611
0.7910
0.8186
0.8438
0.8665
.02
0.7324
0.7642
0.7939
0.8212
0.8461
0.8686
.03
0.7357
0.7673
0.7967
0.8238
0.8485
0.8708
.04
0.7389
0.7704
0.7995
0.8264
0.8508
0.8729
.05
0.7422
0.7734
0.8023
0.8289
0.8531
0.8749
The area 0.8508 is closest to 85%. This corresponds to a z-score of 1.04.
Understanding Percentiles
A percentile rank is the percentage of data that falls below a particular
value. For example, if a student scores higher than 60% of the other
students who took an exam, then the student’s percentile rank is at the
60th percentile.
Scores on the math section of the Scholastic Aptitude Test (SAT) are
normally distributed with a mean of about 500 and a standard deviation of
about 100. According to this information, what score would be at the
80th percentile?
Remember
The nth percentile is the value
(or score) below which n percent
of the data may be found.
Using the z-distribution table, we see that the area 0.8023 is closest to 80%.
This corresponds to a z-score of 0.85. We can use the appropriate formula to
find the corresponding score.
x = z · σ + µ
= 0.85 · 100 + 500
= 585
So a score of 585 on the math section of the SAT is at the 80th percentile.
FINDING STANDARD SCORES 173
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Problem Set
The number represents the area under the standard normal curve
below a particular z-score. Find the z-score, and include a sketch of the
standard normal curve with the appropriate area shaded.
1.
0.2358
5.
0.9115
2.
0.9967
6.
0.8869
3.
0.5000
7.
0.1003
4.
0.1170
8.
0.2389
Suppose a set of data is normally distributed. Use the z-distribution
table on pp. A-1 to A-2 or technology to solve.
9.
Below which z-score do approximately half of the data lie?
10.
Above which z-score do approximately half of the data lie?
11.
Below which z-score do approximately 23.58% of the data lie?
12.
Above which z-score do approximately 23.58% of the data lie?
13.
Below which z-score do approximately 88.3% of the data lie?
14.
Above which z-score do approximately 88.3% of the data lie?
15.
3
Above which z-score do approximately __
4 of the data lie?
16.
3
Below which z-score do approximately __
4 of the data lie?
Suppose scores on a standardized test are normally distributed with
a mean of 500 and a standard deviation of 100. Use the z-distribution
table on pp. A-1 to A-2 or technology to solve.
17.
Approximately 50% of the scores are above
which z-score? Which test score corresponds
to the z-score?
20.
Approximately 10% of the scores are below
which z-score? Which test score corresponds
to the z-score?
18.
Approximately 50% of the scores are below
which z-score? Which test score corresponds
to the z-score?
21.
Approximately 80% of the scores are above
which z-score? Which test score corresponds
to the z-score?
19.
Approximately 10% of the scores are above
which z-score? Which test score corresponds
to the z-score?
22.
Approximately 80% of the scores are below
which z-score? Which test score corresponds
to the z-score?
174
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T10_PS.indd 174
9/17/11 4:57 AM
Suppose ages of cars driven by company employees are normally
distributed with a mean of 8 years and a standard deviation of
3.2 years. Use the z-distribution table on pp. A-1 to A-2 or
technology to solve.
23.
Approximately 15% of cars driven by company employees are older
than what age?
24.
Approximately 75% of cars driven by company employees are older
than what age?
25.
In a sample of 8000 company employees who drive cars, 6000 of
them drive cars younger than x years old. Determine x.
26.
In a sample of 10,000 company employees who drive cars, 1350 of
them drive cars younger than y years old. Determine y.
Assume that lengths of newborn girls are normally distributed with a
mean of 49.2 centimeters and a standard deviation of 1.8 centimeters.
27.
Find the length of a newborn girl whose length is at the 50th percentile.
28.
Find the length of a newborn girl whose length is at the 90th percentile.
29.
Find the length of a newborn girl whose length is at the 45th percentile.
30.
What is the percentile rank of a baby girl whose length is 51 centimeters?
31.
What is the percentile rank of a baby girl whose length is 53 centimeters?
32.
What is the percentile rank of a baby girl whose length is 47 centimeters?
FINDING STANDARD SCORES
175
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_T10_PS.indd 175
9/17/11 4:57 AM
Chapter 4 Wrap-Up
Probability Distributions
Though there are slightly more boys than girls under the age of 15 in
the United States, it is often estimated that there is a 50% chance that a
newborn baby will be a boy (or girl as the case may be).
In general, there should be an equal probability of a newborn being either a
boy or a girl. To test this, a binomial distribution can be created for a family
with 3 children.
This is a binomial distribution with P(success) = 0.5 and 3 trials. The
probability distribution is given in the table and the histogram.
X
0
1
2
3
P
0.125
0.375
0.375
0.125
Number of Boys in Family
(Predicted)
40
35
Probability (%)
30
25
20
15
10
5
0
0
1
2
Number of boys
3
A few conclusions can be drawn from this distribution. In a family of
3 children,
• There is a 37.5% chance that the family will have 1 boy.
• There is a 75% chance that the family will have boys and girls.
• There is a 25% chance that the family will have all boys or all girls.
176
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_WU_RG.indd 176
9/17/11 4:59 AM
Normal Distributions
In general, the length of a full-term newborn baby is distributed normally
with a mean of 20.1 inches with a standard deviation of 0.4 inches. According
to the properties of a normal distribution, this means that approximately 68%
of newborn babies are between 19.7 inches and 20.5 inches; 95% of newborn
babies are between 19.3 inches and 20.9 inches; and 99.7% of newborn
babies are between 18.9 inches and 21.3 inches.
What is the 80th percentile of a newborn baby’s length? According to the
standard normal distribution, the value in the z-distribution table closest
to 0.8 is when the z-score is z = 0.84.
To convert the z-score into the actual length, multiply 0.84 by the standard
deviation (0.4), then add the mean (20.1): 0.84 · (0.4) + 20.1 = 20.44.
This means that 80% of newborn babies will be no more than
20.44 inches long.
In Summary
Probability distributions give probabilities over a whole set of outcomes.
A binomial distribution explains repeated trials of the same event when the
probability of success is the same for each trial. A normal distribution has
properties that can be used to analyze data sets.
CHAPTER 4 WRAP-UP
177
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_WU_RG.indd 177
9/17/11 4:59 AM
Practice Problems
Random Variables and Distributions
The frequency table shows the summer employment hourly wages
(to the nearest dollar) for a group of high school students.
Hourly Wage
$7
$8
$9
$10
$11
$12
$17
Number of High School Students
6
4
3
4
1
1
1
1.
Explain why the table is not a probability distribution table.
2.
If X is the hourly wage, complete the probability distribution table.
Round probabilities to the nearest hundredth.
X
7
8
9
10
11
12
17
P
3.
If a high school student is selected at random, determine P(X < 10).
4.
Based on these data, what would be the expected value of a high
school student’s summer hourly wage?
The Personal Computers data set on p. A-9 shows the number of
personal computers owned per 100 inhabitants of South American
countries from 1998 to 2005.
5.
Create a probability distribution table for 2004. Round probabilities
to the nearest thousandth.
6.
Suppose a personal computer is in use in South America in 2004.
What is the probability, to the nearest thousandth, that the computer
is being used in Chile?
7.
Suppose there are 280,000 personal computers of a certain brand in
use in South America in 2004. How many would you expect to be
used in Peru?
Determine if X has a binomial distribution. If it does, create a binomial
distribution histogram. Otherwise, explain why X does not have a
binomial distribution.
8.
When a coin is tossed twice, X is the number of heads.
9.
A tennis player’s probability of winning a match is 0.6. X is the
number of wins for the next 3 matches.
10.
178
Over the next 5 days, X is the number of days it will snow.
CHAPTER 4
RANDOM VARIABLES AND DISTRIBUTIONS
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_WU_PS.indd 178
9/17/11 5:03 AM
Chapter 4 Wrap-Up
The uniform probability distribution shows temperature X of an oven
set at 325°F at random times.
318
320
11.
What is the area of the distribution?
12.
What is the height of the distribution?
13.
Determine P (X < 323).
322
324
326
328
Temperature (°F)
330
332
X
14.
Suppose the oven temperature is taken at
6 p.m. What is probability that the temperature
will be within 1.5°F of 325°F?
15.
If P (X > a) = 0.7, what is the value of a?
Use the probability distribution to find the probability indicated.
16.
What is the area of the distribution?
17.
Determine P (X < 3).
18.
Determine P (X < 2 or X > 4).
19.
Determine P (3 < X ≤ 6).
20.
If P (X < a) = 0.5, what is the value of a?
21.
If P (X < a) = 0.25, what is the value of a?
P
0.3
0.25
0.2
0.15
0.1
0.05
0
X
1 2 3 4 5 6 7
Use the z-distribution table on pp. A-1 to A-2 or technology to solve.
The number of seeds in a certain type of strawberry are distributed
normally with a mean of 200 seeds and a standard deviation of 9 seeds.
22.
What percent of strawberries have fewer than
200 seeds?
25.
If X is the number of seeds on a strawberry,
what is P (185 < X < 205)?
23.
What percent of strawberries have more than
191 seeds?
26.
How many seeds would a strawberry contain if it
were at the 20th percentile?
24.
If a strawberry has a z-score of 1.11, about how
many seeds does it contain?
27.
How many seeds would a strawberry contain if it
were at the 90th percentile?
Use the z-distribution table on pp. A-1 to A-2 or technology to solve.
Scores on a college entrance test of verbal skills are normally
distributed with a mean of 78 and a standard deviation of 5. At the
same college, scores on the entrance test of mathematical skills are
normally distributed with a mean of 84 and a standard deviation of 2.
28.
Henry had a z-score of −0.1 on the verbal skills test and a z-score of 1.8
on the mathematical skills test. What raw score did he get on each test?
29.
Millie scored an 83 on the verbal skills test and an 86 on the
mathematical skills test. Explain why Millie did equally well on
both tests.
30.
On which test is a score of 80 more likely? Explain.
CHAPTER 4 WRAP-UP
179
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
Copyright © 2011, K12 Inc. All rights reserved. This material may not be reproduced in whole or in part, including illustrations, without the express prior written consent of K12 Inc.
HS_PS_S1_04_WU_PS.indd 179
9/17/11 5:03 AM