Download The White Board

Document related concepts

Probability wikipedia , lookup

Statistics wikipedia , lookup

History of statistics wikipedia , lookup

Transcript


Printer
Computer w/ Internet Access
Assessment
 Discussions (4)
0-40
 Practice (3)
0-75
 Quizzes (4)
0-40
 Tests (CST)
Test 1 (0-48)
TOTAL
0-203
AP Statistics
Grading Scale
Semester 2 - Unit 1
97 - l00% = A+
93 - 96% = A
90 - 92% = A87 - 89% = B+
83 - 86% = B
80 - 82% = B77 - 79% = C+
73 - 76% = C
70 - 72% = C<69% = INC
Name:
Teacher:
1 of 70
AP Statistics Sem 2 - Unit 1
Materials List
Binomials and Distributions
The White Board
04152013
This Page Left
Intentionally
Blank
2 of 70
AP Statistics
Unit Overview: Binomials and Distributions
Think about It
Page 1 of 4
Unit 1
S2
The following questions will help you study key concepts covered in this unit.
1. You're taking samples of size 10 from a population of 20 marbles and recording the
number of blue marbles in each sample. Why isn't this a binomial situation, and why
would it be almost binomial if you did the same experiment with a population of 200
marbles?
2. Explain the term continuity correction. Why does it work, and how do you use it?
3. Consider a parent population with mean 75 and standard deviation 7. The population
doesn't appear to have extreme skewness or outliers.
A. What are the mean and standard deviation of the distribution of sample means for
n = 40?
B. What's the shape of the distribution? Explain your answer in terms of the central
limit theorem.
C. What proportion of the sample means of size 40 would you expect to be 77 or less?
If you use your calculator, show what you entered.
D. Draw a sketch of the probability you found in part C, and label the horizontal axis.
4. Suppose that after several years of offering AP Statistics, a high school finds that final
exam scores are normally distributed with mean 78 and standard deviation 6.
A. What are the mean, standard deviation, and shape of the distribution of x-bar for
n = 50?
B. What's the probability a sample of scores will have a mean greater than 80?
C. Sketch the distribution curve for part B, showing the area that represents the
probability you found. Be sure to label the horizontal axis.
5. In a survey, 600 mothers and fathers were asked about the importance of sports for
boys and girls. Of the parents interviewed, 70% said the genders are equal and should
have equal opportunities to participate in sports.
A. What are the mean, standard deviation, and shape of the distribution of the sample
ˆ of parents who say the genders are equal and should have equal
proportion p
opportunities? Be sure to justify your answer for the shape of the distribution. Use
n = 600.
B. Using the normal approximation without the continuity correction, sketch the
ˆ . Shade equal areas on both
probability distribution curve for the distribution of p
sides of the mean to show an area that represents a probability of .95, and label the
upper and lower bounds of the shaded area as values of p-hat (not z-scores).
C. In your sketch from part B, the shaded area shows a .95 probability of what
happening? In other words, what does the probability of .95 represent?
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use
3 ofat70www.apexvs.com/TermsOfUse)
AP Statistics
Unit Overview: Binomials and Distributions
Think About It
Page 2 of 4
D. Using the normal approximation, what's the probability that a randomly drawn
sample of 600 parents will have a sample proportion between 67% and 73%?
Draw a sketch of the probability curve, shade the area representing the
probability you're finding, and label the z-scores that represent the upper and
lower bounds of the probability you're finding. Don't use the continuity correction.
E. Now, use the exact binomial calculation to find the probability of getting between,
but not including, 67% and 73% of the respondents in a sample of 600 who say the
genders are equal and should have equal opportunities. To use the exact binomial,
you'll need to convert the proportions to counts by multiplying each proportion by
600.
F. Now try it again, but this time find the probability of getting at least 67% but no
more than 73%. Use the exact binomial calculation.
6. Annie is a basketball player who makes, on average, 65% of her free throws. Assume
each shot is independent and the probability of making any given shot is .65.
A. What's the probability Annie will miss three straight free throws before she makes
one? (If you use a calculator to get your answer, write your answer in standard
notation and show what you do on the calculator as well.)
B. During a season, Annie takes 140 free throws. What's the exact binomial probability
she'll make at least 100 out of 140 of these throws?
C. For part B, are the conditions that permit you to use a normal approximation to the
binomial satisfied? Explain.
D. Redo part B using a normal approximation, without continuity correction, to the
binomial. Draw a sketch of the situation. Then draw the distribution, shade the
area representing the probability you're finding, and label the horizontal
axis approximately.
NOTE: If you use a graphing calculator, show what you do to get the z-scores, and
explain your answer in enough detail to show your instructor you understand what
you're doing.
E. Redo part B using a normal approximation, with continuity correction, to the
binomial. NOTE: If you use a graphing calculator, show what you do to get the
z-scores on your calculator, and explain your answer in enough detail to show your
instructor you understand what you're doing.
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use
4 ofat70www.apexvs.com/TermsOfUse)
AP Statistics
Unit Overview: Binomials and Distributions
Think About It
Page 3 of 4
Discussion
7. Have you ever been stuck in Jail when playing Monopoly? To get out, you need to roll doubles in
three tries or fewer or you have to pay. On average, how many times would people have to roll
before getting doubles, and is that number larger than three? This is an average waiting-time
question, where you're interested in the number of tries you need on average to get the outcome
you want.
You'll learn more about situations like this and the probability distributions they produce in the
next Tutorial. Meanwhile, this Discussion will help you think about waiting-time situations.
Respond to any one (or more) of the following, or respond to another student's posting. As you
explain your reasoning, be sure to use what you know about the laws of probability.
1. If you buy a very large bag of candies colored brown, yellow, green, blue, orange, and red,
and you start eating them, how many candies would you expect to pick until you got a blue
one? Explain your reasoning using what you know about probability.
2. What's the probability you'll get out of Jail in Monopoly without having to pay? In other words,
what are the chances you'll roll doubles on two six-sided dice within three tries?
3. You may be familiar with promotional campaigns where a company's products are marked
with a letter, under the bottle cap of a soft drink, for example, and you're supposed to spell
something to win a prize. In these cases, do you think some game pieces are more common
than others? Describe an experiment you could conduct to see if some game pieces are easier
to get than others.
4. Say you're playing a game like the one described in topic 3. A soft drink company has a letter
printed on each bottle cap and the object is to spell the words I bought a lot of bottles to spell
this. You have all the letters you need except p. The company's disclaimer statement says for
each bottle you buy there's a 1/200 chance of getting a p. On average, how many bottles
would you expect most people to buy in order to get this letter?
5. Create your own question about an average = waiting-time situation, or describe a real one
you've seen or participated in. Explain why it's an average = waiting-time situation, and invite
other students to answer it.
6. Have you ever won anything in a game like the ones described in topics 3 and 4? Describe the
game, and calculate the probability of winning.
8.
A sampling distribution is the distribution of a statistic. In other words, if you took the mean of
many samples from a population, the set of means would form the distribution of .
Some of the questions below have important information you'll need in order to understand the
other questions, so before choosing which question to answer please read them all.
Respond to one or more of the following or respond to another student's posting.
1. Suppose you had a population of fish whose lengths were normally distributed with a mean of
50 centimeters and a standard deviation of 5 centimeters. You draw a simple random sample
of size 10, record the length of each fish, and calculate the mean of the sample lengths. What
do you think your sample mean would most likely be? Explain your answer using what you
know about random samples and sample means.
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use
5 ofat70www.apexvs.com/TermsOfUse)
AP Statistics
Unit Overview: Binomials and Distributions
Think About It
Page 4 of 4
2. For the scenario in topic 1, imagine you draw hundreds of samples of size 10 (replacing each
fish after you record its length) and calculate the mean of each sample. If you kept doing this
until you took every possible sample, you'd get a distribution of sample means called a
sampling distribution. What would the shape of the sampling distribution be? Would the
standard deviation of the sampling distribution be larger or smaller than the population
standard deviation? Explain your answer using what you know about random samples and
sample standard deviations.
3. Why might a sampling distribution be helpful if you wanted to estimate the likelihood of
getting a particular sample value, say x-bar = 48 centimeters? What's your guess about the
likelihood of getting a sample mean of 48 centimeters? Explain your answer using what you
know about probability distributions.
4. Building on items 1-3, how might a sampling distribution be helpful if you wanted to estimate
a population mean with a sample mean? Outline a scenario for using a sampling distribution
for inference.
5. Write a paragraph-long story about a sampling distribution; describe the population, the
sampling method, the mean and standard deviation of the population and the sampling
distribution.
6. There's a case where a sampling distribution qualifies as a binomial setting. Describe this case,
using the criteria for a binomial distribution.
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use
6 ofat70www.apexvs.com/TermsOfUse)
1.1
Key Terms
inferential statistics
7 of 70
S2
6WDWLVWLFV6WXG\*XLGH
,QWURGXFWLRQWR,QIHUHQWLDO6WDWLVWLFV
3DJHRI
1.1.1
'LUHFWLRQV
• 8VHWKLVJXLGHWRWDNHQRWHVZKLOH\RXZDWFKWKH7XWRULDO
• .HHS\RXUFRPSOHWHG6WXG\*XLGHLQ\RXUQRWHERRNIRUODWHUVWXG\
.H\7HUPVDQG&RQFHSWV
• QRQH
7XWRULDO6HFWLRQV
3UREDELOLW\DQG&RQILGHQFH,QWHUYDOV
3UREDELOLW\DQG7HVWVRI6LJQLILFDQFH
6WHSVWR,QIHUHQFH
BBBBBBBBBBBBBBBBBBB
‹&RS\ULJKW$3(;2QOLQH/HDUQLQJ,QF$OOULJKWVUHVHUYHG$GYDQFHG3ODFHPHQWDQG$3DUHUHJLVWHUHG
WUDGHPDUNVRIWKH&ROOHJH%RDUG7KLVPDWHULDOLVLQWHQGHGIRUWKHH[FOXVLYHXVHRI$3(;HQUROOHGVWXGHQWV$Q\
XQDXWKRUL]HGFRS\LQJUHXVHRUUHGLVWULEXWLRQLVSURKLELWHG
8 of 70
S2
6WDWLVWLFV6WXG\*XLGH
,QWURGXFWLRQWR,QIHUHQWLDO6WDWLVWLFV
3DJHRI
&RPPRQ$SSOLFDWLRQV
BBBBBBBBBBBBBBBBBBB
‹&RS\ULJKW$3(;2QOLQH/HDUQLQJ,QF$OOULJKWVUHVHUYHG$GYDQFHG3ODFHPHQWDQG$3DUHUHJLVWHUHG
WUDGHPDUNVRIWKH&ROOHJH%RDUG7KLVPDWHULDOLVLQWHQGHGIRUWKHH[FOXVLYHXVHRI$3(;HQUROOHGVWXGHQWV
$Q\XQDXWKRUL]HGFRS\LQJUHXVHRUUHGLVWULEXWLRQLVSURKLELWHG
9 of 70
1.2
Key Terms
almost binomial
binomcdf on a graphing calculator
almost binomial
binomcdf on a graphing calculator
binomial experiment (setting)
binomial probabilities for a range of outcomes
binomial probabilities for an exact number of outcomes
binompdf on a graphing calculator
continuity correction
geometric setting
normal approximation to the binomial
10 of 70
S2
AP Statistics
Study Guide: Binomial Situations (Events)
3DJHRI4
1.2.1
'LUHFWLRQV
• 8VHWKLVJXLGHWRWDNHQRWHVZKLOH\RXZDWFKWKH7XWRULDO
• .HHS\RXUFRPSOHWHG6WXG\*XLGHLQ\RXUQRWHERRNIRUODWHUVWXG\
.H\7HUPVDQG&RQFHSWV
•
•
•
•
DOPRVWELQRPLDO
ELQRPFGI
ELQRPLDOH[SHULPHQW
ELQRPLDOSUREDELOLWLHVIRUDUDQJHRI
RXWFRPHV
•
•
•
•
ELQRPLDOSUREDELOLWLHVIRUDQH[DFW
QXPEHURIRXWFRPHV
ELQRPLDOVHWWLQJ
ELQRPSGI
LQIHUHQWLDOVWDWLVWLFV
7XWRULDO6HFWLRQV
7KH'HILQLWLRQRID%LQRPLDO6HWWLQJRU(YHQW
&DOFXODWLQJ%LQRPLDO3UREDELOLWLHVIRUDQ([DFW1XPEHURI2XWFRPHV
&DOFXODWLQJ%LQRPLDO3UREDELOLWLHVIRUD5DQJHRI2XWFRPHV
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
11 of 70
S2
AP Statistics
Study Guide: Binomial Situations (Events)
$OPRVW%LQRPLDO
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
12 of 70
Page 2 of 4
AP Statistics
Study Guide: Binomial Situations (Events)
Page 3 of 4
8VLQJWKH7,/TI-84WR&DOFXODWH%LQRPLDO'LVWULEXWLRQ3UREDELOLWLHV
7R&DOFXODWHD%LQRPLDO3UREDELOLW\IRUD6SHFLILF9DOXHELQRPSGI
7KHSGIVWDQGVIRUSUREDELOLW\GHQVLW\IXQFWLRQ8VHELQRPSGIDQGHQWHUWKHQXPEHUV
DQGFRPPDVLQWKHVDPHRUGHUDVWKHSUREDELOLW\QRWDWLRQ)RUH[DPSOHWRFDOFXODWH
%
•
•
•
•
SUHVVQG >',675@
VFUROOGRZQWR>ELQRPSGI@DQGSUHVV(17(5
.H\LQ<RXUKRPHVFUHHQVKRXOGUHDGELQRPSGI
SUHVV(17(5DQG\RXVKRXOGJHW
7R&DOFXODWHD3UREDELOLW\IRUD5DQJHRI9DOXHVELQRPFGI
7KHFGIVWDQGVIRUFXPXODWLYHGHQVLW\IXQFWLRQ(QWHUWKHQXPEHUVDQGFRPPDVLQWKH
VDPHRUGHUDVWKHSUREDELOLW\QRWDWLRQ)RUH[DPSOHWRFDOFXODWH%
•
•
•
SUHVVQG >',675@
VFUROOGRZQWR>ELQRPFGI@DQGSUHVV(17(5
$VLQELQRPSGIHQWHUWKHQXPEHURIWULDOVWKHSUREDELOLW\IRUHDFKWULDODQGWKH[
YDOXH
)RUH[DPSOHIRU%3;”LVELQRPFGI
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
13 of 70
AP Statistics
Study Guide: Binomial Situations (Events)
Page 4 of 4
Using the TI-89 to Calculate Binomial Distribution Probabilities
To Calculate a Binomial Probability for a Specific Value: Binomial Pdf f
Pdf stands for “probability density function.” Use binomPdf and enter the numbers and
commas in the same order as they are given in the probability notation. For example, to
calculate B(20,.3,5):
1. From Stats/List Editor, press [F5]:Distr. Scroll down to [B]: Binomial Pdf.... :Press
[ENTER].
2. For Num Trials, n, enter 20. Arrow down one cell. For Prob Success, p, enter .3.
Arrow down again. For X Value, enter 5. Press [ENTER] to save your settings, then
[ENTER] again to execute the command. You should get .178863 for your answer.
3. You can also access many of the statistics functions by pressing [CATALOG][F3]. This
will produce a list of common functions available on the calculator.
To Calculate a Binomial Probability for a Range of Values: Binomial Cdf
Cdf stands for “cumulative density function.” This function allows you to calculate the
probability of a range of possible outcomes from a binomial setting. You will need to identify
the lower and upper bounds for that range. To calculate the probability of obtaining five or
fewer successes out of 20 trials, with a probability of success of 0.3:
1. From Stats/List Editor, press [F5]:Distr. Scroll down to [C]: Binomial Cdf.... :Press
[ENTER].
2. For Num Trials, n, enter 20. Arrow down one cell. For Prob Success, p, enter .3.
Arrow down again. For the Lower Bound, enter 0. Arrow down. For the Upper Bound,
enter 5. Press [ENTER] to save your settings, then [ENTER] again to execute the
command.
3. You should get .416371 for your answer.
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
14 of 70
Page 1 of 3
1.2.3
Why Study Binomial Distributions?
Remember that, although statistical inference takes many forms, it often follows these
phases:
1. Identify what you want to study.
2. Ensure that your study and your sample are valid and useful for drawing conclusions
about what you're studying. This includes understanding how the data were collected
(so you know their limitations).
3. Recognize the type of probability distribution that models your situation. The
probability distribution tells you how likely or unlikely your findings are. You can also
use it to find the range of values your population parameter is likely to have.
4. If you're comparing at least two measures or studying a relationship, conduct
significance tests to compare your results against what you'd expect from chance
alone. If the study's results are highly unlikely, your results are statistically
significant.
This Activity focuses on phase 3, the probability distribution, with particular emphasis on
the binomial distribution. You'll need to know about a few different types of probability
distributions to do inferential statistics.
Review
Distribution is one of the most important concepts in statistics. Every variable has a
distribution, whether the variable is categorical or numerical, continuous or discrete. A
variable's distribution consists of the values it takes and how often it takes each of these
values. Distributions are often shown in graphs so you can see the relative frequency
with which values occur.
A binomial distribution (also referred to sometimes as the binomial setting) is a discrete
probability distribution of the counts of successes and failures, such as the number of
times you get heads when you flip a coin 100 times. Most textbooks list the following
four characteristics for the binomial distribution (or setting):
In the Binomial Setting,
1. There's a set n of identical trials.
2. The outcome of each trial is either success or failure.
3. The probability of success of each trial is p. (The probability of failure of each trial is
q = 1 - p.)
4. Each trial is independent.
15 of 70
S2
Page 2 of 3
Some textbooks list the following as a fifth characteristic, and others use it as an
introduction to the previous four:
5. In the binomial setting, we're interested in x, the number of successes observed
during the n trials, for x = 0, 1, 2, 3 , … , n.
The following are examples of binomial settings:
• What's the probability of getting exactly three heads on five flips of a coin?
• A coin is flipped 100 times. What's the probability of getting 45 heads?
• If the probability of having Rh-negative blood is .4 in a population of 500, what's the
probability 210 have type Rh-negative blood?
These are all probability settings in which:
1. We know the number of repetitions.
2. The outcome of each trial is either success or failure.
3. We know the probability of success or failure of any trial.
4. The probability doesn't change from trial to trial (the trials are independent.)
Other Definitions to Remember
Binomial Event: a set of trials within a binomial setting
Simple Event or Outcome: a single trial
B(n, p): the binomial probability distribution with parameters n, the number of
observations, and p, the probability of success on any one observation. If X is a binomial
 n
random variable with B(n, p), B(n, p, x) = P(X = x) =   (p)x(1 – p)n–x.
x
On your calculator, binompdf(n,p,X) calculates the probability of exactly x successes in n
trials with a probability of p.
With a range of outcomes for X, where X takes on the number of successes 0, 1, 2, 3, …,
n, use binomcdf(n,p,X) on your calculator, which calculates the probability of up to and
including x successes in n trials with a probability of p for the binomial probability
distribution.
Remember, the binomial distribution is discrete: Each single value has a probability. This
means that, unlike a continuous distribution such as the normal distribution, there's a
difference between P(x < 4) and P(x ≤ 4). So for P(x < 4), you add
P(x = 0) + P(x = 1) + P(x = 2) + P(x = 3). But for P(x ≤ 4) you include the 4, so you
add P(x = 0) + P(x = 1) + P(x = 2) + P(x = 3) + P(x = 4).
Be careful about your endpoints when using binomcdf. With binomcdf on a calculator,
you get the lower-tail probability that includes the upper bound. So, if you enter
binomcdf(20,.3,5), you're finding P(x ≤ 5) for B(20, .3). If you need a range with a
specific lower and upper bound, remember to subtract like this:
(probability below upper bound) – (probability below lower bound).
16 of 70
Page 3 of 3
Be careful about the range you calculate. It may help to draw the range on a number
line. For example, if you want to calculate P(4 < x < 8) for B(10, .3), you want the
range represented by the underlined numbers:
0 1 2 3 4 5 6 7 8 9 10.
So you'd calculate:
binomcdf(10,.3,7) – binomcdf(10,.3,4). Note that this will give you the range from 5 to
7.
If you wanted to calculate P(4 ≤ x < 8), you include the 4 in your range:
0 1 2 3 4 5 6 7 8 9 10.
So you'd calculate:
binomcdf(10,.3,7) – binomcdf(10,.3,3). Note that now you're subtracting the 3 (since
it's included in binomcdf(10,.3,3)) but not the 4.
If you want P(4 ≤ x ≤ 8), you'd include the 8:
0 1 2 3 4 5 6 7 8 9 10.
You'd calculate:
binomcdf(10,.3,8) – binomcdf(10,.3,3).
There are cases where the criteria of the binomial setting might not be satisfied, but
which would still be considered binomial. An event is almost binomial if the population
size of an experiment is at least 20 times larger than the sample size. (NOTE: 20 is a
rule of thumb. Some references use 10 instead of 20.)
17 of 70
Page 1 of 3
1.2.3
Why Study Binomial Distributions?
Remember that, although statistical inference takes many forms, it often follows these
phases:
1. Identify what you want to study.
2. Ensure that your study and your sample are valid and useful for drawing conclusions
about what you're studying. This includes understanding how the data were collected
(so you know their limitations).
3. Recognize the type of probability distribution that models your situation. The
probability distribution tells you how likely or unlikely your findings are. You can also
use it to find the range of values your population parameter is likely to have.
4. If you're comparing at least two measures or studying a relationship, conduct
significance tests to compare your results against what you'd expect from chance
alone. If the study's results are highly unlikely, your results are statistically
significant.
This Activity focuses on phase 3, the probability distribution, with particular emphasis on
the binomial distribution. You'll need to know about a few different types of probability
distributions to do inferential statistics.
Review
Distribution is one of the most important concepts in statistics. Every variable has a
distribution, whether the variable is categorical or numerical, continuous or discrete. A
variable's distribution consists of the values it takes and how often it takes each of these
values. Distributions are often shown in graphs so you can see the relative frequency
with which values occur.
A binomial distribution (also referred to sometimes as the binomial setting) is a discrete
probability distribution of the counts of successes and failures, such as the number of
times you get heads when you flip a coin 100 times. Most textbooks list the following
four characteristics for the binomial distribution (or setting):
In the Binomial Setting,
1. There's a set n of identical trials.
2. The outcome of each trial is either success or failure.
3. The probability of success of each trial is p. (The probability of failure of each trial is
q = 1 - p.)
4. Each trial is independent.
Some textbooks list the following as a fifth characteristic, and others use it as an
introduction to the previous four:
5. In the binomial setting, we're interested in x, the number of successes observed
during the n trials, for x = 0, 1, 2, 3 , … , n.
18 of 70
S1
Page 2 of 3
The following are examples of binomial settings:
• What's the probability of getting exactly three heads on five flips of a coin?
• A coin is flipped 100 times. What's the probability of getting 45 heads?
• If the probability of having Rh-negative blood is .4 in a population of 500, what's the
probability 210 have type Rh-negative blood?
These are all probability settings in which:
1. We know the number of repetitions.
2. The outcome of each trial is either success or failure.
3. We know the probability of success or failure of any trial.
4. The probability doesn't change from trial to trial (the trials are independent.)
Other Definitions to Remember
Binomial Event: a set of trials within a binomial setting
Simple Event or Outcome: a single trial
B(n, p): the binomial probability distribution with parameters n, the number of
observations, and p, the probability of success on any one observation. If X is a binomial
 n
random variable with B(n, p), B(n, p, x) = P(X = x) =  (p)x(1 – p)n–x.
x
On your calculator, binompdf(n,p,X) calculates the probability of exactly x successes in n
trials with a probability of p.
With a range of outcomes for X, use Binomial Cdf on your calculator, which allows you to
select lower and upper values for X, and finds the probability of obtaining a count of successes
that falls within those values.
Remember, the binomial distribution is discrete: Each single value has a probability. This
means that, unlike a continuous distribution such as the normal distribution, there's a
difference between P(x < 4) and P(x ≤ 4). So for P(x < 4), you add
P(x = 0) + P(x = 1) + P(x = 2) + P(x = 3). But for P(x ≤ 4) you include the 4, so you
add P(x = 0) + P(x = 1) + P(x = 2) + P(x = 3) + P(x = 4).
Be careful about your endpoints when using Binomial Cdf. If you are finding P(X<5), then your
lower value will be 0 and your upper value will be 5. However, if you are finding P(X<5), then
your values will be 0 and 4.
If you need a range with a specific lower and upper value, then be sure to determine whether or
not to include those specific values. Generally, words such as ‘at least’, ‘at most’, ‘no greater/
more/fewer/less than’, ‘including’, and ‘inclusive’ indicate the inclusion of an upper or lower value,
while words such as ‘less than’, ‘more than’, ‘between’, ‘over/under’, and ‘between’ indicate the
omission of that value and the need for the next higher or lower value to be used.
19 of 70
Page 3 of 3
Be careful about the range you calculate. It may help to draw the range on a number
line. For example, if you want to calculate P(4 < x < 8) for B(10, .3), you want the
range represented by the underlined numbers:
0 1 2 3 4 5 6 7 8 9 10.
So you'd calculate:
binomcdf(10,.3,7) – binomcdf(10,.3,4). Note that this will give you the range from 5 to
7.
If you wanted to calculate P(4 ≤ x < 8), you include the 4 in your range:
0 1 2 3 4 5 6 7 8 9 10.
So you'd calculate:
binomcdf(10,.3,7) – binomcdf(10,.3,3). Note that now you're subtracting the 3 (since
it's included in binomcdf(10,.3,3)) but not the 4.
If you want P(4 ≤ x ≤ 8), you'd include the 8:
0 1 2 3 4 5 6 7 8 9 10.
You'd calculate:
binomcdf(10,.3,8) – binomcdf(10,.3,3).
There are cases where the criteria of the binomial setting might not be satisfied, but
which would still be considered binomial. An event is almost binomial if the population
size of an experiment is at least 20 times larger than the sample size. (NOTE: 20 is a
rule of thumb. Some references use 10 instead of 20.)
20 of 70
'LUHFWLRQV
1.2.4
• 8VHWKLVJXLGHWRWDNHQRWHVZKLOH\RXZDWFKWKHtutorial.
• .HHS\RXUFRPSOHWHGstudy guide in your notebook for later study.
.H\7HUPVDQG&RQFHSWV
• FRQWLQXLW\FRUUHFWLRQ
• QRUPDODSSUR[LPDWLRQWRWKHELQRPLDO
7XWRULDO6HFWLRQV
7KH5HODWLRQVKLS%HWZHHQWKH%LQRPLDO'LVWULEXWLRQDQGWKH1RUPDO
'LVWULEXWLRQ
8VLQJWKH1RUPDO$SSUR[LPDWLRQWR)LQG3UREDELOLWLHV
,QFUHDVH$FFXUDF\E\8VLQJWKH&RQWLQXLW\&RUUHFWLRQ
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
21 of 70
S2
Using the TI-83 or TI-84 to Calculate Binomial Distribution Probabilities
7R&DOFXODWHD%LQRPLDO3UREDELOLW\IRUD6SHFLILF9DOXHELQRPSGI
7KHSGIVWDQGVIRUSUREDELOLW\GHQVLW\IXQFWLRQ8VHELQRPSGIDQGHQWHUWKH
QXPEHUVDQGFRPPDVLQWKHVDPHRUGHUDVWKHSUREDELOLW\QRWDWLRQ)RUH[DPSOHWR
FDOFXODWH%
•
•
•
•
SUHVVQG >',675@
VFUROOGRZQWR>ELQRPSGI@DQGSUHVV(17(5
.H\LQ<RXUKRPHVFUHHQVKRXOGUHDGELQRPSGI
SUHVV(17(5DQG\RXVKRXOGJHW
7R&DOFXODWHD3UREDELOLW\IRUD5DQJHRI9DOXHVELQRPFGI
7KHFGIVWDQGVIRUFXPXODWLYHGHQVLW\IXQFWLRQ(QWHUWKHQXPEHUVDQGFRPPDV
LQWKHVDPHRUGHUDVWKHSUREDELOLW\QRWDWLRQ)RUH[DPSOHWRFDOFXODWH%
•
•
•
SUHVVQG >',675@
VFUROOGRZQWR>ELQRPFGI@DQGSUHVV(17(5
$VLQELQRPSGIHQWHUWKHQXPEHURIWULDOVWKHSUREDELOLW\IRUHDFKWULDODQGWKH
[YDOXH
)RUH[DPSOHIRU%3;”LVELQRPFGI
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
22 of 70
Using the TI-83 or TI-84 to Calculate Normal Distribution Probabilities
)LQGLQJ$1RUPDO&XUYH3UREDELOLW\ZLWKQRUPDOFGI
3UHVVQG >',675@7KLVVHOHFWVQRUPDOFGI
(QWHU>ORZHUERXQGXSSHUERXQGµDQGσ@)RUWKHELUGHJJH[DPSOHWKHORZHU
ERXQGLV]HURWKHXSSHUERXQGLVWKHPHDQLVDQGWKHVWDQGDUGGHYLDWLRQ
LV6RSUHVV
3UHVV(17(5WRVHHWKHDQVZHU
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
23 of 70
AP Statistics
Study Guide: The Normal Approximation to the Binomial
1.2.4
Directions


Use this guide to take notes while you watch the tutorial.
Keep your completed study guide in your notebook for later study.
Key Terms and Concepts


Page 1 of 3
continuity correction
normal approximation to the binomial
Tutorial Sections
The Relationship Between the Binomial Distribution and the Normal
Distribution
Using the Normal Approximation to Find Probabilities
Increase Accuracy by Using the Continuity Correction
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
24 of 70
S2
AP Sta
atistics
Study Guide: The Normal Approx
ximation
n to the B
Binomial
Page 2 of 3
Calculattor Instruc
ctions for the TI-89
These instructions assume
a
thatt you've read
d and follow
wed the dire
ections in the "Getting
ator manuall. The steps covered in this guide a
are also cov
vered
Started" chapter of your calcula
in the "S
Statistics" ch
hapter of your calculato
or manual.


Text in brack
kets, like thiis [HOME], gives
g
the na
ame of a bu
utton on the calculator.
Two sets of brackets
b
like
e [2nd][VAR
R-LINK] get you to a co
olor-coded ffunction abo
ove
each key.

The diamond
d button
and brack
kets get you
u to anotherr color-coded
d function
sh
hown above
e each key.
The white [A
ALPHA] butto
on followed by another bracket getts you a lettter.
Text in bold, like 3:Regr
ressions->
> is a comm and within a menu on tthe calculattor
sc
creen.


Using th
he TI-89 to
o Calculate
e Binomial Distributio
on Probabillities
To Calcu
ulate a Binomial Prob
bability for
r a Specific
c Value:
mial Pdf to calculate th
Pdf stand
ds for “prob
bability dens
sity function.” Use Binom
he probabilitty of
observing a specific number of successes in an exercisse that invo
olves a fixed
d number of
trials. Fo
or example, to calculate
e Binomial Pdf(20,.3,5),
P
, which is th
he probabilitty of seeing
g
exactly five
f
successes in 20 tria
als when the
e probability
y of successs is 0.3:
1.
2.
3.
4.
5.
6.
7.
Enter List Editor.
D
/ [B]: Binomial
B
Pdf....
Press [F5]: Distr
Enter 20 for Num Trials, n:.
0
for Prob Success, p:.
Enter .3 (or 0.3)
Enter 5 for X Value:
R] to store your
y
settings, then [EN TER] again to execute the command.
Press [ENTER
g .178863.
You should get
ulate a Binomial Prob
bability for
r a Range o
of Values:
To Calcu
Cdf stands for “cumulative dens
sity function
n.” Use Bino
omial CDF to
o calculate tthe probabillity
of observ
ving success
ses that ran
nge from a lower bound
d to an uppe
er bound. . F
For example
e, to
, which is th
calculate
e Binomial Cdf(20,.3,5)
C
he probabilitty of seeing
g between ze
ero and five
e
successe
es in 20 trials when the probability of success is 0.3:
1.
2.
3.
4.
5.
6.
Enter List Editor.
B
Cdff....
Press[F5]: Distr / [C]: Binomial
Enter 20 for Num Trials, n:.
0
for Prob Success, p:.
Enter .3 (or 0.3)
Enter 0 for Lower Value::.
U
Value::.
Enter 5 for Upper
_________
____________
___________
Copyright © 2011 Apex Learning
L
Inc. (See
(
Terms of Use at www.ap
pexvs.com/TerrmsOfUse)
25 of 70
AP Statistics
Study Guide: The Normal Approximation to the Binomial
Page 3 of 3
7. Press [ENTER] to store your settings, then [ENTER] again to execute the command.
8. You should get .416371.
If you want to calculate the probability of seeing between 5 and 10 successes under these
same conditions, simply change your Lower Value: to 5 and your Upper Value: to 10. Your
answer should be .745347
Using the TI-89 to Calculate Normal Distribution Probabilities
Finding a Normal Curve Probability
1. From List Editor, press [F5]: Distr / [4]:Normal Cdf....
2. You will be prompted for a Lower Bound, an Upper Bound, the mean ( μ ) and the
standard deviation ( σ ).
For the bird example, enter a Lower Bound of 0.
Enter an Upper Bound of 11.
Enter a mean of 12.
Enter a standard deviation of 0.8.
Press [ENTER] to save your settings, then press [ENTER] again to execute the
command.
8. Your answer should be .10565.
3.
4.
5.
6.
7.
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
26 of 70
6WDWLVWLFV
6WXG\6KHHW
%LQRPLDO3UREOHPV
3DJHRI
1.2.6 S2
7KH1RUPDO$SSUR[LPDWLRQWRWKH%LQRPLDO
,QWKLVDFWLYLW\ZH
OOORRNDWWZRNLQGVRIGLVWULEXWLRQQRUPDODQGELQRPLDO:KHQWKH
QXPEHURIVXFFHVVHVDQGIDLOXUHVIRUDJLYHQVLWXDWLRQDUHHDFKHTXDOWRRUJUHDWHUWKDQ
WKHELQRPLDOLVVDLGWRDSSUR[LPDWHWKHQRUPDOGLVWULEXWLRQ0DQ\WH[WERRNVXVH
UDWKHUWKDQEXWERWKDUHYDOLG,QRWKHUZRUGVWKHELQRPLDODSSUR[LPDWHVWKH
QRUPDOZKHQQS≥DQGQ±S≥1RWHWKDW±SRUWKHSUREDELOLW\RIIDLOXUHLV
RIWHQZULWWHQDVTVRQ±SFDQEHZULWWHQDVQT
,QWKHVHFDVHVWKHELQRPLDOUHVHPEOHVWKHQRUPDOVRFORVHO\WKDW\RXFDQXVH]VFRUHV
DQGDQRUPDOFXUYHWDEOHWRILQGDUHDVXQGHUWKHFXUYH$VPRVWVLWXDWLRQVLQLQIHUHQWLDO
VWDWLVWLFVPHHWWKHUHTXLUHPHQWVIRUWKHELQRPLDODSSUR[LPDWLRQUHVHDUFKHUVUDUHO\QHHG
WRILJXUHELQRPLDOSUREDELOLWLHVH[DFWO\WKH\XVXDOO\XVHWKHQRUPDODSSUR[LPDWLRQ
LQVWHDG
$ELQRPLDOGLVWULEXWLRQWKDWPHHWVWKHUHTXLUHPHQWVIRUWKHQRUPDODSSUR[LPDWLRQZLOO
KDYHDPHDQHTXDOWRQSWKHH[SHFWHGYDOXHDQGDVWDQGDUGGHYLDWLRQHTXDOWR
QS − S WKHVTXDUHURRWRIWKHH[SHFWHGYDOXHWLPHVWKHQXPEHURIIDLOXUHV
7KH&RQWLQXLW\&RUUHFWLRQ
7KHELQRPLDOGLVWULEXWLRQPRGHOVWKHGLVWULEXWLRQVRIGLVFUHWHUDQGRPYDULDEOHV6R
ZKHQ\RXIOLSDFRLQWLPHV\RXFDQKDYHKHDGVRUKHDGVEXWQRWKHDGV
7KHQRUPDOGLVWULEXWLRQRQWKHRWKHUKDQGPRGHOVWKHGLVWULEXWLRQVRIFRQWLQXRXV
UDQGRPYDULDEOHV5HFDOOWKDWDFRQWLQXRXVUDQGRPYDULDEOHFDQWDNHRQDQ\YDOXHZLWKLQ
DUDQJH)RUH[DPSOHWHPSHUDWXUHFDQEHGHJUHHVFHQWLJUDGHGHJUHHV
FHQWLJUDGHGHJUHHVFHQWLJUDGHRUDQ\WKLQJLQEHWZHHQ&RQWLQXRXV
YDULDEOHVKROGPHDVXUHGGDWDDQGGLVFUHWHYDULDEOHVKROGFRXQWHGGDWD
:KHQ\RXXVHWKHQRUPDODSSUR[LPDWLRQWRWKHELQRPLDO\RX
UHWUHDWLQJDGLVFUHWH
GLVWULEXWLRQDVWKRXJKLWZHUHFRQWLQXRXVDQGWKLVLQWURGXFHVHUURU,QRUGHUWRPLQLPL]H
HUURUZKHQXVLQJWKHDSSUR[LPDWLRQWRWKHELQRPLDOXVHWKHFRQWLQXLW\FRUUHFWLRQ7KH
FRQWLQXLW\FRUUHFWLRQDOORZV\RXWRWUHDWQXPEHUVDVWKRXJKWKH\RFFXSLHGDUDQJH
DERYHDQGEHORZWKHGLVFUHWHYDOXHLQFOXVLYH
+HUH
VWKHPRVWLPSRUWDQWWKLQJWRUHPHPEHUZKHQ\RX
UHXVLQJWKHFRQWLQXLW\
FRUUHFWLRQ:KHQ\RX
UHILQGLQJWKHSUREDELOLW\IRUDUDQJHRIYDOXHVLQDELQRPLDO
GLVWULEXWLRQDQG\RX
UHXVLQJWKHQRUPDODSSUR[LPDWLRQVXEWUDFWIURPWKHORZHU
ERXQGDQGDGGWRWKHXSSHUERXQG)RULQVWDQFHIRUWKHFORVHGLQWHUYDO
ZKHUHDQGDUHERWKLQFOXGHG\RXILQGWKHQRUPDOFXUYHSUREDELOLW\IRUWKHLQWHUYDO
,I\RXZDQWWKHSUREDELOLW\IRUWKHELQRPLDOLQWHUYDOZKHUHLV
LQFOXGHGDQGLVQ
W\RX
GILQGWKHSUREDELOLW\IRUWKHLQWHUYDOLQDQRUPDO
GLVWULEXWLRQ,I\RXZDQWWKHSUREDELOLW\IRUWKHLQWHUYDOZKHUHLVQ
WLQFOXGHG
DQGLV\RX
GILQGWKHQRUPDOSUREDELOLW\IRUWKHLQWHUYDO)LQDOO\LI\RX
ZDQWHGWKHSUREDELOLW\IRUZKHUHQHLWKHUQRUDUHLQFOXGHG\RX
GILQGWKH
QRUPDOSUREDELOLW\IRU
BBBBBBBBBBBBBBBBBB
‹&RS\ULJKW$3(;2QOLQH/HDUQLQJ,QF$OOULJKWVUHVHUYHG$GYDQFHG3ODFHPHQWDQG$3DUHUHJLVWHUHG
WUDGHPDUNVRIWKH&ROOHJH%RDUG7KLVPDWHULDOLVLQWHQGHGIRUWKHH[FOXVLYHXVHRI$3(;HQUROOHGVWXGHQWV
$Q\XQDXWKRUL]HGFRS\LQJUHXVHRUUHGLVWULEXWLRQLVSURKLELWHG
27 of 70
1.2.7
28 of 70
S2
29 of 70
30 of 70
31 of 70
32 of 70
33 of 70
34 of 70
1.3
Key Terms
average waiting time
geometcdf on a graphing calculator
geometpdf on a graphing calculator
geometric distribution
35 of 70
S2
AP Statistics
Study: Geometic Probability Distributions
1.3.2
'LUHFWLRQV
• 8VHWKLVJXLGHWRWDNHQRWHVZKLOH\RXZDWFKWKH7XWRULDO
• .HHS\RXUFRPSOHWHG6WXG\*XLGHLQ\RXUQRWHERRNIRUODWHUVWXG\
.H\7HUPVDQG&RQFHSWV
• DYHUDJHZDLWLQJWLPH
• JHRPHWFGI
• JHRPHWSGI
Page 1 of 3
• JHRPHWULFGLVWULEXWLRQ
• JHRPHWULFVHWWLQJ
7XWRULDO6HFWLRQV
,QWURGXFWLRQWR*HRPHWULF3UREDELOLW\'LVWULEXWLRQV
3UREDELOLW\&DOFXODWLRQVLQWKH*HRPHWULF6HWWLQJ
$YHUDJH:DLWLQJ7LPH
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
36 of 70
S2
AP Statistics
Study: Geometic Probability Distributions
Page 2 of 3
TI-83 and TI-84: Finding a 6LQJOH Probability Value for a Geometric Distribution
7RGRWKLVXVHJHRPHWSGIZKLFKVWDQGVIRUJHRPHWULFSUREDELOLW\GHQVLW\IXQFWLRQ
•
•
3UHVVQG >',675@$/3+$>'@7KLVZLOOSXWWKHFXUVRURQJHRPHWSGI,WZLOODOVRZRUNWR
VFUROOGRZQWKHOLVWPDQXDOO\SXWWKHFXUVRURQJHRPHWSGIWKHQSUHVV(17(5
(QWHU>SUREDELOLW\RIVXFFHVVRQHDFKWULDOS@>@>QXPEHURIWULDOVQ@WKHQFORVHWKH
SDUHQWKHVLV
)RUH[DPSOHLI\RXZDQWWRILQGWKHSUREDELOLW\RIJHWWLQJVXFFHVVLQIRXUWULDOVLIWKH
SUREDELOLW\RIVXFFHVVRQHDFKWULDOLV
(QWHU ÷ 7KLVWHOOVWKHFDOFXODWRUWKDWLQHDFKWULDOWKHUH
VDSUREDELOLW\RI
VXFFHVVRIDQGWKDWWKHUHDUHWULDOV7KHQSUHVV(17(5DQG\RXVKRXOGJHW
VRPHWKLQJFRQVLVWHQWZLWKQXPEHURIGLJLWVDQGURXQGLQJZLOOYDU\GHSHQGLQJRQ
KRZ\RXUFDOFXODWRULVVHW
TI-83 and TI-84: Finding a 5DQJH of Probability Values for a Geometric Distribution
7RGRWKLVXVHJHRPHWFGIZKLFKVWDQGVIRUJHRPHWULFSUREDELOLW\GHQVLW\IXQFWLRQ,W
ILJXUHVWKHSUREDELOLW\IURPWULDOVXSWRDQGLQFOXGLQJQWULDOV
•
•
3UHVVQG >',675@$/3+$>(@7KLVZLOOVHOHFWJHRPHWFGI,WZLOODOVRZRUNWRVFUROOGRZQ
WKHOLVWPDQXDOO\SXWWKHFXUVRURQJHRPHWFGIWKHQSUHVV(17(5
(QWHU>SUREDELOLW\RIVXFFHVVRQHDFKWULDOS@>@>WKHXSSHUERXQGIRUQXPEHURIWULDOV
Q@WKHQFORVHWKHSDUHQWKHVLV
)RUH[DPSOHLI\RXZDQWWRILQGWKHSUREDELOLW\RIVXFFHVVLQWKUHHRUIHZHUWULDOVLIWKH
SUREDELOLW\RIVXFFHVVRQHDFKWULDOLV
(QWHU ÷ 7KLVWHOOVWKHFDOFXODWRUWKDWLQHDFKWULDOWKHUH
VDSUREDELOLW\RI
DQGWKDWWKHUHDUHWULDOV7KHQSUHVV(17(5DQG\RXVKRXOGJHWQXPEHURIGLJLWV
DQGURXQGLQJZLOOYDU\GHSHQGLQJRQKRZ\RXUFDOFXODWRULVVHW
TI-83 and TI-84: Generating Random Numbers Using randInt
•
•
•
•
3UHVV0$7+
6FUROODFURVVWR35%
6FUROOGRZQWRUDQG,QWDQGSUHVV(17(5
(QWHUWKH>PD[LPXPQXPEHU@DQG>PLPLPXPQXPEHU@DQGFORVHWKHSDUHQWKHVHV
7KHQSUHVV(17(5HDFKWLPH\RXZDQWDUDQGRPQXPEHU
)RUH[DPSOHLI\RXZDQWWRJHQHUDWHUDQGRPQXPEHUVEHWZHHQDQGLQFOXVLYHHQWHU
DQGSUHVV(17(5HDFKWLPH\RXZDQWDQXPEHU
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
37 of 70
AP Statistics
Study: Geometric Probability Distributions
Page 3 of 3
TI-89: Finding a Single Probability Value for a Geometric Distribution
To do this, use Geometric Pdf….
1. Enter List Editor.
2. Press [F5]:Distr / [F]: Geometric Pdf….
3. You will be prompted for the probability of success (Prob Success, p:) and the trial
on which you want the first success to occur (X Value:).
4. For example, if you were rolling a die and wanted to roll a 6, P(roll a 6) = 1/6, and
you wanted to find the probability of getting your first 6 on your fourth roll, you
would enter 1/6 for Prob Success, p, and 4 for X Value. Press [ENTER] to save your
settings, then press [ENTER] again to execute the command.
5. Your answer should be .096451.
TI-89: Finding a Range of Probability Values for a Geometric Distribution
To do this, use Geometric Cdf.
1. Enter List Editor.
2. Press [F5]:Distr / [G]: Geometric Cdf….
3. To find the probability of rolling your first 6 within the first four rolls, enter 0 for your
Lower Value and 4 for your Upper Value. This will calculate the probability of rolling
your first 6 on the first, second, third, or fourth roll.
4. Press [ENTER] to save your settings then press [ENTER] again to execute the
command.
5. Your answer should be .517747.
If you want to find the probability of rolling your first 6 between the third and fifth rolls,
enter 4 for your Lower Value and 5 for your Upper Value. Your answer should be .292567.
TI-89: Generating Random Numbers
1. From the home screen, press [2nd][5] and scroll to 7:Probability [ENTER].
2. Select 4:rand( [ENTER].
3. This will return you to the command line, where you will enter the upper value for
your random integer. Close the parentheses and press [ENTER].
4. The screen will display a random number between 1 and the number you
entered. Press [ENTER] again to generate additional random numbers.
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
38 of 70
AP Statistics
Study: Geometric Distribution Problems
Page 1 of 2
1.3.4
S2
$V\RXPD\UHFDOOWKHELQRPLDOVHWWLQJKDVIRXUFRQGLWLRQV
7KHWULDOVDUHLQGHSHQGHQW
(DFKWULDOKDVMXVWWZRSRVVLEOHRXWFRPHVVXFFHVVDQGIDLOXUH
7KHSUREDELOLW\RIVXFFHVVLVWKHVDPHIRUHDFKWULDOUHIHUUHGWRDVS
7KHQXPEHURIWULDOVLVIL[HGXVXDOO\UHIHUUHGWRDVQ
,QWKHELQRPLDOVHWWLQJWKHUDQGRPYDULDEOH;LVWKHQXPEHURIVXFFHVVHVWKDWRFFXULQQ
WULDOV
7KHJHRPHWULFVHWWLQJLVVLPLODUWRWKHELQRPLDOVHWWLQJ7KUHHRIWKHFRQGLWLRQVDUHWKH
VDPHWKHWULDOVDUHLQGHSHQGHQWHDFKWULDOKDVMXVWWZRRXWFRPHVDQGWKHSUREDELOLW\
RIVXFFHVVLVWKHVDPHIRUHDFKWULDO,QWKHJHRPHWULFVLWXDWLRQKRZHYHUWKHUDQGRP
YDULDEOH;LVWKHQXPEHURIWULDOVUHTXLUHGWRJHWWKHILUVWVXFFHVV
$UDQGRPYDULDEOH;KDVDJHRPHWULFGLVWULEXWLRQLIHDFKWULDOLVLQGHSHQGHQWDQGKDV
SUREDELOLW\RIVXFFHVVS;LVWKHQXPEHURIWULDOVUHTXLUHGWRJHWWKHILUVWVXFFHVV;FDQ
WDNHDQ\LQWHJHUYDOXHIURPRQHWRLQILQLW\DQGLVDGLVFUHWHUDQGRPYDULDEOH
7KHIRUPXODIRUFDOFXODWLQJDJHRPHWULFSUREDELOLW\IRUDJLYHQQXPEHURIWULDOVLV
3 ; = Q = ( − S)Q − S ZKHUHQLVWKHQXPEHURIWULDOVDQGSLVWKHSUREDELOLW\RI
VXFFHVVIRUHDFKWULDO
127(0DQ\WH[WERRNVZULWHWKHSUREDELOLW\RIIDLOXUHRU±SDVT6R\RXPD\VHH
WKHIRUPXODZULWWHQDV 3 ; = Q = T Q − (S) 7KHSUREDELOLW\\RX
OOJHWDVXFFHVVRQRUEHIRUHWKHQWKWULDOLVVLPSO\WKHVXPRIDOO
SUREDELOLWLHVIURPRQHWRQ)RULQVWDQFH3[≤ 3[ 3[ 1RWLFHWKDWZH
GRQ
WLQFOXGH3[ EHFDXVH\RXFDQQHYHUJHWDVXFFHVVZLWKRXWGRLQJDQ\WULDOV
2QDgraphingFDOFXODWRUWKH*HRPHWULF3UREDELOLW\'HQVLW\)XQFWLRQJHRPHWSGI
FDOFXODWHVWKHSUREDELOLW\RIWKHILUVWVXFFHVVRQWKHQWKWULDO
7KH*HRPHWULF&XPXODWLYH'HQVLW\)XQFWLRQJHRPHWFGIFDOFXODWHVWKHSUREDELOLW\RI
JHWWLQJWKHILUVWVXFFHVVRQRUEHIRUHWKHQWKWULDO,WVXPVWKHSUREDELOLWLHVIRUHDFKRI
WKHHYHQWV3ILUVWVXFFHVVRQILUVWWULDO3ILUVWVXFFHVVRQVHFRQGWULDO3ILUVW
VXFFHVVRQWKLUGWULDO«3ILUVWVXFFHVVRQ[WKWULDO
*HRPHWULFGLVWULEXWLRQVDUHDOZD\VVNHZHGWRWKHULJKWEHFDXVHWKHUH
VDVKDUSOLPLWRQ
RQHHQG\RXFDQ
WKDYHIHZHUWKDQQ DQGDORQJWDLORIGLPLQLVKLQJSUREDELOLWLHVRQ
WKHRWKHUHQG
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
39 of 70
AP Statistics
Study: Geometric Distribution Problems
Page 2 of 2
7KHH[SHFWHGYDOXHRUDYHUDJHYDOXHLQDJHRPHWULFGLVWULEXWLRQLVFDOOHGWKHDYHUDJH
ZDLWLQJWLPH,W
VDOZD\VS6RLIWKHSUREDELOLW\RIJHWWLQJDIRXUZKHQUROOLQJDVL[
VLGHGGLHLVWKHDYHUDJHZDLWLQJWLPHZRXOGEHWULDOV,WWHOOV\RXWKDWRQDYHUDJH
\RX
OOKDYHWRUROODGLHVL[WLPHVEHIRUHJHWWLQJ\RXUILUVWIRXU7KLVPD\VHHPFRQIXVLQJ
ZKHQ\RXFRQVLGHUWKDWLQDJHRPHWULFSUREDELOLW\GLVWULEXWLRQWKHKLJKHVWIUHTXHQF\LV
DWQ n=1
2
3
4
5
6
7
8
9
10
%XWUHFDOOWKDWDQH[SHFWHGYDOXHWDNHVLQWRDFFRXQWDOOSRVVLEOHRXWFRPHV7KHPRVW
FRPPRQRXWFRPHPD\EHQ EXWWKHUHDUHSOHQW\RIRWKHUWLPHVZKHUHWKHILUVW
VXFFHVVFRPHVDWQ RUPRUH7KHVHKLJKHUYDOXHVSXOOWKHDYHUDJHXSIURPRQHVR
WKDWRQDYHUDJH\RXFDQH[SHFWWRUROOWKHGLHVL[WLPHVEHIRUHJHWWLQJ\RXUILUVW
VXFFHVV
______________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
40 of 70
1.4
Key Terms
central limit theorem
(distribution of)
mean of the sampling distribution of a sample mean
mean of the sampling distribution of a sample proportion
sample proportions
sampling distribution
sampling distribution of a sample mean
standard deviation of the sampling distribution of a sample mean
standard deviation of the sampling distribution of a sample proportion
41 of 70
S2
6WDWLVWLFV6WXG\*XLGH
6DPSOLQJ'LVWULEXWLRQVDQGWKH&HQWUDO/LPLW7KHRUHP
'LUHFWLRQV
3DJHRI
1.4.2
• 8VHWKLVJXLGHWRWDNHQRWHVZKLOH\RXZDWFKWKH7XWRULDO
• .HHS\RXUFRPSOHWHG6WXG\*XLGHLQ\RXUQRWHERRNIRUODWHUVWXG\
.H\7HUPVDQG&RQFHSWV
•
•
•
•
&HQWUDO/LPLW7KHRUHP
PHDQRIWKHVDPSOLQJGLVWULEXWLRQRID
VDPSOHPHDQ
VDPSOLQJGLVWULEXWLRQ
•
VDPSOLQJGLVWULEXWLRQRIDVDPSOH
PHDQ
VWDQGDUGGHYLDWLRQRIWKHVDPSOLQJ
GLVWULEXWLRQRIDVDPSOHPHDQ
7XWRULDO6HFWLRQ
7KH0HDQLQJRID6DPSOLQJ'LVWULEXWLRQ
7KH6DPSOLQJ'LVWULEXWLRQRID6DPSOH0HDQ
7KH&HQWUDO/LPLW7KHRUHP
BBBBBBBBBBBBBBBBBBB
‹&RS\ULJKW$3(;2QOLQH/HDUQLQJ,QF$OOULJKWVUHVHUYHG$GYDQFHG3ODFHPHQWDQG$3DUHUHJLVWHUHG
WUDGHPDUNVRIWKH&ROOHJH%RDUG7KLVPDWHULDOLVLQWHQGHGIRUWKHH[FOXVLYHXVHRI$3(;HQUROOHGVWXGHQWV$Q\
XQDXWKRUL]HGFRS\LQJUHXVHRUUHGLVWULEXWLRQLVSURKLELWHG
42 of 70
S2
1.4.2 S2
AP Statistics
Page 1 of 4
Study Guide: Sampling Distributions and the Central Limit Theorem
Calculator Instructions for the TI-89
These instructions assume that you've read and followed the directions in the "Getting
Started" chapter of your calculator manual. The steps covered in this guide are also covered
in the "Statistics" chapter of your calculator manual.


Text in brackets, like this [HOME], gives the name of a button on the calculator.
Two sets of brackets like [2nd][VAR-LINK] get you to a color-coded function above
each key.

The diamond button
and brackets get you to another color-coded function shown
above each key.
The white [ALPHA] button followed by another bracket gets you a letter.
Text in bold, like 3:Regressions-> is a command within a menu on the calculator
screen.


Section 1: Find the mean of a sampling distribution.
Step 1. Clear a list, generate a sample, and store it in list1.
1. Enter List Editor.
2. Clear List 1.
3. Press[2nd][5] / 3:List / 1:seq( [ENTER] [2nd][5] 7:Probability 5:randNorm(20,3),
x,1,10) [ENTER] [ENTER].
4. This generates 10 random numbers from a normal distribution with mean 20 and
standard deviation 3. These numbers are automatically stored in list1.
Step 2. Find the mean of the sample and store the mean value into a variable.
1. Press [HOME][CATALOG] [ALPHA] M, scroll to mean( , and press [ENTER]. Type
the name list1 followed by [)]. [ENTER]. You will see the mean of list1 on the
screen.
2. If the command mean(list1) is not highlighted, press[ENTER] to highlight it.
Press your right arrow, and your cursor will be placed to the right of your
command.
3. Press [STO->]. This places an arrow after your formula, where you can enter a
name for the value of the mean of list1. Type a name there (do not use the
calculator domain variables like s, y, z, etc., but enter something like a by
pressing [ALPHA] a [ENTER]). The mean of list1 is now stored in the variable a.
Step 3. Generate nine more random samples. Find and store their means in
additional variables.
1. Repeat Steps 1 and 2 to generate additional lists of random values, calculate the
new mean of each list, and store each mean in a new variable. You may continue
to use list1 to store the list of random values, but be sure to store each new
mean before creating the next list of values.
2. For each random sample, store the mean in a new variable. Using the letters a,
b, c, d, e, . . . should help you remember where you put them and make it easy
to retrieve them.
__________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
TI-89 screens are used with the permission of the publisher. Copyright © 2011, Texas Instruments, Incorporated.
43 of 70
AP Statistics
Page 2 of 4
Study Guide: Sampling Distributions and the Central Limit Theorem
3. Continue creating random samples and storing their means in variables until you
have 10 random sample means. The table below shows each random sample and
the variable letter in which its mean could be stored.
Random Sample
Variable
1
a
2
b
3
c
4
d
5
e
6
f
7
g
8
h
9
i
10
j
Step 4: Find the mean of the sampling distribution.
Now that you’ve stored the means of your 10 samples, you need to find the
mean of the sampling distribution.
1. Enter List Editor.
2. Clear list2.
On line 1 of list2 (list2[1]), enter [ALPHA] a to put the first mean (stored in
variable a) in this cell. Then do the same to put the rest of the letters that have
the stored means (b, c, d, etc.) on each line of list2.
3. Press [HOME] and enter mean(list2). Press [ENTER].
4. This will be the mean of your sampling distribution.
Section 2: For a population with N(50,5), what proportion of the sample means of
size 36 would be expect to be less than 51?
Step 1: Solve for P( x < 51) for N(50, (5/√36)).
1. From List Editor, press [F5]:Distr / 4:Normal Cdf….
2. Enter a Lower Value of  , an Upper Value of 51, a mean of 50, and a standard
deviation of 5/√(36).
3. Press [ENTER] to save your settings, then [ENTER] again to execute the
command.
4. You will see the answer to your question, .88493.
Step 2: Graph P( x < 51) for N(50, .83).
1. Press
[F1] and uncheck any plots or functions you have active.
__________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
TI-89 screens are used with the permission of the publisher. Copyright © 2011, Texas Instruments, Incorporated.
44 of 70
AP Statistics
Page 3 of 4
Study Guide: Sampling Distributions and the Central Limit Theorem
2. Enter List Editor.
3. Press [F5]:Distr / 1:Shade  1: Shade Normal… [ENTER]. Enter a Lower Value of
 .
4. Enter an Upper Value of 51.
5. Enter a mean of 50 and a standard deviation of 5/6.
6. Make sure the Auto-scale setting is on YES.
7. Press [ENTER] to save your settings, then [ENTER] again to execute.
8. You should see the graph shown below.
__________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
TI-89 screens are used with the permission of the publisher. Copyright © 2011, Texas Instruments, Incorporated.
45 of 70
AP Statistics
Page 4 of 4
Study Guide: Sampling Distributions and the Central Limit Theorem
Directions


Use this guide to take notes while you watch the tutorial.
Keep your completed study guide in your notebook for later study.
Key Terms and Concepts



sampling distribution of a sample mean
• sampling distribution
mean of the sampling distribution of a sample mean
• Central Limit Theorem
standard deviation of the sampling distribution of a sample mean
Tutorial Sections
The Meaning of a Sampling Distribution
The Sampling Distribution of a Sample Mean
The Central Limit Theorem
__________________________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
TI-89 screens are used with the permission of the publisher. Copyright © 2011, Texas Instruments, Incorporated.
46 of 70
6WDWLVWLFV,QGHSHQGHQW6WXG\
6WXG\6KHHW
6DPSOLQJ'LVWULEXWLRQV
3DJHRI
1.4.4
7KH0HDQLQJRID6DPSOLQJ'LVWULEXWLRQ
,PDJLQHWDNLQJKXQGUHGVRIVDPSOHVRIVL]HQIURPDSRSXODWLRQ,I\RXWDNHWKHPHDQRI
HDFKVDPSOH\RX
OOKDYHDGLVWULEXWLRQRIVDPSOHPHDQV,I\RXFRXOGVRPHKRZWDNH
HYHU\SRVVLEOHVDPSOHIURP\RXUSRSXODWLRQ\RX
GJHWDGLVWULEXWLRQRIHYHU\SRVVLEOH
VDPSOHPHDQ7KLVLVFDOOHGWKHVDPSOLQJGLVWULEXWLRQIRUWKHVWDWLVWLF
,I\RXURULJLQDOSRSXODWLRQKDVPHDQµDQGVWDQGDUGGHYLDWLRQσWKHVDPSOLQJ
GLVWULEXWLRQRIPHDQVDOVRFDOOHGWKHGLVWULEXWLRQRI [ ZLOOKDYHPHDQ µ [ PXVXE[
EDUDQG σ [ VLJPDVXE[EDUZKLFKDUHFDOFXODWHGDV
µ[
µ
σ[ =
σ
Q
1RWHWKDWWKHIRUPXODIRUWKHVWDQGDUGGHYLDWLRQDVVXPHV\RXNQRZσWKHSRSXODWLRQ
VWDQGDUGGHYLDWLRQ,QUHDOOLIHWKLVLVXQUHDOLVWLFDQGDV\RXFRQWLQXHWRVWXG\VWDWLVWLFV
\RX
OOOHDUQZKDWWRGRZKHQ\RXGRQ
WNQRZσ
$OVRQRWHWKDWLQDVDPSOLQJGLVWULEXWLRQDOOVDPSOHVDUHWKHVDPHVL]HWKHVDPSOLQJ
GLVWULEXWLRQRI[EDULVWKHGLVWULEXWLRQRIDOO[EDUVZLWKWKHVDPHVL]HGQ
7KH6KDSHRIWKH6DPSOLQJ'LVWULEXWLRQ
7KHVKDSHRIWKHVDPSOLQJGLVWULEXWLRQGHSHQGVERWKRQWKHVDPSOHVL]HDQGRQWKH
VKDSHRIWKHSDUHQWSRSXODWLRQ,IWKHRULJLQDOSRSXODWLRQLVQRUPDOWKHVDPSOLQJ
GLVWULEXWLRQZLOODOVREHQRUPDO,IWKHRULJLQDOSRSXODWLRQLVQRQQRUPDOWKHQWKHVKDSH
RIWKHVDPSOLQJGLVWULEXWLRQGHSHQGVRQWKHVDPSOHVL]H,QWKHVHFDVHVWKHVDPSOLQJ
GLVWULEXWLRQZLOOKDYHDVLPLODUVKDSHWRWKHSDUHQWSRSXODWLRQIRUVPDOOQDQGEHFRPH
DSSUR[LPDWHO\QRUPDOIRUODUJHQ,QPRVWFDVHVODUJHLVGHILQHGDVQ!2XWOLHUVRU
H[WUHPHVNHZQHVVZLOOUHTXLUHODUJHUYDOXHVRIQ
7KH&HQWUDO/LPLW7KHRUHP
7KHEHKDYLRURIVDPSOLQJGLVWULEXWLRQVLVVXPPDUL]HGE\WKH&HQWUDO/LPLW7KHRUHP
ZKLFKVWDWHV
µ [ µUHJDUGOHVVRIWKHVDPSOHVL]HRUWKHVKDSHRIWKHRULJLQDOGLVWULEXWLRQ
σ [ =
σ
Q
UHJDUGOHVVRIWKHVDPSOHVL]HQRUWKHVKDSHRIWKHRULJLQDOGLVWULEXWLRQ
,IWKHRULJLQDOGLVWULEXWLRQLVQRUPDOWKHVKDSHRIWKHVDPSOLQJGLVWULEXWLRQRIWKH
VDPSOHPHDQZLOOEHQRUPDOIRUDQ\Q
47 of 70
S2
6WDWLVWLFV,QGHSHQGHQW6WXG\
6WXG\6KHHW
6DPSOLQJ'LVWULEXWLRQV
3DJHRI
)RUVPDOOVDPSOHVL]HVLIWKHRULJLQDOGLVWULEXWLRQLVQRQQRUPDOWKHVKDSHRIWKH
VDPSOLQJGLVWULEXWLRQRIWKHVDPSOHPHDQZLOOEHVLPLODUWRWKHVKDSHRIWKHRULJLQDO
SRSXODWLRQ7KDWLVLIWKHRULJLQDOSRSXODWLRQZDVVNHZHGWRWKHULJKWWKHVDPSOLQJ
GLVWULEXWLRQRIWKHVDPSOHPHDQIRUVPDOOQZLOODOVREHVNHZHGWRWKHULJKWDOWKRXJK
OHVVVRWKDQWKHRULJLQDOSRSXODWLRQ
,IWKHVDPSOHVL]HLVODUJHWKHVDPSOLQJGLVWULEXWLRQRIWKHVDPSOHPHDQZLOOEH
DSSUR[LPDWHO\QRUPDOPDQ\WH[WVGHILQHODUJHDVQ!DOWKRXJKWKHDFWXDOYDOXH
ZLOOGHSHQGRQWKHVKDSHRIWKHRULJLQDOSRSXODWLRQ
6DPSOH3UREDELOLWLHV
6LQFHDVDPSOLQJGLVWULEXWLRQIRUODUJHQLVQRUPDO\RXFDQXVHQRUPDOSUREDELOLWLHVWR
ILQGWKHSUREDELOLWLHVIRUVDPSOHYDOXHV)RUH[DPSOH
$GLVWULEXWLRQRI [ KDVPHDQDQGVWDQGDUGGHYLDWLRQ:KDW
VWKHSUREDELOLW\
RIGUDZLQJDVDPSOHZLWKDPHDQRIOHVVWKDQ"
7RDQVZHUWKHTXHVWLRQILQGWKH]VFRUHIRUDVDPSOHPHDQRIDQGILQGWKH
SUREDELOLW\IRUYDOXHVEHORZWKDW]VFRUH
]
− ±
3]± 48 of 70
6WDWLVWLFV,QGHSHQGHQW6WXG\
6WXG\6KHHW
6DPSOLQJ'LVWULEXWLRQV
3DJHRI
6WXG\4XHVWLRQV
,Q\RXURZQZRUGVGHILQHWKHWHUPVDPSOLQJGLVWULEXWLRQ
,Q\RXURZQZRUGVGHVFULEHWKHILYHPDLQSRLQWVRIWKH&HQWUDO/LPLW
7KHRUHPDVGHILQHGDERYH
-HIILVDQHJJSODQWLQVSHFWRUIRU&HQWUDO0LFKLJDQ&RXQW\%DVHGRQKLV\HDUV
H[SHULHQFHRQWKHMRE-HIIDVVXPHVWKHPHDQZHLJKWRIWKHHJJSODQWSRSXODWLRQLV
JUDPVDQGWKHVWDQGDUGGHYLDWLRQLVJUDPV
$ $VVXPLQJWKDW-HII
VDVVXPSWLRQLVFRUUHFWLW
VDGHEDWDEOHDVVXPSWLRQEXW
JRZLWKLWDQ\ZD\IRUQRZZKDWDUHWKHPHDQDQGWKHVWDQGDUGGHYLDWLRQ
RIWKHVDPSOLQJGLVWULEXWLRQIRUVDPSOHVRIVL]HQ "
% $VVXPLQJ-HII
VDVVXPSWLRQLVFRUUHFWZKDWDUHWKHPHDQDQGWKHVWDQGDUG
GHYLDWLRQRIWKHVDPSOLQJGLVWULEXWLRQIRUVDPSOHVRIVL]HQ "
:HGUDZVDPSOHVRIVL]HIURPDGLVWULEXWLRQWKDW
VVNHZHGVWURQJO\WRWKHULJKW
7KHPHDQRIWKHSRSXODWLRQGLVWULEXWLRQLVDQGWKHVWDQGDUGGHYLDWLRQLV
$ :KDWDUHWKHPHDQDQGWKHVWDQGDUGGHYLDWLRQRIWKHVDPSOLQJGLVWULEXWLRQ"
% :KDW
VWKHVKDSHRIWKHVDPSOLQJGLVWULEXWLRQ"
:HGUDZVDPSOHVRIVL]HIURPDQRUPDOGLVWULEXWLRQ7KHPHDQRIWKHGLVWULEXWLRQLV
DQGWKHVWDQGDUGGHYLDWLRQLV
$ :KDWDUHWKHPHDQDQGVWDQGDUGGHYLDWLRQRIWKHVDPSOLQJGLVWULEXWLRQ"
% :KDW
VWKHVKDSHRIWKHVDPSOLQJGLVWULEXWLRQ"
& )RUWKLVSRSXODWLRQZKDW
VWKHSUREDELOLW\RIJHWWLQJDVDPSOHRIVL]HZLWK
DPHDQOHVVWKDQ"
:HGUDZVDPSOHVRIVL]HIURPDGLVWULEXWLRQWKDW
VELPRGDO7KHPHDQRIWKH
GLVWULEXWLRQLVDQGWKHVWDQGDUGGHYLDWLRQLV
$ :KDWDUHWKHPHDQDQGWKHVWDQGDUGGHYLDWLRQRIWKHVDPSOLQJGLVWULEXWLRQ"
% :KDW
VWKHVKDSHRIWKHVDPSOLQJGLVWULEXWLRQ"
& )RUWKLVSRSXODWLRQZKDW
VWKHSUREDELOLW\RIJHWWLQJDVDPSOHRIVL]H
ZLWKDPHDQOHVVWKDQ"
49 of 70
6WDWLVWLFV,QGHSHQGHQW6WXG\
6WXG\6KHHW
6DPSOLQJ'LVWULEXWLRQV
3DJHRI
$VWKHVDPSOHVL]HJHWVODUJHUZKDWKDSSHQVWRWKHVWDQGDUGGHYLDWLRQRI
WKHVDPSOLQJGLVWULEXWLRQ":K\"
BBBBBBBBBBBBBBBBBB
‹&RS\ULJKW$3(;2QOLQH/HDUQLQJ,QF$OOULJKWVUHVHUYHG$GYDQFHG3ODFHPHQWDQG$3DUHUHJLVWHUHG
WUDGHPDUNVRIWKH&ROOHJH%RDUG7KLVPDWHULDOLVLQWHQGHGIRUWKHH[FOXVLYHXVHRI$3(;HQUROOHGVWXGHQWV
$Q\XQDXWKRUL]HGFRS\LQJUHXVHRUUHGLVWULEXWLRQLVSURKLELWHG
50 of 70
Page 1 of 3
1.4.5
S2
1. Consider a parent population with mean 75 and a standard deviation 7. The population
doesn't appear to have extreme skewness or outliers.
A. What are the mean and standard deviation of the distribution of sample means
for n = 40? (2 points)
B. What's the shape of the distribution? Explain your answer in terms of the
Central Limit Theorem. (1 point)
C. What proportion of the sample means of size 40 would you expect to be 77 or
less? If you use your calculator, show what you entered. (1 point)
D. Draw a sketch of the probability you found in part C. (1 point)
2. Suppose over several years of offering AP Statistics, a high school finds that final exam
scores are normally distributed with a mean of 78 and a standard deviation of 6.
A. What are the mean, standard deviation, and shape of the distribution of x-bar
for n = 50? (2 points)
B. What's the probability a sample of scores will have a mean greater than 80?
(1 point)
C. Sketch the distribution curve for part B, showing the area that represents the
probability you found. (1 point)
3. Suppose college faculty members with the rank of professor at two-year institutions earn
an average of $52,500 per year with a standard deviation of $4,000. In an attempt to
verify this salary level, a random sample of 60 professors was selected from a personnel
database for all two-year institutions in the United States.
A. What are the mean and standard deviation of the sampling distribution for
n = 60? (2 points)
B. What's the shape of the sampling distribution for n = 60? (1 point)
C. Calculate the probability the sample mean x-bar is greater than $55,000.
(1 point)
D. If you drew a random sample with a mean of $55,000, would you consider this
sample unusual? What conclusions might you draw? (1 point)
51 of 70
Page 2 of 3
4. A manufacturer of paper used for packaging requires a minimum strength of 20 pounds
per square inch. To check the quality of the paper, a random sample of ten pieces of
paper is selected each hour from the previous hour's production and a strength
measurement is recorded for each. The distribution of strengths is known to be normal
and the standard deviation, computed from many samples, is known to equal 2 pounds
per square inch. The mean is known to be 21 pounds per square inch.
A. What are the mean and standard deviation of the sampling distribution for
n = 10? (2 points)
B. What's the shape of the sampling distribution for n = 10? (1 point)
C. Draw a random sample of size 10 from this distribution and compute the mean
of the sample, using the randNorm function on your calculator. On a TI-83/84
store the values in L1: MATH [PRB] randNorm(21,2,10) STO [L1]. On a TI-89
Store the values in list1: seq(randNorm(21,2),x,1,10,1) -> list1.
On the TI-83/84 you can then get the mean of the values by doing 1–Var stats
on the values in L1. Then Press STAT, arrow to calc, select 1-Var stats, Press
ENTER, then 2nd [L1] ENTER.
On the TI-89 you can then get the mean of the values by doing 1-Var Stats on
the values in list1. Enter List Editor, then press F4: 1:1-Var Stats.... Enter list1
as your list and press ENTER twice to do the calculation.
Write down all your sample values. What are the mean and sample standard
deviation of this sample? Compare them with the mean and standard deviation
of the parent population, and explain why they're different or the same.
(2 points)
D. Using the mean and standard deviation of the sampling distribution (from part
A), calculate the probability of getting a sample mean less than what you got.
If you use a calculator, show what you entered. (2 points)
5. Suppose a random sample of n = 25 observations is selected from a normal population,
with mean 106 and standard deviation 12.
A. Find the probability that x-bar exceeds 110. Show the mean and standard
deviation you used. (2 points)
B. Find the probability that the sample mean deviates from the population mean
= 106 by no more than 4. (1 point)
C. Sketch the probability distribution from part B, showing the area that
represents the probability you found. (1 point)
52 of 70
Page 3 of 3
Acknowledgements
Question 3:
This question is based on question 7.24 (c & d) from page 261 of Introduction to Probability and Statistics, Tenth
Edition, by W. Mendenhall, R. Beaver, and B. Beaver. Copyright © 1999 by Brooks Cole, division of Thompson
Learning Incorporated. Further reproduction is prohibited without permission of the publisher.
Question 4:
This question is based on question 7.26 from page 262 of Introduction to Probability and Statistics, Tenth Edition,
by W. Mendenhall, R. Beaver, and B. Beaver. Copyright © 1999 by Brooks Cole, division of Thompson Learning
Incorporated. Further reproduction is prohibited without permission of the publisher.
Question 4:
This question is based on question 7.20 from page 261 of Introduction to Probability and Statistics, Tenth Edition,
by W. Mendenhall, R. Beaver, and B. Beaver. Copyright © 1999 by Brooks Cole, division of Thompson Learning
Incorporated. Further reproduction is prohibited without permission of the publisher.
53 of 70
6WDWLVWLFV6WXG\*XLGH
6DPSOH3URSRUWLRQV
3DJHRI
1.4.6
'LUHFWLRQV
• 8VHWKLVJXLGHWRWDNHQRWHVZKLOH\RXZDWFKWKH7XWRULDO
• .HHS\RXUFRPSOHWHG6WXG\*XLGHLQ\RXUQRWHERRNIRUODWHUVWXG\
.H\7HUPVDQG&RQFHSWV
•
pˆ GLVWULEXWLRQRI
• PHDQRIWKHVDPSOLQJGLVWULEXWLRQRIDVDPSOHSURSRUWLRQ
• VDPSOHSURSRUWLRQV
• VWDQGDUGGHYLDWLRQRIWKHVDPSOLQJGLVWULEXWLRQRIDVDPSOHSURSRUWLRQ
7XWRULDO6HFWLRQV
7KH0HDQLQJRID6DPSOLQJ'LVWULEXWLRQRID6DPSOH3URSRUWLRQ
([HUFLVHV8VLQJWKH6DPSOH3URSRUWLRQ
BBBBBBBBBBBBBBBBBBB
‹&RS\ULJKW$3(;2QOLQH/HDUQLQJ,QF$OOULJKWVUHVHUYHG$GYDQFHG3ODFHPHQWDQG$3DUHUHJLVWHUHG
WUDGHPDUNVRIWKH&ROOHJH%RDUG7KLVPDWHULDOLVLQWHQGHGIRUWKHH[FOXVLYHXVHRI$3(;HQUROOHGVWXGHQWV$Q\
XQDXWKRUL]HGFRS\LQJUHXVHRUUHGLVWULEXWLRQLVSURKLELWHG
54 of 70
S2
Statistics Assignment
Sampling Distribution of p-hat
The Sampling Distribution of p-hat
Page 1 of 5
1.4.8
S2
ˆ is calculated from a count of successes to failures, the distribution of p
ˆ is
Because p
binomial. You may recall that a binomial distribution of counts has a mean np and a
standard deviation
np(1 − p) . You may also recall that to turn a count of successes (x) into
ˆ ), you divide the count x by the sample size n. Similarly, to turn np and
a proportion ( p
ˆ , divide each term by
np(1 − p) into the mean and standard deviation of the proportion p
n:
µ pˆ =
np
= p , σ pˆ =
n
np(1 − p)
n
2
=
p(1 − p)
.
n
p(1 − p)
. Note
n
that the term (1 – p), which is the proportion or probability of failure, is written as q in
pq
many textbooks. So you may see the standard deviation written as
.
n
ˆ is p, and the standard deviation is
So, the mean of the distribution of p
Using the Normal Approximation
When the population is large relative to the sample, and when np ≥ 10 and
n(1 – p) ≥ 10, a binomial distribution can be approximated by a normal distribution. This
ˆ . When conditions are met, it has a distribution
rule applies to the distribution of p
N(p,
p(1 − p)
).
n
NOTE: Some textbooks say you can use the normal approximation if np >5 and nq (or n(1p)) is > 5. These are also valid criteria, but for this Assignment use the criteria np ≥ 10 and
n(1-p) ≥ 10.
The Continuity Correction
ˆ uses a continuous distribution to model a
The normal approximation to the distribution of p
discrete one. Use the continuity correction to offset the error inherent in these situations. To
do this, simply proceed as though the interval you're finding also occupies the space .5
below the lower bound and .5 above the upper bound of the interval.
ˆ , you need to convert your sample proportions to
When using the continuity correction for p
counts, and then use the normal approximation on these counts. For example, let's say the
proportion of people who will vote for candidate A is .7. You draw a sample of 100 people,
and your expected count of successes in a sample (which is the mean of your distribution of
counts) is .7(100) = 70. You want the probability that you'd get a sample with between 66
and 68 successes. To use the continuity correction in this scenario, you'd find the normal
probability for the interval from 65.5 to 68.5.
_____________
© Copyright 2000 Apex Learning Inc. All rights reserved. This material is intended for the exclusive use of
registered users only. No portion of these materials may
be70
reproduced or redistributed in any form without the
55 of
express written permission of Apex Learning Inc.
Statistics Assignment
Sampling Distribution of p-hat
Page 2 of 5
NOTE: As you advance into doing statistical inference, you may not see the continuity
correction very often. In many cases it doesn't make enough of a difference to change your
result.
Using Your Calculator for the Normal Approximation to p-hat
Use normalcdf(lowerbound,upperbound,mean,standard deviation).
ˆ > p) = normalcdf(p,E99,µ,σ)
• P( p
ˆ < p) = normalcdf(–E99,p,µ,σ) Note that you can use any large number in place of
• P( p
E99.
• P(p1 < pˆ < p2) = normalcdf(p1,p2,µ,σ)
NOTE: If you don't enter the mean and standard deviation, the calculator assumes a
standard normal distribution with mean 0 and standard distribution 1. In that case, you can
enter z-scores. On your calculator, that'd be normalcdf(lower z,upper z).
_____________
© Copyright 2000 Apex Learning Inc. All rights reserved. This material is intended for the exclusive use of
registered users only. No portion of these materials may
be70
reproduced or redistributed in any form without the
56 of
express written permission of Apex Learning Inc.
Statistics Assignment
Sampling Distribution of p-hat
Page 3 of 5
Assignment Questions
Answer each question completely and send your work back to your instructor. Show all
work. Justify your answers clearly and completely with formulas and numerical solutions.
Showing your work will allow your instructor to give you partial credit if you make a mistake
but show that you still understand the problem.
1. In a survey, 600 mothers and fathers were asked about the importance of sports for
boys and girls. Of the parents interviewed, 70% said the genders are equal and should
have equal opportunities to participate in sports.
A. What are the mean, standard deviation, and shape of the distribution of the
ˆ of parents who say the genders are equal and should have
sample proportion p
equal opportunities? Be sure to justify your answer for the shape of the
distribution. Use n = 600. (1 point)
B. Using the normal approximation without the continuity correction, sketch the
ˆ . Shade equal areas on
probability distribution curve for the distribution of p
both sides of the mean to show an area that represents a probability of .95,
and label the upper and lower bounds of the shaded area as values of p-hat
(not z-scores). Show your calculations for the upper and lower bounds. (2
points)
C. Considering the sketch in part B, the shaded area shows a .95 probability of
what happening? In other words, what does the probability of .95 represent? (2
points)
D. Using the normal approximation, what's the probability a randomly drawn
sample of parents of size 600 will have a sample proportion between 67% and
73%? Draw a sketch of the probability curve, shade the area representing the
probability you're finding, and label the z-scores that represent the upper and
lower bounds of the probability you're finding. Don't use the continuity
correction. (2 points)
E. Now, use the exact binomial calculation to find the probability of getting
between, but not including, 67% and 73% of the respondents in a sample of
600 who say the genders are equal and should have equal opportunities. To use
the exact binomial, you'll need to convert the proportions to counts by
multiplying each proportion by 600. (1 point)
F. Now try it again, but this time find the probability of getting at least 67% but
no more than 73%. Use the exact binomial calculation. (1 point)
_____________
© Copyright 2000 Apex Learning Inc. All rights reserved. This material is intended for the exclusive use of
registered users only. No portion of these materials may
be70
reproduced or redistributed in any form without the
57 of
express written permission of Apex Learning Inc.
Statistics Assignment
Sampling Distribution of p-hat
Page 4 of 5
2. Random samples of size n = 500 were selected from a binomial population with
p = .1.
A. Is it appropriate in this case to use the normal distribution to approximate the
ˆ ? What are the mean, standard deviation, and shape of the
distribution of p
distribution? (1 point)
B. Using the normal approximation without the continuity correction, find the
ˆ < .12. (1 point)
probability that p
ˆ < .12). This time use the continuity
C. Using the normal approximation, find P( p
correction. (3 points)
Hint: To use the continuity correction you'll need to use the binomial distribution of
counts:
• Convert .12 to a count by multiplying it by 500.
•
Use np and
np(1 − p) to find the mean and standard deviation for the normal
•
•
approximation to the binomial for counts.
You're looking for the probability of getting a count below a certain number.
To use the continuity correction, add .5 to your true upper bound. (You don't need to
increase your lower bound because the lower bound is essentially infinity.)
ˆ < .12). Compare your answer
D. Using the exact binomial calculation, find P( p
with the answers you got for parts B and C. Why are they different? (1 point)
3. One of the ways Americans relieve stress is to reward themselves with sweets. Suppose
a study claims 52% of Americans admit to overeating sweets when stressed. Suppose
also that the 52% figure is correct for the population and that random samples of size n
= 100 Americans are selected.
ˆ have an approximately normal distribution? If so,
A. Does the distribution of p
what are its mean and standard deviation? (1 point)
ˆ without the continuity correction, what's
B. Using the normal approximation of p
ˆ greater than .6? (1 point)
the probability of getting a sample (n = 100) with p
C. Using the normal approximation of the binomial distribution with the continuity
ˆ greater
correction, what's the probability of getting a sample (n = 100) with p
than .6? (3 points)
D. Using the exact binomial calculation, what's the probability of getting sample
ˆ greater than .6? (1 point)
(n = 100) with p
_____________
© Copyright 2000 Apex Learning Inc. All rights reserved. This material is intended for the exclusive use of
registered users only. No portion of these materials may
be70
reproduced or redistributed in any form without the
58 of
express written permission of Apex Learning Inc.
Statistics Assignment
Sampling Distribution of p-hat
Page 5 of 5
4. In 1996 there was a battle in the courts and in the marketplace between Intel and
Digital Equipment Corp. about the technology behind Intel's Pentium microprocessing
chip. Digital accused Intel of willful infringement on Digital's patents. Although Digital's
Alpha microprocessor chip was the fastest on the market at the time, its speed fell
victim to Intel's marketing clout. That same year, Intel shipped 76% of the
microprocessor market. Suppose a random sample of n = 1,000 personal computer (PC)
ˆ be the
sales is monitored and the type of microprocessor installed is recorded. Let p
proportion of personal computers in the sample with a Pentium microprocessor.
ˆ ? (1
A. What are the mean, standard deviation, and shape of the distribution of p
point)
B. Using the normal approximation without the continuity correction, what's the
probability you'd draw a random sample of 1,000 PCs with a proportion of
Pentium chips exceeding 80%? (1 point)
C. Looking at the answer you got for part B, if you got a simple random sample
with
ˆ > .8, would you conclude the population proportion may be higher than .76?
p
Why? (2 points)
Acknowledgements
Question 1:
This question is based on question 7.35 from page 267 of Introduction to Probability and Statistics, Tenth Edition,
by W. Mendenhall, R. Beaver, and B. Beaver. Copyright © 1999 by Brooks Cole, division of Thompson Learning
Incorporated. Further reproduction is prohibited without permission of the publisher.
Question 3:
This question is based on question 7.36 from page 267 of Introduction to Probability and Statistics, Tenth Edition,
by W. Mendenhall, R. Beaver, and B. Beaver. Copyright © 1999 by Brooks Cole, division of Thompson Learning
Incorporated. Further reproduction is prohibited without permission of the publisher.
_____________
© Copyright 2000 Apex Learning Inc. All rights reserved. This material is intended for the exclusive use of
registered users only. No portion of these materials may
be70
reproduced or redistributed in any form without the
59 of
express written permission of Apex Learning Inc.
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 1 of 11
1.5.2
S2
Key Terms and Concepts
Before taking the Quiz, you need to be able to explain the meanings (and recognize symbols
in cases where there is an associated symbol) of each of these terms or concepts. You
should also know when and how to use them in statistics problems.
These terms and concepts are defined in Key Terms.
almost binomial
average waiting time
binomcdf
binomial experiment
binomial probabilities for a range of outcomes
binomial probabilities for an exact number of outcomes
binomial setting
binompdf
Central Limit Theorem
continuity correction
geometcdf
geometpdf
geometric distribution
geometric setting
inferential statistics
mean of the sampling distribution of a sample mean
mean of the sampling distribution of a sample proportion
normal approximation to the binomial
ˆ (distribution of)
p
sample proportions
sampling distribution
sampling distribution of a sample mean
standard deviation of the sampling distribution of a sample mean
standard deviation of the sampling distribution of a sample proportion
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
60 of 70
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 2 of 11
Objectives, Example Problems, and Study Tips
Introduction to Inferential Statistics
Objective
Define inferential statistics, and give at least two examples.
Example
1. Define inferential statistics. Give two examples.
Answer
1. Inferential statistics is the process of estimating population parameters from sample
statistics. Examples include predicting population proportions, such as the percentage of
voters who will vote for a particular candidate, and estimating the mean of a population
parameter, such as the mean age of juniors at a high school.
Objective
Give one reason why probability distributions are important in inferential statistics.
Example
1. Give one reason why probability distributions are important in inferential statistics.
Tip
The answer to this question will become much clearer as you learn more about inferential
statistics. For now, it's helpful to have a general idea of why you're learning so much about
normal, binomial, and geometric distributions.
Answer
1. Probability distributions are used to calculate probabilities so that a researcher can
determine the likelihood of an outcome. If a particular outcome is shown to be highly
unlikely, then it's statistically significant and something other than chance is probably
influencing the results.
Objective:
Outline the process that an inferential study often takes.
Example
1. Outline the process that an inferential investigation often takes.
Tip
Don't memorize these steps word for word. They're only general guidelines, and you won't
have to list them in the Unit Quiz. Once you learn how to do inferential statistics in different
settings, you'll find that different cases require different methods.
Answer
1. Inference takes many forms, but it often includes the following phases:
A.
B.
C.
D.
Identify what you want to study.
Ensure that the study and the sample data are valid.
Use a probability distribution to calculate the likelihood of getting the same results.
If you're comparing at least two measures, do significance testing. This involves
writing hypotheses and testing them.
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
61 of 70
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 3 of 11
Binomial Distributions
Objective
List the four conditions that define a binomial setting, and use those conditions to determine
whether a situation is binomial.
Examples
1. What are the conditions that define a binomial setting?
2. Suppose we know that 30% of all college students nationwide graduated from private
high schools. A researcher wants to find the number of freshmen at a small college in
Kansas who graduated from private high schools. The college has 300 freshmen. The
researcher randomly selects a student from the freshman class and determines whether
the student graduated from a private high school. The researcher repeats this process
until 50 different students have been selected. Is this a binomial setting?
3. Consider the situation in the previous question. If the 50 students were sampled from
the entire population of 1,250 students at the college, would this be a binomial setting?
Tip
In a sample-drawing situation, each trial may not be strictly independent. That's because
the ratio of the population size to sample size will change each time a sample is drawn.
However, we can still consider the setting binomial in certain situations. For instance, the
setting is said to be binomial when the population is large compared to the sample.
Although we've said in this course that a population 20 times larger than the sample is
sufficient, some textbooks say that 10 times is acceptable.
Answers
1. A setting is binomial if:
A. There are only two possible outcomes for each trial (success or failure).
B. Each trial is independent.
C. The probability of success for each trial is the same.
D. There are a set number of trials.
For example, you roll a die 100 times and count the number of times you get a 6. This is
a success/failure situation, since the die either lands on 6 or it doesn't. Each trial is
independent, and the probability of success on each trial is 1/6. The set number of trials
is 100.
2. This situation isn't binomial, since the probability for each trial isn't independent. Within
the population of 300 freshmen, the proportion of unsampled students from public and
private high schools will change slightly each time a student is drawn. This means that
the probability of success for each trial is different. We'd still consider a setting to be
binomial, even if each trial isn't independent, if the population is very large compared to
the sample size. In this case, the population is only six times as large as the sample
size, so we'd probably not consider this to be a binomial setting.
3. Since the population is more than 20 times as large as the sample, we'd consider this a
binomial setting. In this course we said that the population must be at least 20 times
larger than the sample. Some textbooks say it only needs to be 10 times greater.
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
62 of 70
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 4 of 11
Objective
Solve binomial problems involving at least, at most, and between.
Examples
1. You have an octahedral die with faces A, B, C, D, E, F, G, and H. It's a fair diethat is,
each face is equally likely to appear on top. You roll the die 75 times. What's the
probability of getting exactly 9 A's?
2. For the die in question 1, what's the probability of getting at least 7 A's but less than 12
A's?
3. Suppose we know that 30% of all college students graduated from private high schools.
A sample of 50 students is taken from the 1,200 students at a small college, and it's
determined whether each student in the sample went to a private high school. Assuming
that the 30% proportion applies to your population of interest, what's the probability
that at least 18 of the students in the sample went to a private high school.
Tips
• At least 7 includes 7; less than 12 doesn't include 12.
• Remember, the population size should be at least 20 times the sample size to use the
binomial distribution.
• Although you can solve most (but not all) binomial probability problems using your
calculator, you'll have a much better grasp of the concept if you remember the formula:
æn ö
P( X = x) = çç ÷÷ p x (1 − p) n − x
èxø
where
n = number of trials
p = the probability of success for each trial
x = number of successes
Answers
( )( )
9
æ 75 ö
7 66 = .139 . You can also use
1. Solve using n = 75, p = 1/8, x = 9. Thus, çç ÷÷ 1
8
è9 ø 8
binompdf(75,1/8,9).
2. This is difficult to do using the formula, but easy on a graphing calculator. If B(n, p, x)
represents the probability of getting exactly x successes out of n occurrences of an event
that occurs with probability p, then P(7 ≤ X < 12) = B(75, 1/8, 7) + B(75, 1/8, 8) + . . . +
B(75, 1/8, 11) = binomcdf(75, 1/8, 11) – binomcdf(75, 1/8, 6), = .62
3. It's okay to use a binomial distribution here, since the population is at least 20 times
greater than the sample size. Solve using P(X ≥ 18) = 1 – binomcdf(50,.3,17) = .21.
Objective:
Use the normal approximation in binomial setting problems.
Examples
1. Out of 100 specimens of a certain type of rose bush, 85 will produce flowers annually.
Use the normal approximation to calculate the probability that at least 90% of a plot of
250 plants will flower this year.
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
63 of 70
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 5 of 11
2. Can we use a normal approximation to the binomial in the following situation: Statistics
indicate roughly 20% of all adults jog more than twice a week. A sample of 18 adults is
asked if they jog more than twice a week. What's the probability only three of these 18
adults jog more than twice a week?
Tips
• When using the normal approximation, don't forget about the continuity correction.
• There are some rules of thumb for deciding if you can use a normal approximation.
Some textbooks say that the expected numbers of successes and failures should be at
least 5, or np ≥ 5 and n(1 – p) ≥ 5. But the Tutorials in this course use 10 instead of 5,
or np ≥ 10 and n(1 – p) ≥ 10. Either rule will work, though in this course it's preferable
to use 10.
Answers
1. The answer is: µ x = 250(.85) = 212.5, σ x = 250(.85)(.15) = 5.65 . Thus .9(250) = 225,
224.5 − 212.5
= 2.12 .
5.65
So P(z > 2.12) = .017. Note that the exact answer is given by 1 – binomcdf(250, .85,
224) = .013.
so now we find P(X ≥ 225), which is z225 =
2. You can't use a normal approximation here. Some texts will tell you that, to use a
normal approximation, both np and n(1 – p) must be at least 10 (though some books
say 5). Here np = 18(.2) = 3.6 < 10.
Note: Your selections from the Mendenhall textbook may tell you that np and n(1 – p)
must be at least 5, but in this course (and on the AP Exam) it's safer to use the rule that
they should be at least 10.
Geometric Distributions
Objective
List the four conditions that define a geometric setting. Use these conditions to determine
whether a situation is geometric.
Examples
1. Describe a geometric setting and explain how, if at all, it differs from a binomial setting.
2. An event occurs with probability p. On average, how long will we have to wait for the
first success of the event?
3. A sack contains eight red dice and nine green dice. We remove one die at a time (and do
not replace it) until we get a red die. Is this a geometric situation?
Tip
The expected value of X in a geometric setting is the average waiting time and is always
1/p.
Answers
1. A random variable is binomial if the only two possible outcomes are success or failure, if
it occurs a set number of times with each occurrence being independent of the others,
and if each occurrence has the same probability of success. In a geometric setting, there
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
64 of 70
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 6 of 11
aren't a set number of trials. Rather, our interest is in the number of trials until the first
success (or in some cases, a certain number of successes) occurs. However, like the
binomial, each occurrence must be independent of the others and occur with the same
probability each time.
2. The answer is: 1/p.
3. No, this isn't geometric. The probability of getting a red die when the first marble is
drawn is 8/17. But the probability of getting a red die on the second draw is either 7/16
or 8/16 depending on the color of the die drawn first.
Objective
Solve problems involving the geometric distribution.
Examples
1. There are 10 different prizes in boxes of Googily-Snaps. Prize #4 is the most valuable to
collectors. What's the probability that you'll get prize #4 without having to buy more
than four boxes of Googily-Snaps? (Assume that all prizes are equally likely in each
box.)
2. Given the situation in question 1, suppose you're unlucky and don't get your prize in one
the first four boxes. On average, how long will you have to wait to get prize #4?
Tips
While you can solve most geometric probability problems using your calculator, you'll have
a much better grasp of the concept if you remember the formula:
P( X = n) = (1 − p )
⋅p
where
n = the number of the trial that yields the first success
p = the probability of success for each trial
n −1
Some geometric probability problems require you to know and understand the formula.
Don't assume you can do it all on your calculator.
Geometric distributions are skewed right. In other words, the probability of getting your first
success decreases with each trial. The expected value for the number of trials is the average
waiting time, or 1/p, where p is the probability of success for each trial.
Answers
1. The answer is: P(X ≤ 4) = G(1) + G(2) + G(3) + G(4) = geometcdf(.1,4) = .3439.
2. This is asking for the average waiting time, which is 1/p. Thus 1/.1 = 10 boxes.
Sampling Distributions: Means and Proportions
Objective
Define sampling distribution and use the definition to answer questions about sampling
distributions.
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
65 of 70
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 7 of 11
Examples
1. True or False: A sampling distribution of the mean retirement age in North America can
consist of as few as 100 sample values.
2. Consider drawing samples of size 2 from {A,B,C,D,E} and computing the mean of each
sample. The sampling distribution would consist of how many values?
Tips
If you're not sure what a sampling distribution is, look it up in the Glossary.
Remember, a distribution is composed of all possible values rather than a subset of all
possible values. In most cases, all possible values is a theoretical limit, and so the
distribution is a theoretical construct rather than an actual set of values.
Answers
1. False. A sampling distribution is composed of all possible values of a sample statistic
using samples of the same size. Since the population of retirees in North America is
composed of millions of individuals, 100 sample values is only a simulation of a
distribution, it isn't the actual distribution. While 1,000 values may give a pretty good
idea of what the distribution looks like, it's still just a simulation or an approximation
unless it's composed of all possible values.
2. The sampling distribution would consist of 10 values, since there are 10 ways to select a
æ5 ö
sample of size 2 from a set with 5 elements ( çç ÷÷ = 10 ). Even though there are only 10
è2ø
values in this distribution, it's a sampling distribution, since it's composed of all possible
samples of a given size drawn from the population.
Objective:
Determine the shape, mean, and standard deviation of a sampling distribution.
Examples
1. Samples of size 10 are drawn from a large (N > 10,000), symmetric population with a
mean of 45 and a standard deviation of 9. What are the mean and standard deviation of
the sampling distribution of the mean for samples of size 10, and what's the shape of
the distribution?
2. A population has proportion .35 of some characteristic of interest. What are the mean
ˆ for samples of size 50? What's
and standard deviation of the sampling distribution of p
ˆ
the shape of the sampling distribution of p ?
3. A distribution is strongly skewed to the left. Samples of size n are drawn and the
sampling distribution of the sample mean is constructed. What's the shape of the
sampling distribution of x if n = 10, if n = 25, and if n = 100?
Tips
• The shape of the sampling distribution tends to resemble the shape of the parent
population for small samples.
ˆ , we must have np ≥
• To use the normal approximation to the sampling distribution of p
10 and n(1 – p) ≥ 10.
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
66 of 70
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 8 of 11
• Remember that the Central Limit Theorem doesn't apply until the sample size (n) is
large. In many cases, a sample size of 30 is large enough to make the sampling
distribution close to normal.
Answers
1. The answer is: µ x = 45, σ x =
9
= 2.85 . The sample size is small, so the shape will
10
resemble that of the parent population, that is, it will be symmetric about its mean.
(.35)(1 − .35)
= .067 . Samples of size 50 are relatively
50
large, so we'd expect the shape of the sampling distribution to be approximately normal.
(Note that the assumptions we must make to be able to use the normal approximation
ˆ are satisfied in this case.)
to p
2. The answer is: µ pˆ = .35, σ pˆ =
3. For n = 10, we'd expect the sampling distribution to resemble the parent population.
That is, it would still be skewed to the left, but would bunch more about the population
mean than the original population. For n = 25, the distribution would be more normal
than the original, but the extent to which it would resemble the original population
depends on how severe the skewness is. We'd usually expect samples of size 25 to
produce sampling distributions that are approximately normal. If n = 100, we'd certainly
expect an approximately normal sampling distribution regardless of the shape of the
original population.
Objective
Summarize the Central Limit Theorem and use elements of the theorem to answer
questions about sampling distributions.
Examples
1. True or False: The Central Limit Theorem tells us that the sampling distribution of a
sample mean will be approximately normal regardless of the shape of the parent
population.
2. We create a sampling distribution of x by taking samples of size 70 from a population
whose mean is known to be 5 and whose standard deviation is 1. What can we say
about the sampling distribution of x ?
Tips
Remember that the Central Limit Theorem doesn't apply until the sample size (n) is large.
In many cases, a sample size of 30 is large enough to make the sampling distribution close
to normal. For symmetric distributions with no outliers, samples smaller than 30 may be
sufficient.
If you can't easily summarize the Central Limit Theorem in your own words, look it up in the
Glossary and review the Tutorial.
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
67 of 70
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 9 of 11
Answers
1. False. The Central Limit Theorem tells us that the shape of the sampling distribution of a
sample mean will be approximately normal for large samples. Just how large is large
enough depends on the shape of the parent population, but samples of size 30 or
greater are often large enough. For symmetric distributions with no outliers, even small
sample sizes will yield approximately normal sampling distributions.
2. Because the sample size is large, the Central Limit Theorem applies, and you can say
that the shape of the sampling distribution of x will be approximately normal even
though you're given no information about the shape of the parent population. You also
1
= .12 . Note: The last two facts would be true
know that µ x = 5 and that σ x =
70
regardless of the shape of the original distribution or the sample size.
Objective
Solve problems involving the sampling distribution of a sample mean or of a sample
proportion.
Examples
1. A popular soda comes in 12-oz cans. However, the actual volume of soda in the can
varies normally with a mean of 11.9 oz and a standard deviation of .3 oz. What's the
probability that the mean amount of soda in a six-pack is less than 12 oz?
2. A popular soda comes in 12-oz cans. However, the actual volume of soda in the can
varies normally with a mean of 11.9 oz and a standard deviation of .3 oz. What's the
probability that the mean amount of soda in a six-pack is between 11.7 oz and 12 oz?
3. The probability of winning at roulette is about .474. Suppose you bet 50 times. What's
your probability of being even or ahead after 50 bets and after 1,000 bets?
Tips
• When you calculate the standard deviation of the sampling distribution, don't forget to
divide by the square root of the sample size.
Answers
1. You need to find the probability of getting a mean less than or equal to 12.
12 − 11.9
= .82 è Area = .793.
P( x ≤ 12) = z12 =
.3
6
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
68 of 70
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 10 of 11
2. You need to find the probability of getting a mean between 11.7 and 12, inclusive.
) = .742.
P(11.7 ≤ x ≤ 12) = normalcdf(11.7,12,11.9, .3
6
3. You need to find the probability of getting a proportion greater than or equal to .5, which
ˆ ≥ .5). Because 50(.474) ≥ 10 and 50(1 – .474) ≥ 10, we can use the normal
is P( p
approximation to the sampling distribution of a sample proportion. This would be as
follows:
(.474)(1 − .474)
For n = 50: µ pˆ = .474, σ pˆ =
= .071 , normalcdf(.5,10,.474,.071)
50
= .36. For n = 1000: σ pˆ =
(.474)(1 − .474)
= .016 , normalcdf(.5,10,.474,.016)
1000
= .05.
___________
TI-83 screens are used with the permission of the publisher. Copyright ã 1996, Texas Instruments, Incorporated.
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
69 of 70
AP Statistics
Review: Binomial Situations and Sampling Distributions
Page 11 of 11
About the Unit Quiz
What to Bring
• Scratch paper
• Calculator
• Approved formula sheet
• Approved tables
You can't have any reference materials other than those specifically mentioned above. You
won't be able to ask for help during the Quiz.
Hints and Tips for the Free-Response Portion
•
Show your work. The test corrector won't assume you used proper set up and
methods if you reach the correct answer. It's up to you to communicate the methods
that you used. Answers alone, without appropriate justification, will receive no credit.
•
Take your time reading the question. Since we want to see how well you can apply
your knowledge to new and somewhat unfamiliar situations, take some time to think
about the question. If you don't understand the question, you're unlikely to find the right
answer. Read the entire question before beginning to answer.
•
Most questions will be given in several parts. The answers from one section will
often be used in subsequent sections. Missing points in an early section does not mean
you'll lose points in subsequent sections. Again, read the entire question to see how the
different sections connect to each other.
•
The calculator. As in the AP Exam, this Quiz will test you on how well you know
statistics, not on how well you can use your calculator. Be sure you understand the
concepts behind the calculator operations. Don't use "calculator-speak" in your answer—
the instructor doesn't want to read a set of steps for your graphing calculator! Use your
calculator for doing the mechanics, but be sure to clearly communicate your process for
solving the problem.
•
Use Units. If units are given in the problem, make sure that you give them in your
answer.
•
Answer the Question. Finally, be very careful to answer the question asked. Before
you move on, read over your answer to make sure you're providing exactly what the
question asks for. Generally, an answer to a question you weren't asked will receive no
credit.
_____________
Copyright © 2011 Apex Learning Inc. (See Terms of Use at www.apexvs.com/TermsOfUse)
70 of 70