Download Determining the `reasonability of media`s statistics`

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Statistics wikipedia , lookup

History of statistics wikipedia , lookup

Inductive probability wikipedia , lookup

Birthday problem wikipedia , lookup

Ars Conjectandi wikipedia , lookup

Probability interpretations wikipedia , lookup

Probability wikipedia , lookup

Transcript
TWO
Probability
- If one learns by memory, and does not think, all remains dark.
Confucius
In the mid-1980’s, the ten states with the greatest longevity were Hawaii, Montana, Iowa, Utah,
North Dakota, Nebraska, Wisconsin, Kansas, Colorado, and Idaho. During the same time period,
these same states except for Hawaii, were the states that consumed the most animal fat per
human diet. Does this mean eating large quantities or even ridiculous amounts of animal fat
prolongs life? An inference such as this from the data seems ridiculous and for a dairy company
to propagate such lies to the American public would have all the indecency of two priests
swapping stories they heard from confessionals. So what would be a more appropriate
interpretation of the data? It also turned out that there was another occurrence in these same
states, again except for Hawaii; these same states had the highest rate of graduation from high
school. Thus, one could infer, the population’s demographics with regard to education may have
played a significant role in longevity. A greater percentage of people who had more advantages,
maybe better health insurance, maybe better living conditions on the whole contributed to greater
longevity in these states.
There is a fundamental difference between correlating a relationship, which just means drawing
a parallel between the states where people lived the longest and states the consumed the most
animal fat, versus inferring causation, deducing from the correlation that eating animal fat makes
one live longer.
It is said the ordinary chess player’s eyes roam over the entire board while the great grand
masters scan the few squares that really matter. Let’s examine our own vision. Again, in the
mid-1980’s there was a correlation between the thirteen states that were voted to be the most
scenic, WY, NV, CO, NM, MT, AZ, FL, OR, OK, CA, VT, AL, and WA according to the
Automobile Association of America (AAA) and those states whose residents commit the most
suicides. Should we conclude that beautiful scenery surrounding a neighborhood is a risky
venture because it increases the rate of suicide? Is there perhaps a more reasonable explanation?
Unlike the above data, the insight we need to see is not in the form of another set of hidden data.
Rather, it is in the interpretation of the data it self. Do you see it? If not, we will tell you at the
end of this opening section. Think about it. Scan the few squares that really matter.
Statistics you may not have heard.
 While 79 out of 1000 people are injured at home, it is a surprise that nearly 58 out of
1000 are injured at work.
 In this country, there are roughly two suicides for every murder.
Surprising, only because we tend to focus on injuries at home, or worry about our loved ones
being in mortal danger, but rarely do we fixate on injuries at work or the possibility that suicide
can take a loved one.
And why do we talk about so many people out of 1000? Because, raw numbers, when shown in
a comparative light, give you a perspective of the world around you. For example, we know
there are roughly 6.4 billion people living around the world, but since this number is too big to
comprehend we use proportions to visualize a smaller number to represent the whole, say 1000
people. These 1000 people are assumed to provide an average slice of the greater population,
preserving proportions to the world’s true demographic profile. This method is no different than
shrinking the earth’s age to week’s time or shrinking the sun down to the size of a soccer ball to
visualize the age of the earth or the vastness of space. Now, let’s examine the world around us
through these 1000 representatives of the earth’s human population.
By continent, there are
 570 Asians
 210 Europeans
 and just 60 North Americans
By religion, there are
 300 Christians
 175 Moslems, the fastest growing
religion on earth
 55 Buddhists
 33 Jews
Literacy,
 700 would be illiterate
By income,
 60 people would own half the
world’s wealth
Poverty,
 167 would earn less than 1 dollar per
day
Hunger,
 500 would go to sleep hungry.
- 40 -
On the surface of these numbers, things may look really awful. Well, not really. A slight
perspective change could show that as bad as things seem on a global level, it was far worse in
the past. For example, in the 1700’s half of the world’s population died before they were eight
years old.
You may be thinking, well, yes, while this is interesting in a trivial sort of way, maybe even eye
opening if I was to stretch it, but to be honest, so what? How can a ‘feel’ for numbers, basic
numeracy in a literate society be of help to me. Well, we all walk around in darkness on many
subjects, but with a smidge of numeric awareness, maybe every now and then a little light may
shine through.
For a moment, let’s just examine one of those conversations parents of teenagers must be
prepared to convincingly win. Their adorable child who they have protected from the dangers of
the cruel world says to them, “Why should I work hard in school, pops? I’m gonna be a
professional athlete after high school. My future is in the NBA (or NFL or Major League
Baseball or on the LPGA Tour… ).” Well, let’s reply in a way that would make even the
Huxtable’s proud by simply examining the dream through a numerical argument. The parent’s
reply, “Junior, what is the greatest number of high school athletes you believe could actually
make such a leap? Is it reasonable or even generous to assume that at most there are 50 players
per high school graduating class in the United States that can conceivably make the NBA by way
of the draft? With the influx of European players, non-senior NCAA athletes and some high
school ballers tossed into the NBA draft, this 50 is actually quite optimistic considering each
draft has 32 teams and only 3 rounds. Moreover, did you know only 1 out of every 50,000 high
school athletes go on to play a professional sport and this includes bowling. The odds are not in
your favor. Try altering your dream. Why don’t you aim for a Ph.D? Last year, there were
roughly 44,000 doctoral degrees conferred in this country. Do you think the odds against each of
those newly appointed Dr.’s were bordering on 1 in 50,000? That would mean for their high
school class, if we multiplied 44,000 by 50,000, 2.2 billion students in that high school class
must have set their sites on a Ph.D too. Does this seem reasonable? Of course not, this number
is absurd, it is 7 ½ times the size of our country, for goodness sakes. Set the Ph.D. in your sights.
It’s a more realistic dream. Pick a book. Study … “
Now, to formulate that argument, you would have to know some of the figures quoted. We
easily researched ours and found the “1 out of 50,000” portion of the argument was formed by
Art Young, the director of Urban Youth Sports at Northeastern University Center for the Study
of Sports in Society. And the 44,000 doctorals degrees was actually a modest adjustment of the
true figure, 44,1660, according to the National Center for Education Statistics (US Department of
Education). The point is, mathematics and the ability to be comfortable with numbers and the
use of logic with the ability to draw proper inferences, does not exist independently from us.
Rather, mathematical reasoning can enhance your life and with it sharpened, you’ll be a better
decision maker. The statements, symbols and methods we will study are not meaningless
statements governed by rules, but rather a way of viewing life around us. The trick here is to
understand how to work with numbers, and at the same time to not have the personality of an
accountant.
- 41 -
Back to the correlation between the thirteen most scenic states and the same thirteen state’s with
the highest rate of suicide. Is it reasonable to infer that nature’s beauty causes one to ponder
whether of not to end their own life? Of course not. But, it does seem reasonable to conclude
that a larger number of transient people move to those same scenic states seeking to start over,
picking scenic states as a place because the state offers nature’s comfort. Many are looking for
a new job, new life. Unfortunately, for a percentage of these people, their dreams don’t come
true.
Exercise Set
Ratios designed to help you understand the world around you and allow you to
make forecasts
1. Internet Project: The World Population
compressed to 100
If we could shrink the Earth's population to
a 100 people, with all the human ratios
remaining the same; what would this
collection of people look like? How many
would be …



















Asians
Europeans
From the western hemisphere (north
and south)
Africans
Female
Male
Non-white
White
Non-Christian
Christian
60 % of the entire world's wealth
would be in the hands of this many
people
from the United States
live in substandard housing
unable to read
suffer from malnutrition
be near death
be near birth
have a college education
own a computer
For problems 2 – 9, use the following data
from Table 1.
The following data can be found at
http://www.census.gov/ at the US Census Bureau web
site.
Table 1. Population of Selected States in the
years 1990, 2000 and 2003.
Fill in the remaining information for your
home state.
State
Florida
New York
California
Arizona
Your home
state
1990
12,941,197
17,987,163
29,816,592
3,665,228
2000
15,982,378
18,976,457
33,871,648
5,130,632
2003
17,019,068
19,190,115
35,484,453
5,580,811
2. Fill in the table below by finding the
numeric population growth (actual
difference in population) for each state for
the indicated time period.
State
Numerical
Growth
from ’90 – ‘00
Numerical
Growth
from ’00 – ‘03
Florida
New York
California
Arizona
Your home
state
- 42 -
6. Population Forecast for each state.
Predict the each state’s population for the
year 2010 based on the percent growth for
the indicated time period.
3. Fill in the table below by finding the
average annual population growth
(population growth per year) for each state
for the indicated time period.
State
Average Annual
Numeric
Growth from ’90
– ‘00
Average Annual
Numeric
Growth from 00 –
’03
Florida
New York
California
Arizona
Your home
state
4. Fill in the table below by finding the
percent growth (ratio of 2000 to the 1990
population or ratio of 2003 to the 2000
population respectively) for each state for
the indicated time period.
State
Percent Growth
from ’90 – ‘00
Percent Growth
from ’00 – ‘03
State
Predicted 2010
population
based on
Percentage of
Growth from ’90 –
‘00
Predicted 2010
population
based on
Percentage of
Growth from ’00 –
‘03
Florida
New York
California
Arizona
Your home
state
7. Why are the population projections for
the year 2010 larger when based on the
percent change?
8. Which of the four predictions, Annual
Numeric Growth from ’90 – ’00, Annual
Numeric Growth from ’90 – ’00, Percent
Growth from ’00 – ‘03 or Percent Growth
from ’00 – ’03, do you feel is the most
accurate? Why?
9. For which of the states is the projected
predictions the largest and why?
Florida
New York
California
Arizona
Your home state
5. Population Forecast for each state.
Predict the each state’s population for the
year 2010 based on the annual numeric
growth for the indicated time period.
State
Predicted 2010
population
based on Average
Annual
Numeric Growth
from ’90 – ‘00
Predicted 2010
population
based on Average
Annual
Numeric Growth
from ’00 – ‘03
Florida
New York
California
Arizona
Your home
state
- 43 -
Introduction to Probability, Coincidences
- I don’t know what’s the matter with people; they don’t learn by understanding; they learn by
some other way – by rote, or something. Their knowledge is so fragile! Richard P. Feyman, winner of
Noble Prize in Physics. 1985. “Surely You’re Joking, Mr. Feyman!”, WW Norton & Company, Inc.
I had a friend in college who had the world’s worst pick-up line. He would set his eyes on some
gorgeous co-ed and amble over to her. In a dead pan voice he would ask, “have you ever been to
Africa?” The co-ed, with eyes rolling, would reply, “no” and he would say, “what a coincidence,
never have I. See, we already have something in common.”
The fact that two events coincide means nothing. Situations where two events coincide gives us
the root of the word coincidence, right? Consider one’s parent who routinely has said over the
years “you and I were talking about so and so yesterday, and guess who I ran into today? So and
so. What’s the odds of that?” When one considers all the names spoken within a family each
and every day, all the tens or even hundreds of conversations they have each week, all the
hundreds or even thousands of conversations they have each month, sooner or later, one of the
people discussed, on one of the many days, weeks, months, and years they have been together,
would show up somewhere in the presence one of the family members. No one should be
stunned by this occurrence. Life can sometimes seem like photographs in our minds and we crop
away the edges of anything we do not want to see. What is construed as a coincidence with some
special meaning assigned to it is nothing more than basic probability presenting itself. As was
the college pick up line. “Well, what are the probability she has been to Africa,” he would say
after his personalized probability lecture with respect to being so cheesy. “At least I have a can’t
miss opening.” Can’t miss, no. But, it is more likely than not that the unknowing woman will
have that ‘something uncommon in common’.
We are all familiar with the bizarre coincidences between the Kennedy and Lincoln presidencies,
and these Kennedy/Lincoln coincidences certainly make interesting reading. They captivate
many filling chat rooms and web pages. Let’s pick one of the chat rooms and glance at the
coincidences. We choose snopes.com because they choose to discuss the coincidences in a
manner befitting this discussion; they claim to have analyzed each coincidence thoroughly. Source:
http://www.snopes.com/history/american/linckenn.htm









Abraham Lincoln was elected to congress in 1846. John F. Kennedy was elected to congress in 1946.
Abraham Lincoln was elected president in 1860. John F. Kennedy was elected President in 1960
Lincoln's secretary was named Kennedy. Kennedy's secretary was named Lincoln.
Andrew Johnson, who succeeded Lincoln, was born in 1808. Lyndon Johnson, who succeeded Kennedy,
was born in 1908.
John Wilkes Booth, who assassinated Lincoln, was born in 1839. Lee Harvey Oswald, who assassinated
Kennedy was born in 1939.
Booth ran from the theater and was caught in a warehouse.
Oswald ran from a warehouse and was caught in a theater.
Booth and Oswald were assassinated before their trials.
The names Lincoln and Kennedy each contain seven letters.
- 44 -









Both names contain fifteen letters
Both wives lost their children while living in the White House.
Both Presidents were shot on a Friday.
Both were shot in the head.
Both were assassinated by southerners.
Both were succeeded by southerners.
Both successors were named Johnson.
Both assassins were known by three names.
Both were particularly concerned with civil rights.
Hain’t we got all the fools in town on our side? And hain’t that a big enough majority in any town?
- Mark Twain, Huckleberry Finn
Conspiracy Theories or Coincidences What do all these seemingly unrelated coincidences
mean? Anything? we say, “no.” Any two particular events have an extremely high number of
associated facts, events and meanings associated with them. The fact that some of these
occurrences appear to overlap or coincide should not surprise us. It is just a reflection of the
multiplication principle discussed in the previous chapter. The events that coincide are as the
term suggest, coincidences, not anything more. They do not conspire something dark and
dangerous, they are not conspiracies.
Take this example. Two people meet on a plane, they find they have a common acquaintance.
So what? I know 100 people, who know 100 people, who also know 100 people means
100x100x100 = 102 x 102 x 102  106  1,000,000 acquaintances. She knows 100
people who each know 100 people who also each know 100 people. And one person is an
acquaintance of two people sitting next to each other on the plane. Big deal. It’s just a
consequence of the multiplication principle. It does not mean there exists a special significance
between the two passangers.
Don’t believe it? Take any two seemingly unrelated events, ‘google’ the topics on the Internet,
and you will uncover just as many “coincidences” or “what’s the odds of that” as the
Kennedy/Lincoln coincidences. You try it. Start benignly. Pick any two unconnected
presidents, Millard Fillmore and Harry Truman or Grover Cleveland and James Buchanan.
Google. Then spin your own conspiracy theories about two presidents. Write a book, if you
want. We often thought a good book could be written if we employed the same notion with the
vice-presidents. Common thread or little known conspiracies about those really in charge.
From John Adams to Dick Cheney. Certainly, Dan Quayle and Levi Morton must have had
inconsequential commonalities that may be cleverly exploited into something more.
Enter these three seemingly unrelated items into google.com: December 5th,, boats, sinking. And
then search. And search. And search. The result: You could find that on December 5th, 1664, a
ship off north Wales with 81 passengers aboard, sank. There was one survivor. A man named
Hugh Williams. On the same date, December 5th, 1785, a ship sank with 60 passengers aboard.
There was one survivor. A man named Hugh Williams. On December 5th, 1860, a ship with 25
passengers abroad sank. There was one survivor. A man named Hugh Williams. Going into
this venture, we didn’t know the coincidence we would find, but we believed a coincidence
could be put together. By only telling you the three facts by themselves, we could easily extol
- 45 -
this coincidence as something meaningful. And if the point has yet to be made, coincidences are
not meaningful.
Numerology Closely associated with the notion of coincidence is the pseudo science
numerology. Through the dark annals of history, people have made veiled attempts at projecting
importance in the numerical coincidences that may be nothing more than the result of a simple
fact; seemingly rare events happen everyday. Collecting data for a single day from history and
assigning some hidden importance to random occurrences is more than likely meaningless.
Rather, what is more astounding is that someone actually took the time to tie together unrelated
facts.
Let’s look at September 11th, 2001. The tragic day will forever be seared into the very
conscience of the American people with images never to be forgotten. The death toll associated
with the terrorist attacks that day exceeded 3,000. Now, let’s examine this date through out
history:












September 11, 3 B.C. The Birth of Christ. In a book written in 1981 called "The Birth
of Christ Recalculated", the author, Dr. Ernest L. Martin, claims this as the date of the
birth of Jesus Christ's. (Yes, we know it is controversial.)
Sept. 11th , 1297. The Battle of Stirling Bridge that was portrayed in the film Braveheart
took place
September 11, 1609. The explorer Henry Hudson discovers Manhattan Island, the
location of the World Trade Center Towers.
Sept 11th, 1709. The first European War of the modern Era, The Spanish War of
Succession began. And on September 11, 1714, the war ends.
Sept 11th, 1772. The last battle of the American Revolution took place.
September 11, 1922. A British memorandum is approved for the state of Palestine,
securing a promise to create a Jewish national home. There was vast objection voiced
through out the Arab world.
September 11, 1941. Construction of the Pentagon officially starts, 60 years to the day
before the Sept 11th attacks would destroy one side.
September 11, 1944. The US Army crosses the border into German territory for the
first time during WWII.
September 11, 1972. The ending of the 1972 Munich Olympics, the Olympics which
introduced the world to terrorism. There were 121 participating countries (11x11=121),
and 11 Israeli athletes were killed.
September 11th, 1990. Then U.S. President George Bush addresses Congress on the
Persian Gulf crisis, vowing that "Sadam Hussein will fail" in his takeover of Kuwait.
September 11, 1992. Hurricane Iniki, the most powerful hurricane to strike the state of
Hawaii, causes six deaths.
September 11, 1999. The Jewish calendar is 6,000 years old on this date.
Source: napsterites web site http://www.p2p-zone.com/underground/showthread.php?t=20564 ,
September11thNews.com http://www.september11news.com/Sept11History.htm , and http://en.wikipedia.org/wiki/British_Mandate_of_Palestine
For some reason far too many of us are compelled to find significance when presented a string of
facts that appear to be related. Sadly, in reality, it is random coincidences that more often than
- 46 -
not govern our lives. Unfortunately, these numerological assumptions are more fascinating to us
and implications that much in life is random is probably a little scary to most.
Sports Every four years, the most important football game of the year is played by the
Washington Redskins amid the District of Columbia’s changing fall colors of late October or
early November. For the National Football League, there were times when this game was of
consequence, but usually for the NFL, due to the too often lean years of the Redskins, this game
was meaningless. But, for our nation, ‘the game’ dictated more than just the direction of two
football teams; it directed the direction of our nation for the next four years. Since 1936, in an
election year, if the Washington Redskins won their last home game prior to the presidential
election, the incumbent party won the election. If they lost, the challenger’s party won. The
streak began in 1933, when the Boston Braves were renamed the Redskins. Since then,
beginning with a Redskins home win prior to the FDR versus Alf Landon presidential race and
the consequent re-election of Franklin Roosevelt in 1936, the trend has held. In 2000, the
Redskins lost to the Tennessee Titans, and the incumbent’s party was ousted as George W. Bush
beat Al Gore. In 2004, the Redskins lost to the Packers 28-14. Correlation verses causation?
Political If you want to ingratiate someone to your way of thinking, then exploit the highly
known fact that most people are not comfortable with numbers. Politicians, spin doctors and the
like have been doing this for years. Just assign significance to a situation by extolling raw
numbers and leave out minor details such as the likelihood said situation would occur in the first
place.
Democratic US Representative from Massachusetts Edward Markey voiced a frightening
statement recently when he said, “This is a rapidly growing problem that will soar out of control
if the industry does not wake up to its responsibility … “ What was he referring to, what was the
problem that could spin out of control if something is not done by the highest echelon of the
industry? His statement was in response to these recently publicized raw numbers: in 2000,
there were 6,594 injuries at fixed site amusement parks, and there were 15 fatalities or serious
head injuries in the past ten years.
What we are about to say in no way minimizes the seriousness of this type of injury. In fact, just
the opposite. It serves to give us, the public, a better understanding of this issue. Articles
extolling this fear, with disturbing headlines like “Do breakneck speeds and high-g turns push
thrills to a new lethal level” are deceptively alarming. Yes, new roller coasters these days are
bigger, faster, wilder and have incredible ground-breaking latest up to the minute modernized
new twists and turns. But, to imply the industry is negligent in its attempt to secure the safety of
its passengers is outright misleading. The fact is when published alone, the number
6,594
stands out in a frightening way. But, let’s examine this number more closely. Roughly
317,000,000 or over 300 million people visit fixed site amusement parks in a given year. Now,
let’s compare this number to those injured each year in other recreational activities: 82,722
people are injured on trampolines, 62,812 are injured in swimming pools, 544,561 are injured on
bicycles, 20,000 are injured at music concerts, and 200,000 are injured in preschool and
elementary school playgrounds.
- 47 -
Why do people fear amusement parks so much? Why don’t they fear a trampoline, a bicycle or a
playground? Sure, maybe there are deep psychological reasons, like when one bounces on a
trampoline or rides a bike, they are in control and when you go on a ride in an amusement park,
you strap yourself into a ride, thus giving up your control over the motion. But, to be honest,
headlines extolling raw numbers, such as 6594 people injured each year with follow ups from
politicians are notoriously common instruments to instill public panic or at the least, public
apprehension.
We see alarming headlines all of the time associated with sensitive issues. Memorial Day
weekend, 2003. The headlines read, “Holiday weekend traffic death soar to nearly 500
nationwide.” Should you be alarmed? Reluctant to drive next weekend? 29 million people
traveled by car more than 50 miles from their homes that weekend.
Unfortunately, the public places too much emphasis on small numbers singled out by alarmists
and does not recognize the significance of large raw numbers that are quite significant. Consider
the state of Massachusetts, where US Representative Markey hails. In 2002, in Boston alone,
109,128 people live in poverty, up 7,036 from the 102,092 people in the city ten years earlier.
Massachusetts has 600,000 people living in poverty. 600,000 people is nearly one in ten people
from the good commonwealth. In Boston, 28,928 children live in poverty and statewide,
240,000 children are poor. These numbers pale compared to Hollywood’s dilemma. In Los
Angeles county alone, where entertainment routinely makes the headlines, 1,679,000 men,
woman and children are poor. 640,000 of these people are children. Where are the alarming
headlines here? Where is the public outcry? Is the issue of mass poverty, where some
35,800,000 United States citizens live and function below the poverty level too big to confront.
Health care, education, homeless, and unemployment are too complicated to tackle. Headlines?
We just don’t see them.
Take a lighter issue, where the numbers and the implications are largely ignored by our society.
In 2002, there were 288,368,698 people in the United States. Roughly one in eight were 65
years or older. By 2030, it is estimated that one in five will be ‘elderly’. Moreover, the fastest
growing segment of our population is the oldest of the old, the 85 and older crowd. In 2010,
there are projected to be 6 million, which is double the 1990 number of 85 and older. By 2050,
the number is projected to rise to 21 million.
- 48 -
Exercise Set
1. Answer this question, what impact does
the demographic statistics stated in the last
paragraph of this section concerning the
‘elderly’ have on politicians, policy makers
and planners. Start with the notion that the
people from this segment of the population
are more likely to be disabled than the
young elderly, from 65 to 85. Attack the
question by discussing the effects on
insurance, care, hospitals, voting alignments
and so on. To support your argument use
numbers. Research and gather data and then
develop a numerical argument to advocate
your position.
For questions 2 and 3 use the data below.
Taken from the US Census Bureau: http://www.census.gov/PressRelease/www/releases/archives/facts_for_features/001676.html
May 17, 1954 marked the U.S. Supreme
Court decision that ruled unanimously
segregation of public schools “solely on the
basis of race” denied African American
children equal educational opportunity, even
though “physical facilities and other
‘tangible’ factors may have been equal.” To
honor that landmark decision, the US
Census Bureau assembled the following data
below in the “*Special Edition* Brown v.
Board of Education: 50th Anniversary on
the educational attainment and school
enrollment of African Americans — then
and now.”
Enrollment: 1954 to 2002. Sixty nine
percent of African American children ages
5 and 6 were enrolled in school in 1954. By
2002, enrollment for African American
children of those ages had risen to 96
percent.
Twenty four percent of young, African
American adults ages 18 and 19 were
enrolled in school in 1954. In 2002,
enrollments of this age group increased to
58 percent.
High School Graduates: 1952 to 2002.
Fifteen percent of African Americans age
25 and over in 1952 were high school
graduates. By 2002, this percentage had
risen to 79 percent.
The raw numbers associated with these
percentages are 1.6 million African
American, 25 years of age and older, had
earned a high school diploma in 1957. This
number had risen to 16 million by in the
year 2002.
College Graduates: 1952 to 2002. Two
percent of African Americans age 25 and
over in 1952 were college graduates. By
2002, this percentage had risen to17 percent.
In 1957, 252,000 African Americans had
earned at least a bachelor’s degree. In 2002,
3.5 million African Americans had achieved
a bachelor’s degree.
Students: 1955 to 2002. In 1955, 4.5
million African Americans were enrolled in
schools (nursery through college). This
number had risen to 11.7 million by 2002.
There were 155,000 African American
college students in 1955 and by 2002, this
number had risen to 2.3 million.
There were 926,000 African American high
school students in 1955 and in 2002, this
number had rose to 2.6 million.
2. Causation. Certainly, dramatic social
changes occurred from 1955 to 2002 and are
reflected in the numbers above. No rational
person could construe these changes as
coincidences. The statistics reflect
causation. Construct an argument using the
above numbers provided that addresses a
“cause and effect relationship” for these
increases. In your argument, try to account
for as many of the statistics presented above.
Also, include any statistic or social change
you are presently aware of to support your
argument.
- 49 -
3. Causation. What impact did the changes
in the above demographic statistics have on
society over the past 50 years? Attack the
question by imagining you are currently a 75
year old who witnessed the changes first
hand. Write a narrative discussing the
perceived effects on education, finance,
business and entertainment/the arts.
Research and gather data and then develop a
numerical argument to advocate and support
your position.
4.
Taken from the US Census Bureau
http://www.census.gov/PressRelease/www/releases/archives/facts_for_features/001702.html
Is it difficult for you to imagine life without
a color television? March 25, 2004 marked
the 50th year anniversary of color TV
emergence in American society. On March
25, 1954, the Radio Corporation of America
began to manufacture color television sets at
its Bloomington, Ind., plant. Roughly 5,000
model CT-100 color receiver’s were built
11 whose parents in 2000 imposed at least
one rule for watching TV, such as types of
programs watched, how early or late the
children could watch and the number of
hours watched. The percentage dropped to
73 percent for children ages 12 to 17.
$255.18 - The projected spending per person
for cable and satellite TV in 2004.
$34.71 - The estimated average monthly rate
for cable TV in 2002. 154,000 - The
number of people employed in the
manufacture of television, radio and wireless
communications equipment in the United
States in 2001. 21,724 - The number of
stores that primarily sold televisions and
other electronic equipment in 2001.
$11.7 billion - The annual payroll for the
245,000 employees of 6,692 cable TV
networks and program distribution firms in
the United States in 2001. 1,937 - The
number of television broadcasting networks
and they retailed for $1,000 each. The US
Census Bureau assembled, as seen below, a
sampling of statistics from its publications
about television and the television industry
to celebrate this anniversary.
248 million - The number of television sets
in U.S. households in 2001.
98.2% - The percentage of households with
at least one TV in 2001. (Compared to
87.3%, the percentage of households with at
least one TV in 1960: From the 1960 census)
2.4 - The average number of televisions per
home in 2001.
1,669 - The projected number of hours that
adults (age 18 and older) will watch
television in 2004. This is the equivalent of
about 70 days.
94.3% - The percentage of people age 18
and over who said they watched television
in the spring of 2002. Older Americans (age
65 and over) were more likely to be glued to
the tube (97 percent) than any other age
group.
92% - The percentage of children ages 6 to
and stations in the United States in 2001.
31,235 - The number of people working
behind the lens as television, video and
motion picture camera operators and editors,
according to Census 2000. $10.7 billion The payments by television broadcasting
firms for broadcast rights and music license
fees in 2001. Such payments constitute the
biggest expense of TV broadcasters. The
next highest expense was the annual payroll,
$6.5 billion. $41.8 billion - Amount spent
on television advertising in 2002, up from
$38.9 billion in 2001. Research several of
the above statistics for the year 1990 (see
US Census Bureau http://www.census.gov/).
What differences do you observe between
the years 1990 and 2000? Which categories
changed the most? The least? Based on this
information, had you been an investor with a
crystal ball in 1990, which companies would
- 50 -
have been a good choice for investments for
the upcoming decade of the 90’s.
The following are projects that may be
assigned as “on-line projects, projects to
involve research on-line.”
5. Coincidences: Presidential Births and
Deaths Considering coincidences among
the presidents, we turn our attention to the
birth and death dates of American
presidents. Research the forty-one
presidential births, and thirty-six presidential
deaths. Were any presidents born and did
any presidents die on the same day of the
year? (Note: Later in this section, we will
examine if these common days were indeed
coincidences.)
8. Coincidence in Literature. Research
the following: In 1898, Morgan Robertson
wrote a book entitled Futility. It described
the maiden voyage of a transatlantic luxury
liner named the Titan. Although it was
flaunted as being unsinkable, it collides with
an ice burg and sinks. Of the 3,000
passengers, many perish. The boat had 24
lifeboats.
When did the Titanic sink? Was it touted as
a transatlantic luxury liner that was
unsinkable? Did it strike an ice berg? How
many passengers were on the Titanic? How
many lifeboats?
6. Presidential Coincidences Aside from
the aforementioned presidents Lincoln and
Kennedy, two of the other presidents who
were shot has coincidences associated with
their presidencies; Research James Garfield
and Ronald Reagan. Find as many
coincidences as you possibly can about these
presidents as well as their assassination
attempts.
One last note on this topic. When
interacting with numbers presented by the
media, it should be noted that an all too
common political ploy is to publicize either
percentages of the population or rates of
change instead of the raw data. Many times
the “fear” is that the numbers themselves are
so large that the public would become
distressed. For example, when the AIDS
epidemic swept the nation in the 1980’s and
1990’s, television news commentators
nightly quoted how fast the AIDS epidemic
was spreading instead of informing us as to
how many people were inflicted with the
virus. This was done to soften public
outcry, the actual numbers were thought to
frightening for the general public. Checkout
your own level of numeracy: When viewing
nightly news or reading the daily paper or
other media sources, be on the look out for
rates of change, percents, or numbers that
may be spun so that the presentation of the
facts are not as alarming as they first
appear.
7. Coincidence in Literature. Research
the following: In 1883, Edgar Allen Poe’s
book, The Arthur Gordon Pym of
Naugatuck, had a sequence where four men
are adrift in a boat, and they kill and eat the
cabin boy Richard Parker. When in the
future did this sequence of events actually
occur?
- 51 -
Probability
Poverty and Probability On Thursday, August 26, 2004, CNN.Com reported “Census: More
Americans living in poverty. WASHINGTON (AP) -- The number of Americans living in
poverty increased by 1.3 million last year, while the ranks of the uninsured swelled by 1.4
million, the Census Bureau reported Thursday. It was the third straight annual increase for both
categories. While not unexpected, it was a double dose of bad economic news during a tight reelection campaign for President Bush. Approximately 35.8 million people lived below the
poverty line in 2003, or about 12.5 percent of the population.”
Poverty, in case you are interested, was defined to be a household income under 18,400 dollars per year for
a family of 4, a household income under 15, 260 dollars per year for a family of 3, a household income under
12,120 dollars per year for a family of 2 and under 8,980 dollars per year for a person living alone. Do you
think this is a substantial number of people who live in poverty? What is the probability that
someone you meet lives in poverty? Well, if there are 293 million US citizens in August, 2004,
and 35.8 million live beneath the poverty level, then the ratio or percent of the population who
35,800,000
live in poverty is
 0.122 or roughly 1 in 8 Americans live in poverty. Because
293,000,000
people tend to socialize with others within their own economic status level; within their own
group is their perceived average slice of reality; this does not automatically imply one in eight
people you meet will live in poverty.
Probability is used to shine light on ‘the other side of the argument.’ For example, let’s look at
both sides of an argument using the same exact statistics. In 1990, there were 33.4 million
people who lived beneath the poverty level. It would be obvious to point out that between 1990
and 2003, 2 ½ million more US citizens lived beneath the poverty level. But, in 1990, the
population of the United States was roughly 248 million. So, the probability you will meet
33, 400, 000
someone who lives in poverty is
 0.135 . Between 1990 and 2003, the number of
248, 000, 000
people living on poverty increased by 2 ½ million, while the percent of US citizens living on
poverty dropped from 135 per 1000 to 122 per 1000.
The probability an event E is simply a measure of the likelihood event E will occur.
Intuitively, people have a “feel” for the concept of the likelihood that an event will occur. For
example, if we ask you what the probability is for you to obtain a heads up if you were to toss a
coin, you know it is just as likely to toss a head than to toss a tail. And you probably know the
probability associated with a heads up is ½. In mathematical terms, we call the tossing of the
coin the event and we call the result, be it heads or tails, the outcome.
The probability an event E will occur is
the number of desired outcomes divided
by the total possible number of outcomes.
* The outcomes need to be equally likely to occur.
- 52 -
P( E ) 
Number of desired outcomes
Total number of outcomes
For example, if you roll a die, the total number of outcomes is 6 because you may roll a 1, 2, 3,
4, 5 or 6. The set of all outcomes comprise what is called the sample space. If we ask what is
the probability of rolling a two, the event E is the rolling of the die, the number of desired
outcomes is 1, specifically, it is the desire to roll a two. We say the probability of rolling a two is
1/6, we write P(E) = P(roll a 2) = 1/6. Based on this thought process, we have P(2 or more) = 5/6
because there are the same 6 possible outcomes and 5 of these 6 outcomes, rolling a 2, 3, 4, 5 or
6 are desired.
Since we are using the language of probabilities, we interpret the meaning of these ratios as
follows. If you rolled a single die six times, on average you would see the two appear exactly
once. Is this exactly once each time you roll the dice six times? Possibly. Possibly not. If you
rolled a die a large number of times, say 1000, a good guess would be that about 160 to 170 rolls
would be recognized as a ‘roll of a two’. Try it. This is because the theoretical probability of
rolling a two is 1 chance in 6, which is 1 divided by 6, or 0.1666, but this is only what is
expected to happen. It is not guaranteed. So, when you really roll the die over and over again,
anything can happen. The result of any single roll is not predictable. But, the more you role, the
closer you get to the theoretical “1 out of 6 times” of rolling the two. Maybe after ten rolls you
might not get a two at all, but after two hundred rolls, our guess is that you will probably see the
two 0.166 x 200 or approximately 33 times. Roll the dice a thousand times and obtain a two
0.166 x 1000 166.7 , or around the interval of 160 to 170 times.
Example One
When rolling a single die, find the following probabilities:
a) P(4 or less) b) P(an even number) c) P(a number less than 11) d) P(a number greater than
11)
Solution
a) The probability of rolling a 4 or less is 4/6 or 2/3. There are 4 desired outcomes and they are
rolling a 1, 2, 3 or 4. The sample space consists of the same 6 possible outcomes, 1, 2, 3, 4, 5 or
6 as discussed above.
b) The probability of rolling an even number is 3/6 or 1/2. There are 3 successful outcomes and
they are rolling a 2, 4 or 6. Again, the sample space consists of the same 6 possible outcomes as
discussed above.
c) The probability of rolling a number less than 11 is 6/6 or 1. All 6 possible outcomes from the
sample space are numbers less than an 11 and thus would be a desired outcome. In other words,
every time you roll a single die, you will see a number less than 11. The probability of 1
implies the desired event is certain to occur.
d) The probability of rolling a number greater than 11 is 0/6 or 0. None of the 6 possible
outcomes from the sample space would be a success because none are greater than an 11. If you
- 53 -
roll a single die, you will never see a number greater than 11. A probability of zero means
your set of desired outcomes will never occur.
Numerically speaking, the range of numbers for the probability of any desired event to occur
spans from the smallest number, which is 0 for an impossible event to the largest number,
which is 1 for an event that will definitely occur. All probabilities are numbers between 0 and
1. The likelihood of an event occurring has a probability closer to 1 if it is more likely to happen
and closer to 0 if it is less likely to happen.
Once again, working with P(E) = P(roll a two) = 1/6 we can quickly determine the probability
__
__
for the compliment of E, written P( E ). P( E ) is the probability event E will not occur. We
have the event E is to “roll a two” and the compliment set is found by removing all the elements
of event E from the sample space, S. Using set notation, we have S\E = E . The probability
associated with E can be found by subtracting P(E) from P(S). We know P(S) = 1. Why?
__
P( E ) = P(roll a 1, 2, 3, 4, 5 or 6} = P(S) – P(roll a two) = 1 – 1/6 = 5/6. The probability of
rolling a number is not a 2 is 5/6. Let’s look at this from an alternative point of view. The basic
Number of desired outcomes
premise of determining a probability is to use the ratio of
.
Total number of outcomes
In our example, n( E ) = 5 and n(S) = 6. Thus, P( E ) =
n( E ) 5
 .
n( S ) 6
There are two basic strategies in determining a probability for an event E. We can apply the
ratio of the number of desired outcomes divided by total possible number of outcomes. Or if we
know the probability of an event E, we can find the probability of E compliment by subtracting
P(E) from 1. Often we employ the compliment strategy when the number of desired possible
outcomes is difficult and seemingly impossible to calculate. We will see this technique applied
often in our discussion of probabilities.
Example Two
Question - Is race is an important “filter” on who goes to prison? On New Year’s Eve of
2001, there were 1,955,705 people in U.S. prisons and jails. The population of the country that
year was approximately 275 million people.
a) To what degree does the US incarcerate it’s citizens?
Solution: The probability you were incarcerated was 1,955,705
275,000,000
or
711
 0.007 . This ratio implies nearly 7 out of every 1,000 Americans were
100,000
incarcerated in 2001.
In 2001, the United States population was comprised of the following proportions. There were
approximately 211 million Caucasians and 500,000 were incarcerated. There were
approximately 34 ½ million African Americans and 625,000 were incarcerated. There were
- 54 -
approximately 35 million Latin Americans and 213,000 were incarcerated, among the 10 million
Asians 9,900 were incarcerated and from the 2 ½ million Native Americans, 17,700 were
incarnated. Note: Approximately 600,000 of those incarcerated did not readily fit in any of the
above ethnic groups.
b) When you break down the statistics, is incarceration is an equal opportunity
punishment? For each ethnic group, find the proportion of these citizens who were
incarcerated. Write the result in terms of so many out of 100,000 US citizens.
In 2001, the probability you were incarcerated if you were Caucasian was
500, 000
 0.00237 or nearly 237 per 100,000 Caucasians. That same year, the
211, 000, 000
probability you were incarcerated if you were African American was
625, 000
 0.01812 or nearly 1812 per 100,000 African Americans. Again that same
34,500, 000
 0.00609 or
year, the probability you were incarcerated if you were Latino 213, 000
35, 000, 000
nearly 609 per 100,000 Latin Americans, the probability you were incarcerated if you were
 0.00099 or 99 per 100,000 Asians and the probability you were
Asian was 9,900
10, 000, 000
 0.00708 or nearly 708 per
incarcerated if you were Native American was 17, 700
2,500, 000
100,000 Native Americans.
The inferences you draw are based on your belief system, which has been evolving since you
were born. In the US in 2001, there were 211 million Caucasians and only 34 ½ million African
Americans. This means there were more than six times the number of Caucasians than African
Americans sprinkled through out the US population that year. Yet, there were more African
American inmates incarnated, 625,000, than Caucasian inmates, 500,000. Moreover, the rate per
100,000 within each respective ethnic group was over seven times greater in the African
American population compared to the Caucasian population. We’ll let you continue along this
train of thought as you consider ratios involving the other ethnic groups.
- 55 -
Determining the ‘reasonability of media’s statistics’
Remember numbers are not controversial, rather it is one’s interpretations that may be construed
as controversial. Let’s examine one case in point. We will use our probability knowledge mixed
with a little common sense to analyze a random media publication taken from the following
source: http://www.westernprisonproject.org/Publications/Factsheets/Prison_Index_Incarceration_Fact_Sheet.pdf.
The Prison Index
The Prison Index: Taking the Pulse of the Crime Control Industry is the first index of statistics about our nation’s
criminal justice system ever published. Containing 611 facts and 17 graphs and charts, this 48page volume presents, in black-and-white, the state of crime control in America. Below is just a sample of some of
the insightful and useful information, in easy to read index form.
Get the real facts about crime and punishment in the US today!
Incarceration is not an equal opportunity punishment.
On December 31, 2001, there were 1,955,705 people in U.S. prisons and jails.
As of December 31, 2001, the U.S. incarceration rate was 709 per 100,000 residents (7 out of every 1,000
Americans). But when you break down the statistics you see that incarceration is not an equal opportunity
punishment.
Select U.S. incarceration rates:
Whites: 235 per 100,000
Blacks: 1815 per 100,000
Latinos: 609 per 100,000
Asian: 99 per 100,000
Native: 709 per 100,000
Race is an important “filter” on who
goes to prison:
Males: 1,318 per 100,000
White males: 708 per 100,000
Black males: 4,848 per 100,000
Hispanic males: 1,668 per 100,000
Break it down by age and race, and you can see what is going on even clearer:
For Black males ages 25-29: 13,391 per 100,000. (That’s 13.4% of Black men in their late 20s)
Or you can make some international comparisons:
South Africa was internationally condemned for its racial policies under apartheid.
South Africa under apartheid (1993), incarceration rate for Black adult men: 851 per 100,000
U.S. under George Bush (2001), incarceration rate for Black adult men: 4,848 per 100,000
Select U.S. Incarceration rates from Bureau of Justice Statistics, Prison and Jail Inmates at Midyear 2000; Gender and Age &
Race statistics from BJS, Prison and Jail Inmates at Midyear 2001, Tables 1 and 15; Calculation for Black adult men uses data
from Tables 14 and 15 of the spreadsheet version of Prison and Jail Inmates at Midyear 2001 to count only Black men 18 and
older; Statistics as of December 31, 2001 from BJS, Prisoners in 2001. South Africa figures from Sentencing Project, Americans
Behind Bars: The International Use of Incarceration. The incarceration rates by race alone were calculated by Mother Jones for
the Prisons: Debt to Society, Racial Inequality page.
A Joint Project of The Western Prison Project and the Prison Policy Initiative
Western Prison Project POB 40085 Portland, OR 97240
Prison Policy Initiative POB 20038 Cincinnati, OH 45202
- 56 -
Is the above web page alarmist? Is the data presented on the page accurate? Are these different
questions? How were these numbers derived? What about the issue of coincidence versus
correlation (cause and effect)? Where do we start in our quest to place the information presented
above into some reasonable context within our mind?
We will confine our analysis to the International Comparisons made in the last paragraph, simply
because they appear to be the most “alarming” portion of the flyer. The flyer states, South
Africa was internationally condemned for its racial policies under apartheid. South Africa, under
apartheid (1993), had an incarceration rate for Black adult men of 851 per 100,000 and in the
US, under George W. Bush (2001) the incarceration rate for black adult men was 7,226 per
100,000.
We must ask ourselves before we do any serious number crunching, is the adult black man from
South Africa defined to be from the same age group as the African American male from the
United States. In other words, is the publication comparing equivalent demographics? If the
answer is yes, we continue. If not, the comparison is not justified. If we assume the answer is
in the affirmative, how or where do we start to analyze the argument?
We will need to make some assumptions in investigating the given data. As we examine the
statement “7,226 per 100,000” in terms of reasonableness, does this statistic even seem
reasonable? There are several ways to determine the reasonability of this statement. One
obvious route would be to track down the original source and verify the data. An alternative
would be to use our own numerical literacy and evaluate the reasonableness of the data our
selves. Back to our assumptions, is it reasonable to assume there are more male than female
African Americans incarcerated? Let’s say (for argument sake), 80 % of the African Americans
incarcerated are male. Since there are 625,000 African Americans incarcerated, 80 % of those is
0.80 x 625,000 = 500,000 or ½ a million African American males. Now, we test the statement
“7,226 per 100,000” or roughly 7.2 % of the African American males are incarcerated. Applying
some basic algebra, we have the equation, 0.072 x Number of adult African American males =
500,000. If we divide both sides of the equation by 0.072, we will identify the approximate
number of adult African American males in the U.S. After dividing, we have the Number of
adult African American males is approximately 7 million. This means there were nearly 7
million adult African American males in the United States in 2001. Does this seem reasonable?
We have seen earlier there are roughly 34.5 million African Americans in the US, so 7 million
out of 34.5 million is about 0.20 or 20 percent. It does seem reasonable that 20 percent of the
population of African Americans are adult males. If the number seems to be reasonable, why
does the article seem to be inflammatory? What “inferences” are the authors of the article
assuming the reader will make?
The phrase “under George W. Bush” needs to be considered. Is it logical to infer blame for this
alarming statistic and assign it to President George W. Bush? Were the adult African American
males incarcerated when the President took office? President Bush assumed the Office in 2000.
What proportion of the adult African Americans males were incarnated in either 2000 or 2001?
Could there be other explanations or other “directions to place blame” that would be more
rational or logical? If one investigated a possible correlation with welfare reform, possibly a
stronger and more rational argument could be constructed. Where in the summer of 1996,
- 57 -
Congress and the President at that time passed the "Personal Responsibility and Work
Opportunity Reconciliation Act of 1996." This act radically transformed the nation's welfare
system, a full four years before George W. Bush took the Oath of Office. Could this act have
been a major contributing factor for an escalating incarceration rate some five years later? Can
you think of other factors that would explain this statistic other than “under George W. Bush”?
Though these questions we present are rhetorical in nature, they are worth considering in
determining the reasonableness of the data presented in the flyer. Is the statement “under George
W. Bush” an example of nothing more than a coincidence or is he responsible for the
incarceration rate of adult African American males in the U.S. in 2001? A little numerical
literacy applied to data presented by media can go a long way in determining whether an
argument is a representation of a valid correlation or simply an observed coincidence.
Example Three
The Spinner
For the spinner below, if the wand is spun around, what is the probability…?
a)
b)
c)
d)
e)
f)
g)
it lands on a blue portion?
it lands on a 3?
it lands on a blue 3?
it lands on an even number?
it lands on a even number less than 6?
it lands on a number divisible by either 2 or 3?
What is the sample space?
Solution
a)
b)
c)
d)
e)
Of 8, there are 4 blue portions and 4/8 = ½ is the probability.
Of 8, there is 1 blue portion and 1/8 is the probability.
Of 8, there are no blue 3’s, 0/8 = 0 is the probability.
Of 8, there are four even numbered portions, and 4/8 = ½ is the probability.
Of 8, there are two even numbers, 2 and 4 , that are smaller than 6 and 2/8 = ¼ is the
probability.
f) Of 8, there are 5 numbers divisible by either a 2 or a 3. They are 2, 3, 4, 6, 8, thus 5/8 is the
probability.
g) {Blue 1, Blue 2, Blue 6, Blue 8, Red 3, Red 4, Red 5, Red 7}
Let’s go back to rolling a die. What happens if we roll two dice and observe the sum shown?
First, what is the sample space? The sample space is the collection of all possible outcomes.
Well, the smallest sum possible is a one on each die, and a one and a one yields a sum of a two.
The largest possibility is a six on both die, which gives a sum of twelve. All integers between
two and twelve are the possible sums when rolling two die. So, how many possible outcomes are
there? Below is the sample space:
- 58 -
1-1
1-2
1-3
1-4
1-5
1-6
2-1
2-2
2-3
2-4
2-5
2-6
3-1
3-2
3-3
3-4
3-5
3-6
4-1
4-2
4-3
4-4
4-5
4-6
5-1
5-2
5-3
5-4
5-5
5-6
6-1
6-2
6-3
6-4
6-5
6-6
The sample space consists of the 36 different possible combinations from rolling two die. Which
sum is least likely to occur?
Example Four
Lady Luck We roll two die and observe the sum shown. Use the sample space shown above to
find the probability of rolling the sum indicated and state whether the event is more likely to
occur or less likely to occur.
a) five
d) not a double or a six
b) not an eleven
e) a double and an eight
c) not a double
f) not a double and not a four
Solutions
a) There are 4 different ordered pairs from the sample space with a sum of 5 and you see them
along the diagonal of 4 –1, 3 –2. 2 –3, and 1 – 4. With 36 total outcomes possible, the
probability of rolling two die and obtaining a sum of 5 is denoted P(5) = 4/36 = 1/9.
b) This may be easier to answer using the compliment of the event. There are only two ways to
roll a sum of eleven, with a 5 – 6 or a 6 –5. So, P( 11 ) = 1 – 2/36 = 34/36 = 17/18.
c) Again, using the compliment, it is quicker since there are only six doubles, 1-1, 2-2, 3-3, 4-4,
5-5, 6-6. So, P(not a double) = 1 – 6/36 = 30/36 = 15/18.
d) We need to account for all of the non doubles as well as all of the sixes. The desired event
consists of the union of two sets of possibilities. The set of non-doubles and the set of ordered
pairs with a sum of six. The diagonal containing all the ordered pairs with a sum of six has 5
elements. There are 30 non-doubles and the cardinality of the union of the two sets is 30 + 5 – 4
= 31. thus, P(non-double or a six) = 31/36.
e) We need the desired outcome to be a double and an eight. There is only one outcome, 4 – 4,
that is both, a double and a sum of 8. P(double to equal a sum of 8) = 1/36.
f) We need the desired outcomes to be non doubles that are not fours. There are 30 non doubles,
and if we remove the non-doubles with a sum of a 4, we are left with 28 desired outcomes. So,
P(non double that is not a four) = 28/36 = 7/9.
- 59 -
Exercise Set
Before proceeding, recall that a standard
deck of playing cards consists of four each
of the numbers 2 though 10, 4 jacks, 4
queens, 4 kings and 4 aces. The face cards
or picture cards refer to the jacks, queens
and kings. There are 52 cards separated into
4 suits, clubs, diamonds, hearts, and spades
and within each suit are 13 distinct cards,
ace through 10, jack, queen and king. The
diamonds and hearts are traditionally red,
the clubs and spades are traditionally black.
15. What is the probability of tossing 1
head?
16. What is the probability of tossing at
least 1 head?
17. What is the probability of tossing no
heads?
For problems 18 to 22, we toss three coins.
18. What is the sample space?
For problems 1 to 12, assume 1 card is
drawn from a standard deck of 52 cards.
Find the probability of drawing a …
1. two 2. red two
19. What is the probability of tossing 2
heads?
20. What is the probability of tossing 1
head?
3. red two or a diamond
4. red two or not a diamond
5. red two and a diamond
21. What is the probability of tossing at
least 1 head?
22. What is the probability of tossing no
heads?
6. red two and not a diamond
7. a heart
For problems 23 to 29, we roll two die and
observe the sum shown.
8. face card or heart
23. What is the sample space?
9. a heart or a non-jack
24. What is the probability of rolling a sum
of a seven?
10. a heart and a non-jack
11. a red card or a black card
12. a red card and a black card
25. What is the probability of rolling a sum
of not a seven?
26. What is the probability of rolling a
double?
For problems 13 to 17, we toss two coins.
13. What is the sample space?
27. What is the probability of rolling a
double or a six?
14. What is the probability of tossing 2
heads?
28. What is the probability of rolling a
double and a six?
- 60 -
29. What is the probability of rolling a
double and not a six?
For question 30 to 35, we will discuss Gun
Control They say that Texans love their
guns. But, in Arizona, a century after the
wild-wild west days, about 67,000 people
legally carry guns. And in 2004 there are
approximately 5,456,453 people residing in
the state. Hence, the probability of
encountering someone who has a gun in
67, 000
Arizona is
 0.01228 . To
5, 456, 453
1
 0.0122 or 1 in
phrase it another way,
82
82 people in the state of Arizona tote a gun,
compared to Texas, where 1 out of every 92
people legally tote a gun. But, to quote
Gomer Pyle, surprise, surprise, Utah loves
their guns the most because 1 in every 40
people in Utah legally carry a gun.
30. 13,500 Arizonian women have gun
permits. What’s the probability of
encountering a gun toting women if you
meet someone from Arizona? Phrase your
chances of encountering a woman who
carries a gun in ordinary English.
31. 52 Arizonians are women over the age
of 80 who carry a gun. What’s the
probability of encountering a gun toting
women who is 80 years or older if you meet
some from Arizona? Phrase your chances of
encountering an elderly gun carrying woman
in ordinary English.
32. What is the probability that an
Arizonian who legally carries a gun is a
woman?
33. What is the probability that an
Arizonian who legally carries a gun is a
man?
34. In 2002, if 236,738 people in Texas and
57,907 people in Utah own guns, what is the
population of each state?
35. How many more times likely are you to
encounter a some one who has a legally
concealed gun if you run into a native from
Utah compared to a native from Arizona?
36. Earlier in the text we mentioned that in
2000, in this country, there were 6,594
injuries at fixed site amusement parks. We
also mentioned that roughly 317,000,000 or
over 300 million people visit fixed site
amusement parks in a given year. What is
the probability that someone who visited an
amusement park in 2000 would suffer an
injury? Then express the answer in 1 in
“how many” people were injured.
For questions 37 to 41, recall also that in
other recreational activities that same year,
82,722 people were injured on trampolines,
62,812 were injured in swimming pools,
544,561 were injured on bicycles and 20,000
were injured at music concerts We have no
way of knowing how many people bounced
on a trampoline, swam in a swimming pool,
peddled a bicycle or frequented a musical
concert that year. But it is safe to say the
chance of injury for any of those activities
was far higher than an amusement park visit.
To explore this another way, answer the
following question?
37. In 2000, how many people would have
to have bounced on a trampoline if the
probability (risk) of getting injured was
identical to the risk of visiting an
amusement park?
38. In 2000, how many people would have
to have swam in a swimming pool if the
probability of getting injured was identical
to the risk of visiting an amusement park?
- 61 -
39. In 2000, how many people would have
to have peddled a bicycle if the probability
of getting injured was identical to the risk of
visiting an amusement park?
40. In 2000, how many people would have
to have frequented a musical concert if the
probability of getting injured was identical
to the risk of visiting an amusement park?
41. Recalling how many people, 6.4 billion
people, live on this planet, explain the point
of exercises 37-40.
42. Project: Identify information presented
from any media source that has the
appearance of being “alarmist”. Use the
math skills we’ve developed to confirm the
argument or refute the argument.
Probability: Independent and Dependent Events
Now, let’s discuss the probability of two or more events occurring. We need to be careful
because we need to distinguish between whether or not the likelihood of one event influences the
likelihood of the other event.
The probability of a National Football League team winning the Super Bowl and a Democrat
winning the White House is a case where the outcome of event A, a football team winning the
Super Bowl, has no effect on the outcome of event B, which party wins a presidential election.
When considering two events, where outcome of one event has no affect on the outcome of the
other event, we call these independent events. Some other examples of independent events are
choosing a card from a standard deck of playing cards and getting a four, replacing it, shuffling
the deck and choosing a second card and getting an ace. Knowledge that you drew the four the
first time around does not affect the probability you will draw the ace. Whether or not you
pulled the four in the first event is irrelevant, because when you draw the second time, there are
still 4 aces in the 52 card deck. How does the probability of drawing an ace change if you leave
the first 4 drawn out of the deck? Tossing a coin and it landing on tails and rolling a pair of die
and getting a double are independent events, because whether or not your coin landed on heads
or tails, the chance of rolling a double is still 1/6. Another pair of independent events are rolling
a die and getting a 4, and then rolling a second die and getting a 5. In each of the previous
mentioned events, the knowledge of the first event occurring does not affect the likelihood of the
second event occurring.
When the likelihood of one event does affect the likelihood of the other event occurring, we say
the events are dependent on one another. Knowing a person weighs less than 150 pounds
decreases the likelihood the person is over 6 foot tall. Other examples include choosing a card
and getting a four, not replacing it and then choosing a card and getting an ace. Knowledge you
have drawn the four does affect the likelihood you will draw an ace, because this means there are
still 4 aces left in the remaining 51 cards (rather than the independent event with 52 cards).
Let’s see if we can visualize independent events. Toss two die. The result for the first die does
not affect the result for the second die. If first you roll a three, when you toss the second die, this
outcome does not depend on the value of the previous roll. So, if we asked what is the
probability of rolling a sum of a five when tossing two die, the answer is 4/36 or 1/9. This is
because there are only four possible ordered pairs with a sum of 5, a (1,4), (4,1), (2,3) or (3,2).
Independent events.
- 62 -
Let’s toss two coins in a row and ask what is the probability of tossing two heads in a row. Each
event’s outcome has no effect on the next event’s outcome. In other words, whether or not you
tossed a head the first time does not effect the likelihood you’ll toss a head the second time. The
probability of tossing a head is also ½ and the probability of tossing a head a second time is ½ so
the probability of tossing two heads in a row is ½ of a ½. In other words, the probability of
tossing the first head is ½, then to toss it twice in a row, it is ½ of the first ½. This is the
multiplication principle.
The multiplication principle says that to calculate
the probability two independent events will occur in a
row, just multiply the two respective probabilities.
Clearly, this can be extended to a sequence of events. The probability of tossing four heads in a
row is ½ x ½ x ½ x ½ = 1/16.
Example One
Rolling a single die repeatedly
A single die is rolled three times in a row. What is the probability …
a) all three rolls are either a 1 or a 2?
b) all three rolls are above 2?
c) all three rolls are not 2?
d) all are a 1 or all are a 2?
Solution
2 2 2
 
6 6 6
4 4 4
b)
 
6 6 6
5 5 5
c)
 
6 6 6
1 1 1
 
d)
6 6 6
a)
23
63
43
 3
6
53
 3
6
1 1 1 2
    3
6 6 6 6

- 63 -
Exercise Set
Probability of Independent Events
For problems 1 through 4, a card is chosen
at random from a deck of 52 cards. It is then
replaced, the deck shuffled and a second
card is chosen.
1. What is the probability of getting a jack
and an eight?
2. What is the probability of getting a jack
and then an eight?
3. A diamond and then a heart?
9. A nationwide survey found that 35 % of
adults in the United States like ice cream. If
two couples (4 adults) sit down for a meal,
what is the probability that all four like ice
cream?
10. Spin a spinner numbered 1 to 6, and toss
a die. What is the probability of getting a
number less than 5 on the spinner and a
number less than 5 on the die?
11. Spin a spinner numbered 1 to 6, and toss
a die. What is the probability of getting the
same number on the on the spinner that you
got on the die?
4. A non-diamond and then a red ace?
5. A dresser drawer contains one pair of
socks of each of the following colors: black,
brown, white, orange and stripped. Each
pair is folded together in matching sets. You
reach into the sock drawer and choose a pair
of socks without looking. The first pair you
pull out is stripped. Wanting something
different, you replace this pair and reach into
the drawer again. What is the probability
that you will get the stripped pair of socks
twice?
For problem 12 to 14, use a jar contains jelly
beans, and there are 100 pink, 200 purple,
50 white, 70 black, 30 orange and 300
yellow.
12. If two jelly beans are chosen from the
jar, with replacement, what is the probability
that both are green?
If two jelly beans are chosen from the jar,
with replacement, what is the probability
that
13. both are pink?
For problems 6 –8, use a coin that is tossed
and a single 6-sided die that is rolled.
6. Find the probability of getting a tail on
the coin and an even number on the die.
14. If two jelly beans are chosen from the
jar, with replacement, what is the probability
that both are not the same color?
7. Find the probability of getting a tail on
the coin or an even number on the die.
For number 15 to 18, use three cards that are
chosen from a standard deck of 52 playing
cards with replacement.
8. Find the probability of getting a tail on
the coin and an not a six on the die.
15. What is the probability of getting no
jacks or no diamonds?
16. What is the probability of getting 3
diamonds?
- 64 -
17. What is the probability of not getting 3
diamonds?
For problems 25 through 29, two cards are
chosen in succession from a deck of 52
cards.
18. What is the probability of getting aces?
For problems 19. and 20., use a 5-item
false quiz, where a student has
probability of correctly answering
question. Assume that answering
question are independent events.
true4/5
each
each
19. What is the probability that the student
will get all answers correctly?
20. What is the probability that the student
will get at least 4 answers correctly?
For problems 21 and 22, use a bowl of m
and m’s that contains 12 blue, 15 brown, 10
black 16 yellow, 10 red and 8 green m and
m’s. Reaching into the bowl, you grab a
single candy. It is brown. You toss it back.
Yuck. You reach in again. Brown. Again,
you toss it back.
25. What is the probability of getting a jack
and then an eight?
26. What is the probability of getting a jack
and an eight?
27. What is the probability of getting a
diamond and then a heart?
28. What is the probability of getting two
diamonds?
29. What is the probability of getting two
sixes?
For problems 30-35, use the following: a
bowl of m and m’s contains 12 blue, 15
brown, 10 black 16 yellow, 10 red and 8
green m and m’s. Reaching into the bowl,
you grab a handful of five pieces of candy.
21. What is the probability of getting all
brown m and m’s 3 times in a row?
30. What is the probability of getting all
brown m and m’s?
22. What is the probability of getting two
brown m and m’s and then an m and m that
is not brown?
31. What is the probability of getting 4
brown m and m’s?
For problem 23 and 24, use a school survey
that found that 8 out of 10 children like ice
cream. You are throwing a birthday party
for your kid. You are setting the table.
23. What is the probability that the first
three children you put out plates for like ice
cream?
24. What is the probability that the first
three children you put out plates for don’t
like ice cream?
32. What is the probability of getting 3
brown m and m’s?
33. What is the probability of getting 2
brown m and m’s?
34. What is the probability of getting 1
brown m and m’s?
35. What is the probability of getting no
brown m and m’s?
- 65 -
Probability Using Counting Theory
Many times to find either the number of desired outcomes or just to find the total number of
possible outcomes, the cardinality of the sample space requires counting techniques introduced
in Chapter One. Techniques like the Fundamental Counting Principle, Permutations or
Combinations become useful to us once again.
Example One
Suppose 12 men and 18 women apply for jobs. If four people are selected randomly to fill job
positions, a) find the probability only one man received the job b) find the probability everyone
who received the job was a woman.
Solution
Well, probability is defined as the number of desired outcomes divided by the number of
possible outcomes.
a) The number of outcomes where 1 man and 3 woman have been given jobs is characterized as
a combination problem because the order does not matter. Each person is given an equivalent
job. So, from 12 men, we choose 1, from 18 women, we choose 3. We want to identify all
possible combinations where 1 male and 3 females that can be formed into a selected group, so
we multiply these numbers together, 12 C1 18 C3 . Now, we have the number of possible 1 man
and 3 woman committees. The number of possible outcomes is found by finding the number of
ways four people can be chosen from a group of 30 people, choose 4 to get the job, or 30 C 4 . So,
C1 18 C3
0.357
C
30 4
b) The number of successful outcomes, 4 woman have be given the jobs is again a combination
problem. From 18 women, we choose 4 and from the 12 men, we choose 0. The number of
successful outcomes is again 30 people, choose 4 or 30 C 4 . So, the probability all the jobs go to
C  C
women is 12 0 18 4 0.112
30 C 4
the probability 1 man gets the job is
12
Example Two
Cards What is the probability that someone dealt a five card poker hand will get dealt a hand
with…
a) four aces
b) two aces and two sixes
c) not a straight flush
d) a pair
- 66 -
Solution
a) For the 4 aces, we choose all 4 of these aces, 4 choose 4, or 4 C4 and from the 48 other nonaces, we choose 1 of them 48 C1 . And we recall, there are 52 C5 possible 5 card poker hands. So,
C4 48 C1
 0.000018 .
52 C5
b) For the 2 aces, we choose 2 of the 4 or 4 C2 and for the 2 sixes, we again choose 2 of the 4
sixes or 4 C2 . Our fifth card could be any of the cards except for the 4 aces and the 4 sixes. So,
we have 44 C1 . And we recall, there are 52 C5 possible 5 card poker hands. So, we have
6(6)(44)
4 C 2  4 C 2  44 C1

 0.00061 .
52 C5
52 C5
c) For the non-straight flush, we have 2,598,960 possible 5 card hands – 4(9) possible straight
2,598,924
flushes. So, we
 0.997 .
2,598,960
d) For the pair, we need two of a kind and three other unique cards. For example, let’s consider
a pair of jacks. For two of a kind, we select two of the four jacks, 4 C2 . There are 13 different
cards that we could have as a pair, the aces, two’s, three’s and so on until we get to the kings.
So, now we have 13 4 C2 . For the other three cards, they must be different types, say a three,
five and six. For the three, we select 1 of the four three’s, 4 C1 , for the five, we select 1 of the
four five’s, 4 C1 , and for the six, e select 1 of the four six 4 C1 . Now, we must ensure the number
of ways to choose 3 different types of cards from the 12 remaining types that were not pairs.
13 4 C 2  12 C3  4 C1 4 C1 4 C1
This is 12 choose 3. So, we have
 0.423 .
52 C5
we have
4
Example Three
Birthday Problem You are having a party, some 20 guests are gathered, but there is tension in
the air. No one is mingling. No one is reaching out to get to know another. You need an ice
breaker. So, you say, I bet two of you have the same birthday? You go around the room, giving
each person the easiest topic of conversation of all, each person states their own birthday.
Maybe someone has a funny retort. Maybe someone has a funny story. Or maybe, just maybe,
two people have the same birthday. Something in common. And conversations begin.
Regardless of the year, what is the probability two people have the same birthday? How many
people would you have to gather to have more of a chance two people have the same birthday
than not? What is the probability that if 30 people attended the party, two would have the same
birthday? Or 40 party goers? How about 367 guests?
Solution
A general overview of the mathematical argument we are about to make is this: We will find the
probabilities associated with people who do not have the same birthdays. We will then find the
- 67 -
probability of the compliment set, the probability of at least two people having the same
birthday.
Our argument to find the solution to this question starts with assuming all birthdays are equally
likely to occur on any of the 365 days of the year and we will forgo the argument with respect to
leap year.
Starting with person 1, we record the birthday; one of 365 possible days in the year. Person 2
can have a birthday on any of the remaining 364 days that wasn’t the date of Person 1’s birthday.
Person 3 can have a birthday on any of the remaining 363 days that weren’t the birth dates of the
first two people. And so on.
The probability of person 2 having a birthday different than person 1 is found by realizing that
any of the other 364 days could be the birthday, just not the one day that belongs to Person 1. A
similar sentence could be echoed for Person 3, their birthday needs to be any of the remaining
363 days of the year that are different from the two already taken, so to speak.
Applying the multiplication principle, the probability of the first three people having different
365 364 363 365 P3
birthdays is



365 365 365 3653
Extending this pattern, the probability all 20 people have different birthdays is
365 364 363 346 365 P20


... 

365 365 365 365 36520
The complimentary probability for this statement is what we are seeking.
P20
365 364 363 346
1


... 
1 365 20
 0.41 . There is a 41 percent chance two people among a
365 365 365 365
365
group of twenty people will have the same birthday.
If you have 23 people at the party, 1 
P23
 0.507 or slightly over 50 percent chance two will
36523
365
have the same birthday.
P
 0.706 or 70 percent chance that two
365
individuals would have the same birthday. If you do not believe us, try it the next time such a
crowd is gathered. In case your curious, if you gathered 40 people, it would be highly likely that
two people had the same birthday. Hard to believe? Like we said, try it…
If 30 people had gathered at the party, there is 1 
365 30
30
- 68 -
Exercise Set
Probability Using Counting Theory
For problems 1 and 2, use a bowl of m and
m’s containing 12 blue, 15 brown, 10 orange
16 yellow, 10 red and 8 green m and m’s.
Reaching into the bowl, you grab five pieces
of candy.
1. What is the probability of getting 2
brown pieces and 3 red pieces?
2. What is the probability of getting only 1
brown piece?
For problems 3 to 12, find the probability of
obtaining each of the following 5 card poker
hands:
3. Royal Flush.
4. Straight Flush (5 cards in sequence in a
single suit, but not a royal flush)
5. Four Aces
6. Four of a Kind
7. Flush of Hearts
8. Flush
9. Four hearts
10. Three of a kind
11. Two Sixes and Three Aces
12. Two Aces and Two Kings
13. A pair
For questions 14 to 16, suppose license plate
"numbers" consist of all possible
arrangements of 3 letters followed by 2
digits.
14. What is the probability that a randomly
chosen license plate number has no repeated
symbol?
15. What is the probability that a randomly
chosen license plate number has the last
digit a one?
16. What is the probability that a randomly
chosen license plate number has no repeated
symbol and the last digit is a one?
For problems 17 and 18, in a shipment of
1,000 Play Stations, 46 are defective.
17. If a person buys two Play Stations, what
is the probability that both are defective?
18. If a person buys two Play Stations, what
is the probability that neither is defective?
For questions 19. through 21., suppose 4
men and 6 women are on a crowded
elevator. At the next floor, three people get
off the elevator.
19. Find the probability all are women.
20. Find the probability two are women
21. Find the probability none are women
22. What is the probability that in a group of
3 people at least two were born in the same
month?
23. What is the probability that in a group of
5 people at least two were born in the same
month?
24. What is the probability that in a group of
7 people at least two were born in the same
month?
25. What is the probability that in a group of
9 people at least two were born in the same
month?
26. What is the probability that in a group of
11 people at least two were born in the same
month?
27. What is the probability that in a group
of 20 people at least two were born in the
same month?
28. What is the probability that in a group
of 5 people at least two had the same
birthday?
29. What is the probability that in a group
of 10 people at least two had the same
birthday?
30. What is the probability that in a group
of 20 people at least two had the same
birthday?
- 69 -
31. What is the probability that in a group
of 30 people at least two had the same
birthday?
32. What is the probability that in a group
of 60 people at least two had the same
birthday?
33. What is the probability that in a group of
500 people at least two had the same
birthday?
For problems 34 through 38, two cards are
chosen in succession from a deck of 52
cards.
34. What is the probability of getting a jack
and then an eight?
35. What is the probability of getting a jack
and an eight?
36. What is the probability of getting a
diamond and then a heart?
37. What is the probability of getting two
diamonds?
38. What is the probability of getting two
sixes?
For problems 39-46, use the following: a
bowl of m and m’s contains 12 blue, 15
brown, 10 black 16 yellow, 10 red and 8
green m and m’s. Reaching into the bowl,
you grab a handful of five pieces of candy.
39. What is the probability of getting all
brown m and m’s?
40. What is the probability of getting 4
brown m and m’s?
41. What is the probability of getting 3
brown m and m’s?
42. What is the probability of getting 2
brown m and m’s?
43. What is the probability of getting 1
brown m and m’s?
44. What is the probability of getting no
brown m and m’s?
The Binomial Probability Profile
Often referred to as Bernoulli Trials, after John Bernoulli who first published in this area, the
probability studied in this section is the practical probability when one asks what’s the likelihood
you go to a party and meet five perspective mates, all of whom never married? Or meet five
perspective mates, all of whom are living alone? If you listen softly, this is the same question as
how many times can you roll snake eyes (a one on each die) five times in a row, or flip a coin
and have it land on heads five times in a row.
Let’s pick the first scenario, choosing a mate who has never been married and highlight this
example for you the Binomial Probability model. If you meet 5 perspective mates, this can be
considered the same experiment, meeting a mate, repeated over and over again, specifically here
five times. There are only two possible outcomes, the perspective mate has either never married,
or not. The trials are all independent because if you were to meet someone who was never
married, this does not affect the likelihood (probability) the next person you were to meet has
never been married. Finally, the outcomes for each experiment does not change, the probability
of someone in a certain age range who was never married is published by the US Census Bureau
for the year 2000.
- 42 -
These similarities, you may recognize, exist for our other scenarios we posed too. Those all
involve the same experiment, the quest for a perspective mate who lives alone, or the snake eyes
reoccurring as dice are tossed, the heads reappearing as a coin is flipped, repeated over and over
again, and again with five repetitions being specified for each event. In each question, there are
only two possible outcomes, like the perspective mate lives alone or they don’t live alone, the
snake eyes reoccur or doesn’t, the heads reappear or they don’t. The trails are all independent,
meeting a perspective mate who lives alone does not affect the probability the next person lives
alone. Rolling snake eyes does not enhance or deter the possibility snake eyes will show on the
next roll and clearly a flip of a coin showing a tail falls under the same guise. Finally, the
outcomes for each experiment does not change, the probabilities of each occurring remains the
same, each perspective mate has the same likelihood of living alone, according to the US Census
Bureau, that is, each roll of the dice has a probability of 1/36 that snake eyes will show, each flip
of the coin has a probability of ½.
Probabilistic events such as these are called Bernoulli Trials if the following are satisfied:
 the same experiment is repeated over and over again
 there are only two possible outcomes
 the trails are independent
 the probability for each outcome must remain the same.
Example One
According to the US Census Bureau, 2000, “the times they are a-changin’” as Bob Dylan used to
say. In 1970, the probability you encountered a 20 to 24 year old woman who never married was
0.36, but in the year 2000, the probability more than doubled to be 0.73. For a 20 to 24 year old
man, the probability also grew, from 1970, 0.55 to 2000, 0.84. For 34 year olds, the drastic
increase was even more acute. In 1970, the probability you encountered a 30 to 34 year old
woman who never married was 0.06, in the year 2000, the probability almost quadrupled to 0.22.
For 30 to 34 year old men, the results were nearly as striking because the probability you met a
man who never married in this age group more than tripled, from 0.09 in 1970 to 0.30 in the year
2000.
Assuming the probabilities have not changed much since the year 2000, let’s explore these
statistics. You’re at a social gathering and you encounter 5 women between 20 and 24 years old.
What is the probability …
a) three were never married
b) none of the 5 have ever been married
c) at least one has been married
d) at most 1 has been married
Let’s also ask the same question with regard to 20 to 24 year old men. What is the probability …
e) three were never married
f) none of the 5 have ever been married
g) at least one has been married
h) at most 1 has been married
- 79 -
Solution
a) Let’s clarify our problem. Slowly. We meet five women (n = 5) and if three of the women
were never married (k = 3), where the probability of encountering a 20-24 year old woman who
never married is 0.73 for each meeting (p = 0.73). We can now proceed. The long way first.
We want to explore the possible arrangements where three women were never married and two
were married . Assuming the first three women met were never married, the probability would
be (0.73)(0.73)(0.73). The probability a woman was married would then be 1 – 0.73 = 0.27. The
probability the remaining two women were married would be 0.27(0.27). By virtue of the
multiplication principle, we see the probability of meeting five women, the first three unmarried,
the last two married is calculated by (0.73)(0.73)(0.73)(0.27)(0.27) or more precisely
(0.733)(0.272). There are 10 different arrangements or orders in which someone encounters five
women, three unmarried, two married. To see this, use M for married and N for never married.
The ten various arrangements when meeting three women who never married and two who were
married. Or more concisely, NNNMM. The other nine ways would be NNMMN, NMMNN,
NMNMN, NNMMN, MNNMN, MNNNM, MMNNN, NNMNM, and NMMNN. What is a
shorter way of counting all of these possible arrangements? C(5,2), which we will write again
5
5(4)
3
more concisely as   
10. So, we multiply 10 with  0.73 (0.27) 2 to arrive at the
2
 2
answer, 0.28. This means that for every 5 women you meet in the age group 20 – 24, more than
one in four times you meet 5 women, 3 will have never been married.
Now, in terms of the Binomial Probability Profile, this pattern includes a repetitive train of
thought, that is, an experiment that is repeated over and over again. As politically incorrect as
this sounds, the trial is meeting women and a success is finding a woman who was never
married. We first consider the number of arrangements, which we can always find with a
combination. We multiply this combination with the probability of the success each time the
success occurs. This result is multiplied by the compliment of the probability’s success the
number of times it occurs. Always. Never to deviate. So, our formula is written:
n
k
b(n, k , p)     p  (1  p) n k
k
where n is the number of trials, k is the number of successes and p is the probability for the
success.
 5
3
a) b(5,3, 0.73)     0.73 (0.27)53  0.284
 3
b) If none of the women had ever married, then we want 5 successes out of 5 trials.
 5
5
b(5,5,0.73)   0.73 (0.27) 0  0.207
 5
c) If at least one women was once married, this means we have several cases to consider: one,
two, three, four or five married. If we speak in terms of a success, then we have several cases
here as well: If five were once married, then none were never married, if four were once
married, then one was never married, and continuing in this vain, two, three or four could also be
never married. So, we have 0, 1, 2, 3 or 4 successes to make the statement at least one woman
was once married true. Let’s use what we know about finding probabilities and use the
- 80 -
probability of the compliment of a set: 1 – the opposite probability, 1 – P(5 were never married.)
 5
5
1  b(5,5, 0.73)  1     0.73 (0.27)0  1  0.207  0.793 . Said in another way, if you were to
 5
meet 5 women 10 times, on the average, nearly eight of those times at least 1 of the five were
once married.
d) If at most 1 was once married, then either one or none were once married. Thus, 5 would
have been never married or 1 was once married, which implicitly states 4 were never married.
We have
 5
5
5
4
b(5,5, 0.73)  b(5, 4, 0.73)     0.73 (0.27)0     0.73 (0.27)1  0.207  0.383  0.59 The
 5
 4
next time you meet 5 women between the ages of 20-24, over half the time at most 1 was once
married.
 5
3
e) b(5,3, 0.84)     0.84  (0.16)53  0.152
 3
f) If none of the men were never married, we want 5 successes out of 5 trials.
 5
5
b(5,5, 0.84)     0.84  (0.16)0  0.418
 5
 5
5
g) 1  b(5,5, 0.84)  1     0.84  (0.16)0  0.582 . Over half the time you meet five men, at
 5
least 1 of them was once married.
 5
5
5
4
h) b(5,5, 0.84)  b(5, 4, 0.84)     0.84  (0.16)0     0.84  (0.16)1  0.816 . So, 8 out of
 5
 4
every 10 times you meet 5 men, at most 1 of them was once married.
Example Two
You roll a pair of dice, what is the probability you get snake eyes three times out of 5 rolls, and
oury next question is what is the probability of getting snake eyes five times in a row? What
does your intuition tell you? Pretty low likelihood, small probability; you are right.
Solution
a) The probability of getting snake eyes is 1/36, so we have 5 trials, 3 success, each with a
3
 5
   1
(35 ) 2  0.000203
probability of 1/36. So, we have b 5,3, 1
36
36
36
 3
b) The probability of getting snake eyes is 1/36, so we have 5 trials, 5 success, each with a
5
 5
   1
(35 )0  1.654 x108 . This tells us
probability of 1/36. So, we have b 5,5, 1
36
36
36
 5
0.00000001654 is the probability or the likelihood of tossing 5 pairs of snake eyes in a row is
less than 1 in 200 million.


 


 
Example Three
- 81 -
A fair coin is tossed 8 times. What is the probability of getting at most 7 heads?
Solution
Each reoccurring probability is simple enough, the probability of getting a head when you flip a
coin is ½. But, to calculate the answer directly, we would have to considered all of the options
possible that could yield at least 7 heads. So, we would add, loosely phrased,
P(0 H) + P(1 H) + P(2 H) + P(3 H) + P(4 H) + P(5 H) + P(6 H) + P(7 H). This calculation is
lengthy. Instead, we will use the strategy employing the probability of the compliment. This
would mean 1 - the opposite of (P(0 H) + P(1 H) + P(2 H) + P(3 H) + P(4 H) + P(5 H) + P(6 H)
8
 8
+ P(7 H)). This would be 1 – P(8 H) = 1b 8,8, 1  1    1 ( 1 )0  0.996
2
2
 8 2


 
Exercise Set
For all 29 questions from this section: Write
the probability in the form b(n,k,p) and then
find the probability to six decimal places.
8. no one lives alone.
9. there is at least one household with more
than one person.
For questions 1 to 9:The following data extracted from
For problems 10-13, The following data extracted from
the US Census Bureau Occupation 2000, Issues august, 2003 .
Bases on the most detailed level of
occupations available in Census 2000 from
509 occupations. Below is a table with the
Ten Occupations Employing the Most
Women for the United States, 2000. There
were 60,630,069 women 16 years and older
employed in the civilian capacity in 2000 in
the United States.
the US Census Bureau Population Profile of the United States:
2000, Chapter 5.
In 1970, the probability a
household consisted of someone living alone
was 0.17, while by the year 2000, the
probability had grown to 0.26. Suppose you
live a cul-de-sac where there are 8
households and suppose the probability of
some living alone has not changed very
much since the year 2000. Find the
probability using the data from the year
2000 where:
1. there are two house holds where someone
lives alone.
2. there are three house holds where
someone lives alone.
3. there are four house holds where
someone lives alone.
4. there are five house holds where
someone lives alone.
5. there is someone living alone on your
cul-de-sac.
6. there is one household where someone
lives alone.
7. there are no families on your cul-de-sac,
everyone lives alone.
Occupations
Secretaries, Administrative Assistants
Elementary, Middle School Teachers
Registered Nurses
Cashiers
Retail Salespersons
Bookkeeping, Accounting, Auditing
Nursing, Psychiatric, Home Health
Aides
Customer Service Representatives
Child Care Workers
Waiter, Waitress
Number
3,597,535
2,442,104
2,065,238
2,030,805
1,775,889
1,526,803
1,469,736
1,396,105
1,253,306
1,228,977
You meet six employed women (in the
civilian capacity) at a party.
- 82 -
10. What is the probability half of the
women are elementary or middle school
teachers?
11. What is the probability none of the six
are in any of the top ten occupations listed
above?
12. What is the probability at most two of
the six are secretaries or administrative
assistants?
13. What is the probability at least one of
them are either child care workers or
nursing, psychiatric or Home Health Aide
Specialists?
For problems 14-17, use a fair coin that has
been tossed six times.
14 What is the probability of tossing a tail
once?
15 What is the probability of tossing at
most 3 tails?
16 What is the probability of tossing at least
5 tails?
17. What is the probability of tossing at
most 5 tails?
For problem 18-20, we roll 2 dice seven
times and observe the sum.
18. What is the probability of rolling the
sum of a four exactly 5 times?
19. What is the probability of rolling the
sum of a five or seven exactly 5 times?
20. What is the probability of rolling the
sum of a four at least once?
Problems 21-26. The professional. A true
professional basketball player has nerves of
icy steel. If they miss their foul shot, they
are not affected the next time they shoot.
The likelihood of hitting or missing the foul
shot is not going to be affected by their prior
stint at the line. Independent probabilities.
If they miss three times in a row, they are
not affected the next time they shoot. Fives
misses in a row, still, they are not affected
the next time they shoot. The same is true
for a professional baseball player. If they do
not get a hit, they are not affected the next
time they are at bat, ten hitless at bats, no
problem. There is no such thing as a slump
in professional sports. In other words, the
probability of success remains the same, and
they are independent.
A basketball player has a shooting
percentage of 0.850 at the free throw line.
21. What is the probability he will hit 11
out of 12 free throws?
22. What is the probability he will hit 10 or
more out of 12 free throws?
23. What is the probability he will miss 12
free throws in a row?
Consider the baseball player who has a
batting average of 0.300.
24. What is the probability he will go hitless
in a game, say 0 for 4?
25. What is the probability he will get a hit
in all 4 at bats?
26. What is the probability he will go 2 for
4?
Problems 27-29. Woe is me. You order
blank Cd’s, 10 of them. You research and
learn that they probability of getting a
defective Cd is 0.06.
27. You think woe is me, just my luck and
you wonder what the probability is of
getting at least 1 defective Cd. Well, what is
it?
28. What is the probability of getting half
defective?
29. What is the probability of getting the
majority of Cd’s defective?
- 84 -
Conditional Probability
Let’s compare two questions:
 What is the probability that the sum of two dice tossed is smaller than a 4?
 If we know one of the die shows a one, what is the probability the sum of two
dice tossed is smaller than 4?
The answer to the first question is 3/36 or 1/12 because the possible successful outcomes
are (1,1), (1,2) and (2,1) from the 36 events that make up the sample space. But, if we
are privy to inside information like we already know that one die shows a 1, we call this a
conditional probability because we are given a condition for the probability of an event
to occur. The probability the sum on two dice tossed is smaller than a 4 is increased to
2/6 = 1/3 because we are given that a one occurred on the first roll and thus we are
reduced to six equally likely possibilities, (1,1), (1,2), (1,3), (1,4), (1,5) and (1,6). Two of
these six, (1,1) and (1,2) have a sum less than a four.
 We write “P(sum less than 4 a one was rolled first)”
 We read “the probability of a sum less than 4 given that a one was rolled first.”
In ordinary English, conditional probability asks what is the probability that one event
will occur given that another event has already occurred.
Which is more likely, the probability that some one who speaks Spanish is from Mexico or that
someone from Mexico speaks Spanish? In the first case, we are saying given the person speaks
Spanish, what’s the probability they are from Mexico. Written with probability notation, we
have P(a citizen of Mexico| person speaks Spanish). In other words, we are restricting our
sample space to be those who speak Spanish, and we are asking of these people, what is the
probability they are from Mexico. In the second case, we have a sample population of citizens
from Mexico, and we want to know, of those people, what is the probability a random person
speaks Spanish and we denote it P(person speaks Spanish| a citizen of Mexico).
Rephrased, we are asking you to decide if it is more likely a person from Mexico speaks Spanish
or is it more likely if a person who speaks Spanish, then they are from Mexico. It is more likely
that a person who is from Mexico speaks Spanish. Employing conditional probability notation,
we would write
 P(a citizen of Mexico| person speaks Spanish) < P(person speaks Spanish| a citizen of
Mexico).
Remember, probabilities are just numbers with feelings and emotions like any number.
If you are from Mexico, then you speak Spanish or if you speak Spanish, then you are from
Mexico are logically speaking, neither always true or always false. Conditional probabilities
help us determine which is more likely.
- 70 -
Example One
Test Results A teacher gave her class two tests. 60 % of the class passed both tests, and 80 % of
the class passed the first test. What percent of those that passed the first test also passed the
second test?
Solution:
P(second | first ) 
P(first and second ) 0.60

 0.75
P(first )
0.80
Example Two
US Army Demographics
The following information was taken from the ARMY DEMOGRAPHICS, FY03, Army Profile, US Army, Headquarters, Dr. Betty D. Maxfield,
Office of Army Demogrpahics.: [email protected]
Race by Gender - Enlisted
Male
Female
White
208,698 23,667
Black
81,024 27,317
Hispanic 37,882 7,176
Asian
11,925 2391
Other
11,225 2392
Total
350,754 62,943
Total
232,365
108,341
45,058
14,316
13,617
413,697
Find the following probabilities and explain the significance in a complete sentence.
a) P(male)
b) P(black female)
c) P(female| black)
d) P(black| female)
e) P(Asian| male)
f) P(male| white or black)
g) P(white or black| male)
Solution
350,754
 0.848 or nearly 85 out of 100 of the enlisted in the Army are men.
413,697
27,317
b) P(black female) =
 0.066 or nearly 6 2/3 out of 100 of the enlisted in the Army are
413,697
black females.
a) P(male) =
c) Note: Because it is a conditional probability, we must restrict our sample space to only those
27,317
enlisted personal who are black. P(female| black) =
 0.252 or nearly 25 out of 100 of
108,341
the Army’s enlisted personnel that are black are women.
- 71 -
27,317
 0.434 or nearly 43 out of 100 of the Army’s enlisted personnel
62,943
that are women are black.
d) P(black| female) =
11,925
 0.034 or nearly 3 ½ out of 100 of the Army’s enlisted personnel
350,754
that are men are Asian.
e) P(Asian| male) =
208,698  81,024
 0.851 or men comprise nearly 85 out of 100 of
232,365  108,341
the Army’s enlisted personnel that are either white or black.
f) P(male| white or black) =
208,698  81024
 0.826 or nearly 83 out of 100 of the Army’s
350,754
enlisted men are either white or black.
g) P(white or black| male) =
Exercise Set
Conditional Probability
For problems 1 through 3, an opinion poll
was administered to the students at the
University of America. Use the table below
because it reflects the response to the
following question, "Are you for or against
the death penalty?"
For
Against Not Sure Total
Men
20,000 10,000
5,000
35,000
Women
10,000 5,000
5,000
20,000
Total
30,000 15,000 10,000
55,000
A student is selected at random, find the
probability the student was ...
1. male
2. male, given that the student was against
the death penalty
3. for the death penalty given that
the student was a woman
For problems 4 to 10, use the
following data. A fox News poll in
2003 indicated 69 % of Americans
favor the death penalty. This is 7
percentage points down from 6 years
earlier. Moreover, according to a
new Gallup Poll, there is an even
closer split over the issue of what
should be done with a convicted
murderer. When asked if you were
in favor of the death penalty, a
whopping 74 % of answering the
poll responded ‘yes’, but the support
dropped to 53 % when people were
offered the alternative “LWOP –
Life Without the Possibility of
Parole.” Below is a mock table
reflecting what the demographics
would look like for such a poll,
separated by education. HS stands
for High School Education or Less,
C stands for College Graduate – 2 or
4 year degrees, and P stands for postbachelor’s, that is, Master’s Degree
or above. Similarly, DP stands for
Death Penalty, LWOP stands for
Life Without the Possibility of
- 72 -
Parole and No stands for No
Opinion.
HS
C
P
DP
1251
799
176
LWP
758
780
310
No
91
21
14
A person was selected at random.
Find the probability the person …
4. was for the death penalty.
5. was against the death penalty.
6. had a 2 or 4 year education, given
that the person responded LWP.
7. responded LWP given the person
had a Master’s degree or higher.
8. was for the death penalty, given
that the person had a high school
education or less.
9. was for the death penalty, given
the person has a 2 or 4 year degree.
10. was for the death penalty, given
the person had a Master’s degree or
higher.
For question 11 to 15, use the following
data: In early 2004, many polls, such as
those published by the PEW Research
Center on March 16th of that year, focused
their questions on nation of Iraq. Below is
representative mock table reflecting the
relative demographics of some of these
polls. The question asked was “After
Saddam Hussein, the Iraqi People will be” a)
Worse Off b) Better Off c) Don’t know.
The results are the number of respondents to
the question, by nationality.
Worse Better
Know Total
Off
Off
US
9
84
3
96
Britain
11
82
2
95
Germany
27
65
8
100
Jordan
70
25
3
98
Pakistan
61
8
10
79
Total
178
264
26
468
A person was selected at random.
Find the probability the person …
11. answered “Worse Off”
12. answered “Worse Off”, given
that they were from Jordan
13. was from Jordan, given that they
answered “Worse Off”
14. answered “Worse Off”, given
they were from the US
15. was from the US, given that they
answered “Worse Off”
16. A jar contains pink and white
jelly beans. Two jelly beans are
chosen without replacement. If the
probability of selecting a pink jelly
bean and a white marble is 0.50, and
the probability of selecting a pink
jelly on the first draw is 0.60, find
the probability of selecting a white
marble on the second draw, given
that the first marble selected was
pink.
17. At an Italian restaurant, 60 % of the
customers order a wine. If 50 % of the
customers order a pasta and wine, what is
the probability that a customer who orders a
wine will also order pasta?
18. The probability that a student is absent
and that it is Friday is 0.05. Since there are 5
school days in a week, the probability that it
is Friday is 0.2. What is the probability that
a student is absent given that today is
Friday?
For problems 19 to 23, which is greater?
19. P(he or she speaks English| he or she is
American) or P(he or she is American| he or
she speaks English)?
20. P(he or she is wealthy| he or she is a
orthodontist) or P(he or she is a orthodontist|
he or she is wealthy)?
21. P(it is raining| humidity is high) or
P(humidity is high| it is raining)?
22. P(person is in the NBA | person is
considered tall) or P(person is considered
tall | person is in the NBA)?
- 73 -
23. P(vegetable is green | vegetable is
broccoli) or P(vegetable is a broccoli |
vegetable is green)?
Miscellaneous Problems Using
Probability
1. If a number is chosen at random from the
following set,{2,3,4,5,6,7,8,9,10} what is
the probability that it is not prime?
2. A number from 1 to 5 is chosen at
random. What is the probability that the
number chosen is not odd?
For questions 3 through 6, the experiment is
rolling a single die and tossing 2 coins.
3. Describe the sample space.
4. Find the probability of rolling a three on
the die.
5. Find the probability of tossing at least
one head.
6. Find the probability of rolling a three on
a die or tossing at least one head.
For questions 7 to 8, according to a recent
elementary school survey, a child waiting in
line in a school cafeteria has a probability of
1/7 of not selecting a drink during lunch.
Assuming independence by making believe
there is no such thing as peer pressure, find
the probability ...
7. three children in a row won't select a
drink.
8. the first child will select a drink, the next
two children to follow won't.
For problems 9 and 10, two cards are drawn
from a deck. The first card is replaced back
into the deck after it is drawn. Find the
probability ...
9. of selecting a two on the second draw if a
two is drawn on the first draw.
10. of selecting a two on the first draw and
a two on the second draw.
For problems 11 to 15, a pair of dice are
rolled, find the probability …
11. of rolling a sum of a 2
12. of rolling a sum of a 5
13. of rolling a sum of not a double
14. of rolling a sum of a 6 given that a 2
appeared on the first die.
15. of rolling sum less than 10 if we know a
5 or 6 was not rolled on the first die.
Bayes Theorem
Certainly, we all have some concern about the accuracy of those polls we see every day
presented by the media. We routinely ask ourselves, are the polls biased or are they merely just
trying to locate the collective pulse of their audience? We often assume there is some hidden
agenda, be it political or social, behind the wording of the questions. We know the words chosen
when creating the polls can be, and often are chosen to solicit specific answers. But, maybe
instead, we should ask ourselves, what information is provided for us and what information is
deleted from us? And we do have the capacity to find the deleted information. The following
two stories are taken straight from viewing television one evening.
“According to a recent Gallop Poll, when asked which side of the political debate on abortion do
you sympathize with more: the right-to-life movement that believes abortion is the taking of
human life and should be outlawed, OR, the pro-choice movement that believes a woman has the
right to choose what happens to her body, including the right to decide to have an abortion, 43 %
of those polled answered Right-to-life.” Interestingly, half of the republicans polled sided with
the Right-Life argument. Does this mean we cannot draw along party lines on this issue and that
- 74 -
the abortion debate transcends party lines, polarizing us in directions we may not want to admit,
as the media would like us to believe?
“Here in Winstonboro County, 15% of the population developed lung cancer. And while 40% of
the county’s residents are smokers, 80% of those developing lung cancer are smokers.” While
this is informative, doesn’t it seem natural that you would want to know what the probability of
the exact opposite? Wouldn’t you want to know what the probability was for someone who
smokes to develop lung cancer? If you smoke, you may want to know the risks. Said another
way, you do not know right off if you have lung cancer or not, but you certainly know whether
or not you smoke. What are your chances, as a smoker, of developing lung? Isn’t this a more
useful question. It seems the report is dancing around this core point. Why not tell us directly?
Next, we will develop the skill to answer the question directly.
Let’s explore these two questions. In the first scenario, the Gallop Poll, 43% answered Right-tolife, and half of the republicans answered Right-to-life. When trying to connect party lines to the
abortion issue, the more relevant question is “if someone is opposed to
abortions, what is the likelihood they are republican?” Or more precisely, in mathematical
terms, of those respondents polled who aligned themselves with the Right to Life movement,
what is the probability they are republican? More abstractly, does the issue of abortion divide us
along party lines or does it transcend party lines as the media implies?
Let us begin by obsessing over the poll itself by defining the type of poll we have here. One
aspect of the poll is to place someone in a definitive position, either your are “for” or you are
“against” the Right to Life movement. How was this done? The poll was worded so that the
respondents were forced to declare themselves either Pro-choice or Right-to-life. Not both. Poll
participants were then asked to check off the party that most closely reflects their views, thus
they had to declare themselves to be either democrat, republican or independent. No one was left
sitting on the proverbial fence.
Now, let’s slow these sentences down. Inherent in the poll itself are three characteristics. Those
answering our poll comprise the sample space. Each individual from the sample space declared
themselves pro Right-to-life or pro-choice, and then further declared themselves democrat,
republican or independent, there was no overlap between these sets. No one was both Pro-life
and Pro-choice. No one was a member of two political parties. Two sets are said to be mutually
exclusive where there is no overlap between the two sets. Next, the poll consisted of everyone
who participated. You’re probably reading this and saying ‘no kidding’. But this means
something to us, that the union of the sets comprise the whole sample space. In other words, all
of the Pro-choice and all of the Right-to-life respondents comprise everyone. Similarly, all of
the respondents from the three political parties comprise all of the respondents too. Finally,
every set has at least one member. In other words, if no one was an Independent, we would not
include it in our sample space. When sample spaces are separated into non-empty, mutually
exclusive sets, we say the space has been partitioned.
A partition is where the sample space it divided into sets where
1. the sets are mutually exclusive
2. the union of all sets is the whole sample space
- 74 -
3. there is no empty set
 Bayes Rule can be applied to partitioned sample spaces.
Now, let’s examine all of the data so that we may answer our real question, which is ‘does
abortion divide us along party lines?’ According to the poll, 43 % of the respondents declared
themselves to be Right-to-life. So, everyone else declared themselves to be Pro-choice. That is
57 %, right?
Now, we will dig deeper into the data taken from the poll. According to the poll, of those who
declared themselves to be Right-to-life, 40 % percent were democrats, 55 % were republicans
and 5 % were independent. Of the respondents who declared themselves to be Pro-choice, half
were democrats, 45 % were republicans and again, 5 % were independent. Lot’s of data crying
out for some order. Let’s use a tree diagram to re-examine the data.
Now, let’s selectively read the tree diagram. It seems that 40% of the Right-to-life respondents
were democrat. Thus, 40 % of 45 % of the population were democrats who labeled themselves
Right-to-life. That is 0.40(0.45) = 0.18 or 18 percent of the population. Continuing in this
manner, 55 % of the 45 % of the respondents were Republicans who declared themselves Rightto-Life, of 0.55(0.45) = 0.2475 or 24.75 %. Let’s re-examine our tree diagram now.
- 75 -
Adding up these percentages, we have the sum 0.18 + 0.2475 + 0.0225 + 0.275 + 0.2475 +
0.0275 or 1, thus 100 percent of the whole population. The republicans comprise 0.2475 +
0.2475 or 49.5 % of the respondents. Notice, half of the Republicans declared themselves Rightto-life, the other half were Pro-choice.
Now, let’s ask some of our own questions.
1. Given a democrat, what was the probability he or she was Pro-Choice? Right-to Life?
Solution
Written in terms of conditional probability, we have
P( PC D)
0.275
P( PC | D) 

 0.604 and
P ( D)
0.18  0.275
0.18
P( RTL | D) 
 0.396 or 1  0.694  0.396
0.18  0.275
2. Given a republican, what was the probability he or she was Pro-Choice? Right-to-Life?
Given a democrat, what was the probability he or she was Pro-Choice? Right-to Life?
Solution
Written in terms of conditional probability, we have P( PC | R) 
P( RTL | R) 
0.2475
 0.5 and
.2475  0.2475
0.2475
 0.5
.2475  0.2475
3. Given some one who was Right-to-Life, what was the probability he or she was a democrat?
A republican?
Given a democrat, what was the probability he or she was Pro-Choice? Right-to Life?
Solution
Written in terms of conditional probability, we have
P( D RTL) (0.45)(0.40)
P( D | RTL)

 0.40 and P( R | RTL )  0.55
P( RTL)
0.45
4. Given someone was Pro-Choice, what was the probability he or she was a democrat? A
republican?
Given a democrat, what was the probability he or she was Pro-Choice? Right-to Life?
Solution
Written in terms of conditional probability, we have P( D | PC )  0.50 and P( R | PC )  0.45
- 76 -
It seems now you are armed with enough information to ask your self if the abortion issue falls
along party lines or is bi-partisan (transcends party lines).
Using the mathematics developed in the previous example we can quickly assess the smoking
example. Recall, 15% of the population developed lung cancer and 40 % of the county’s
residents were smokers, we knew that 80% of those who developed lung cancer were smokers.
But, we wanted to know given the probability for someone who smoked to develop lung cancer?
You want to know the precise risk involved in smoking.
Solution
We want to know, of those who smoke (given someone smokes), what is the probability from
this group that someone will develop ling cancer.
P( A  B)
We will use P( B | A) 
. We want
P( A)
P( S  LC )
P( Lung Cancer | Smo ker)  P( LC | S ) 
P( S )
What do we know? Well, 15% of the population developed lung cancer can be written P(LC) =
0.15 and 40 % of the county’s residents were smokers can be written P(S) = 0.40. Also, 80% of
those who developed lung cancer were smokers can be written
P(S | LC) = 0.80. So,80% of 15% of the populations developed lung cancer.
P( S  LC ) (0.15)(0.80)
P( LC | S ) 

 0.30 or 30 % risk.
P( S )
0.40
- 77 -
Exercise Set
For problems 1-7, use the following a
popular opinion poll about the upcoming
presidential election, where 30 % responded
they were Democrat (D), 45 % responded
they were Republican (R), and surprisingly
25% said they were Independent (I). After
the election, it turned out 65 % of the
Democrats voted (v) against the ban of
liquor on Sundays, 80 % of the Republicans
voted against the ban and only 20 % of the
Independents voted against the ban.
1. What is the probability that someone
voted against the ban?
2. What is the probability that someone who
does not vote against the ban is a Democrat?
3. What is the probability that who was
Independent did not vote against the ban?
4. What is P( D | v) ?
5. What is P ( D | v ) ?
6. What is P (v | D ) ?
7. What is P (v | R ) ?
For problems 8-14, use the following a
popular opinion poll given to students at
Southwestern college regarding the death
penalty, where 20 % of the respondents were
Freshman, (F), 30 % were Sophomores (So),
35 % were Juniors (J) and 15 % were
Seniors (Sr). Of the Seniors, 70 % said they
opposed the death penalty (d ) , of the
Juniors only 30 % opposed the death
penalty, and for the Sophomores, 40 % were
in favor of the death penalty, while half of
the freshmen were in favor and half were
against.
8. What is the probability that a freshman
was against the death penalty?
9. What is the probability that someone who
was against the death penalty was not a
sophomore?
10. What is the probability some who was
in favor of the death penalty was a senior?
11. What is P( So | d ) ?
12. What is P (d | Sr ) ?
13. What is P(d | F or So ) ?
14. What is P( J | d ) ?
15. “In a recent poll, 35 % said that Bobby
Bonds (b) was the greatest baseball player
ever. Of those who thought Bonds was the
greatest ever, 80 % felt that the wide spread
use of steroids has not cheapened (c ) the
very fabric of baseball and that the records,
once hallowed, are still ‘legit’. It should be
noted, though, that 60 % of all polled did
think that the records were cheapened by the
wide spread us of steroids among today’s
athletes.” Is this just an ‘old school, new
skool’ issue, case of the younger generation
feeling one way, the older another. Or is
this reflective of a larger problem is our
society, an ever widening divergence
between generations in what we adhere to as
morality. It is clear that a large percent of
those who thought that someone other than
Barry Bonds was the greatest ever also
thought steroid use among today’s athletes
has cheapened baseball. What was that
percent?
- 78 -
Expected Values
Suppose you are considering purchasing a small insurance company. Is it a profitable venture?
How would you answer such a question? For ease, let us suppose each policy costs $500 per
year. You check the yearly records from the past 20 years and it appears the numbers don’t
really change all that much. Yearly, 9995 out of 10,000 policy holders never result in a claim, 2
out of 10,000 result in a claim of an average of $2,000, 1 out of 10,000 result in a claim of an
average of $5,000 and 2 out of 10,000 result in huge payoffs, claims that average half a million
dollars. Should this scare us off from purchasing the company?
In ordinary English, we are searching for the expected value, and we may find this with respect
to the policy holders or for the company. Let’s find the expected value for the company.

if it is positive, then we should buy the company, because in the long run, for each policy
holder, we will take in more money than we pay out.

If it is negative, then we should not buy the company, because in the long run, for each
policy holder, we will pay out more money than we take in.
Expected values, weighted averages or expected winnings are mathematically all one and the
same. The expected value is found multiplying the event (often stated as an amount) by the
probability that event (or the amount stated) will occur and then adding each subsequent similar
product. The expected value is the result of the sum of all these products.
n
a p a p a
i 1
i
i
1 1
2
p2  a3 p3  ...  an pn , 1  i  n,
n is the number of different events (or outcomes )
where ai is eachindividualevent , and
pi is the probability that " i th " event will occur
So, to find the expected value, we take each claim amount and multiply it by the probability that
claim amount could occur. We then sum together each result.
9995
2
1
2
0
2, 000 
5, 000 
500, 000 100.90
10, 000 10, 000
10, 000
10, 000
We see, annually, for each policy holder, the average to pay off is $100.90 and the policy holder
paid $500. We expect to make $500 - $100.90 = $399.10 per policy holder each year. The next
question would be, is that enough to run the business?
If we had only three clients, the expected value tells 3 x $ 399.10 = $1,197.30 is the predicted
amount our company could expect to net. Who knows if the company will be profitable because
maybe one of the unlikely half a million dollar claims would come through and all we would
- 85 -
have is $1,500 (3 x $500) to draw from. But, if we had 10,000 clients, we could reasonably
expect to take in $399.10 x 10,000 = $3,991,000 or nearly 4 million dollars per year. If we had
25,000 clients, our company could reasonably expect to take in $399.10 x 25,000 = $9,977,500
or nearly 10 million dollars annually. Even though we pay out an average of 2 claims equaling a
half a million dollar each for every 10,000 policy holders, we make money because most of our
policy holders result in us making $500 per year. The more clients we have, the more money we
can expect to flow into the company. But, we need to be aware that to find the expected value, it
does not matter how many policy holders we have, the expected value refers to what is the
expected amount of money the company will either take in or pay off per client.
In general, the theoretical probabilities of experiments or events are experienced after many,
many trials have been conducted. So, if you are playing a con game, play as many games as
possible, if you are taking in clients, get as many as possible, if you are answering questions on
an exam, answer them all with the same strategy. The rule to remember is that the expected
value is more likely to be approached in the long haul.
Example One
Con game On a street corner, you come across a folding table, two chairs and a bag. A street
vendor is smiling. In the bag, there are 10 one dollar bills, 15 five dollar bills, 5 twenty dollar
bills, 2 one hundred dollar bills and 15 blank pieces of paper. You are charged $10 to play. The
player, blind folded, reaches into the bag and withdraws a single item. You keep what draw.
What is the expected value for the player? Would you play?
Solution
10
15
5
2
15
1 5 
20  100 
0  8.19 . You expected value is $8.19 - $10 = - $1.81.
47
47
47
47
47
This number means you may expect to lose $1.81 every time you play the game, even though
there may be occasions where you will win $20 or $100.
CHECK: Note, a good check is too add across the numerator, 10+15+5+2+15 = 47, which
matches the number of pieces in the bag, thus matching the denominator, thus insuring you have
accounted for all probabilities associated with each dollar amount.
Example Two
Standardized Tests We have all taken these huge standardized exam, SAT, ACT, MCAT,
LSAT, ABCTE, PCAT, TDEFL, GRE, GMAT and so on. Often, their scores on these
standardized exams are based on weighted averages, as well as a comparison of your score to
others performance. We will concentrate on the weighted average portion of the exam. The
question, when should one guess if they can eliminate one or more choices is crucial to your
success. This test taking strategy can separate two equal prepared students on a grade scale.
On a two hundred question multiple choice exam, with each question having choices a through e,
this standardized test is graded as follows, for each correct answer you get a point, for each
incorrect answer you lose ½ a point. For each question you leave blank, you gain or lose no
- 86 -
points. What is the expected value and would you guess if there is only one correct answer per
question and if
a) you couldn’t eliminate any answers
b) you could eliminate one answer
c) you could eliminate two answers
d) you could eliminate three answers
Solution
First, the number of questions, 200, is irrelevant because we need the expected value per
question.
4 1
1

 (1)   0.2
a)
2 5
5
You could expect to lose – 0.2 points for each question if you simply guess. Or stated more
clearly, you could expect to lose 1 point for every 5 questions you guess when you can not
eliminate a choice. This would be a poor choice on your part, it is not to your advantage. Don’t
employ this strategy.
3 1
1

 (1)   0.125
b)
2
4
4
You could expect to lose – 0.125 points for each question if you when you can eliminate a
choice. Or stated more clearly, you could expect to lose 1 point for every 8 questions you guess
when you can eliminate a single choice. Don’t employ this strategy, it is not to your advantage.
2 1
1

 (1)  0
c)
2
3
3
You could expect to lose no points for each question. Or stated more clearly, you neither gain
nor lose if you can eliminate two choices. Should you then make an educated guess if you can
safely eliminate two choices, thus knowing the answer is one of three choices? No. Do not
employ a strategy unless it is to your advantage. This is a rule for life, not just math.
1 1
1

 (1)  0.25
d)
2 2
2
You could expect to gain 0.25 points per question. Or stated more clearly, you could expect to
gain 1 point for every 4 questions you guess when you can eliminate three of the five choices.
Should you do this? Absolutely.








But, if you employ this test taking strategy, this is where a 200 question exam does matter.
Employ the strategy consistently. Every time you can eliminate 3 or more choices, then guess.
Remember, expected values are more likely to reach their expected value the more the game is
played. When the game is a standardized test, there is no “fun loving game” connotation to it. It
is very important to you. This translates into the words this is a serious strategy, take it
seriously.
Every time you can eliminate at least three choices, then guess.
It is advantageous to do so.
- 87 -
Example Three
Con Game 2 The game revolves around a standard deck of cards. The game is as follows, the
player draws one card. If the player draws a two through nine, the player gets $5. If the player
draws a ten, the player pays $100. If the player draws a face card (J, Q or K) the player pays $ 2.
If the player draws a red ace, the players get $50, if the player draws the ace of clubs, the player
pays $5 and if the player draws the ace of spades, the player pays $25. What is the expected
value of the game for the player?
Solution
32
4
12
2
1
1
5  (100)  (2)  50  (5)  (25)   3.73 or the player could expect to
52
52
52
52
52
52
lose $3.73 every time the player plays.
Example Four
Your final grade You exam grades are 78, 90, 75 and 92. Each exam is 15 percent of your
grade. Your homework average is a 50, it is 10 percent of your grade. Your project earned you
a 100, which is 5 percent of your grade. You final exam is 25 percent of your grade. You earned
a 75. What is your average? What would your homework average had to have been to earn the
next letter grade higher than the one you earned?
Solution
Your grade is the weighted average or expected value:
78(.15)  90(.15)  75(.15)  92(.15)  50(.10)  100(.05)  75(.25)  79 , a C.
To earn a B, replace the homework grade of a 50 with an x and we replace the overall final
grade with an 80. 78(.15)  90(.15)  75(.15)  92(.15)  x(.10)  100(.05)  75(.25)  80
Solving for x,
0.10 x  80  (78(.15)  90(.15)  75(.15)  92(.15)  100(.05)  75(.25))  6
6
 60
0.10
To earn a B, the homework average would have to be 60 percent.
x
- 88 -
Exercise Set
1. You have the following grades on your
three exams, 75, 84, 91. Your homework
average is a 40. You want to make a C in
the course. What do you need to make on
the final exam if the Test Average is half
your grade, 25 % of the grade is your
homework, and 25 % of your grade is your
final exam.
For problems 2 and 3, use the game where a
bag contains 3 red and 4 blue balls. You pay
7 dollars to play the game:
2. you draw one ball. If it is red I pay you
$5 and if it is blue I pay you $8. Find your
expected gain or loss for this game.
3. you may play further the following way.
Draw another ball without replacing the
first. If both are red I pay you $11, if both
balls are blue I pay you $4 and if the balls
are different colors then I pay you 6 dollars.
Find your expected gain or loss for this
second type of game.
4. On a street corner, you come across a
folding table, two chairs and a bag. A street
vendor is smiling. In the bag, there are 20
one dollar bills, 10 five dollar bills, 2 ten
dollar bills, 6 twenty dollar bills, 2 one
hundred dollar bills and 25 blank pieces of
paper. You charge $ 12 to play. The player,
blind folded, reaches into the bag and
withdraws a single item. You keep what
draw. What is the expected value for the
player? Would you play?
5. The game revolves around a standard
deck of cards. The game is as follows, the
player draws one card. If the player draws a
two or three, the player gets $ 2, four
through nine, the player gets $100. If the
player draws a ten, the player pays $50. If
the player draws a face card (J, Q or K) the
player pays $ 25. If the player draws a red
ace, the players get $200, if the player draws
the ace of clubs, the player pays $5,000 and
if the player draws the ace of spades, the
player pays $ 1. What is the expected value
of the game for the player?
For problems 6 and 7, use the following
data: In the past five years, your life as a
realtor in the summer months (June, July
and August) has been quite predictable.
Below is a table of the expected
commissions and the probability each
commission will occur.
Expected Commissions for the summer months and
the probability they will occur.
$1,000 $2,000 $3,000 $4,000 $5,000
40 %
20%
15 %
15 %
10%
6. What is your expected commission per
sale for those hot summer months?
7. If you made 1 sale all summer, what your
income be for the summer? How about if
you made 4 sales? 10 sales? 20 sales?
For problems 8 and 9, use the following
data: In the past five years, your life as a
realtor in the winter months (December,
January and February) has been just as
predictable. Below is a table of the expected
commissions and the probability each
commission will occur.
Expected Commissions for the winter months and the
probability they will occur.
$1,000 $2,000 $3,000 $4,000 $5,000
30 %
10%
5%
25 %
30%
8. What is your expected commission per
sale for those cold winter months?
9. If could vacation in the summer or the
winter, when would you vacation?
For problems 10 – 12, five cards are dealt
from a standard 52 card deck. Find the
expected value and state whether or not you
would play the game.
1
10. You get $ 1,000,000 for a flush. You
pay $ 10 for any other hand dealt to you.
- 89 -
11. You get $ 1,000,000 for a royal flush.
You pay $ 1 for any other hand dealt to you.
12. You get $ 20 for a straight flush. You
get $ 100 for a pair. You pay 0.25 for any
other hand dealt to you.
13. On a 155 question multiple choice
exam, with each question having choices a
through f, this standardized test is graded as
follows, for each correct answer you get 3
points, for each incorrect answer you lose 2
points. For each question you leave blank,
you gain or lose no points. What is the
expected value and would you guess if there
is only one correct answer per question if
a) you couldn’t eliminate any answers
b) you could eliminate one answer
c) you could eliminate two answers
d) you could eliminate three answers
For problems 14 – 17. Your grade is
calculated on a standard scale: 0 – 59 is an
F, 60 – 69 is a D, 70 – 79 is a C, 80 – 89 is a
B and 90 – 100 is an A. You final grade is
calculated where your exam average is
worth 50 % of your grade, your homework
is worth 25 %of your grade, and the final
exam is 25 % of your grade. You had 3
exams, your scores were 89, 90 and 85.
Your final exam grade was a 92. Your
homework grades were the average of the
nine homework assignments, where you
earned a 50, 60, 30, 40, 70 and did not
complete the other 4 assignments.
14. What is your final course average?
15. What would your final course average
have been if your homework grades were
80, 70, 80, 70, 80, 70, 90, 90, 70?
16. What would your final course average
be if your homework grades were 80, 90, 80,
90, 80, 100, 100, 100, 100?
17. Is there a point to questions 14 – 17?
- 90 -