Download 1 Probability Basics 1.1 Meaning of Probability It`s hand 229 of day 7

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Statistics wikipedia , lookup

History of statistics wikipedia , lookup

Inductive probability wikipedia , lookup

Birthday problem wikipedia , lookup

Ars Conjectandi wikipedia , lookup

Probability interpretations wikipedia , lookup

Probability wikipedia , lookup

Transcript
1
Probability Basics
1.1 Meaning of Probability
It’s hand 229 of day 7 of the 2006 World Series of Poker
(WSOP) Main Event, the biggest poker tournament ever
played, and after 8,770 entrants have been eliminated, it is
now down to the final three players. The winner gets a cash
prize of $12 million, second place just over $6.1 million,
and third place gets about $4.1 million. Jamie Gold is the
chip leader with $60 million in chips, Paul Wasicka has $18
million, and Michael Binger has $11 million. The blinds are
$200,000 and $400,000, with $50,000 in antes. (Please note:
for readers unfamiliar with Texas Hold’em, consult the
Appendices A and B for a brief explanation of the game and
related terminology used in this book.) Gold calls with 4♠
3♣, Wasicka calls with 8♠ 7♠, Binger raises with A♥ 10♥,
and Gold and Wasicka call. The flop is 10♣ 6♠ 5♠. Wasicka
checks, perhaps expecting to raise, but by the time it gets
back to him, Binger has bet $3.5 million and Gold has
raised all-in. What would you do, if you were Paul Wasicka
in this situation?
There are, of course, so many issues to consider. One
relatively simple probability question that may arise is this:
given all the players’ cards and the flop, if Wasicka calls,
what is the probability that he will make a flush or a straight?
This is the type of calculation we will address in this book,
but before we get to the calculation, it is worth examining the
question a bit. What does it mean to say that the probability of
some event, like Wasicka’s making a flush or a straight, is,
say, 55%? It may surprise some readers to learn that
considerable disagreement continues among probabilists and
statisticians about the definition of the term probability. There
are two main schools of thought.
The frequentist definition is that if one were to record
observations under the exact same conditions over and over,
with each observation independent of the rest, then the event
in question would ultimately occur 55% of the time. In other
words, to say that the probability of Wasicka making a flush
or a straight is 55% means that if we were to imagine
repeatedly observing situations just like this one, or if we
were to imagine dealing the turn and river cards repeatedly,
each time reshuffling the remaining 43 cards before dealing,
then Wasicka would make a flush or a straight 55% of the
time.
The Bayesian definition is that the quantity 55% reflects
one’s subjective feeling about how likely the event is to
occur: in this case, because the number is 55%, the feeling is
that the event is slightly more likely to occur than it is not to
occur.
The two definitions suggest very different scientific
questions, statistical procedures, and interpretations of results.
For instance, a Bayesian may discuss the probability that life
exists on Mars, while a frequentist may argue that such a
construct does not even make sense. Frequentists and
Bayesians have had spirited debates for decades on the
meaning of probability, and consensus appears to be nowhere
in sight.
While experts may differ about the meaning of probability,
they have achieved unanimous agreement about the
mathematics of probability. Both frequentists and Bayesians
agree on the basic rules of probability—known as the Axioms
of Probability. These three axioms are discussed in Section
1.3, and the methods for calculating probabilities follow from
these three simple rules. There is also agreement about the
proper notation for probabilities: we write P(A) = 55%, for
instance, to connote that the probability of event A is 55%.
1.2 Basic Terminology
Before we get to the axioms of probability, a few terms
must be clarified. One is the word or. In the italicized
question about probability in Section 1.1 above, it is unclear
whether we mean the probability that Wasicka makes either a
flush or a straight but not both or the probability that Wasicka
makes a flush or a straight or both. English is ambiguous
regarding the use of or. Mathematicians prefer clarity over
ambiguity, and have agreed on the convention that A or B
always means A or B or both. If one means A or B but not
both, one must explicitly include the phrase but not both.
Of course, in some situations, A and B cannot both occur.
For instance, if we were to ask what the probability is that
Wasicka makes a flush or three 8s on this hand, one can see
that they both cannot happen: if the turn and river are both
8s, then neither the turn nor the river can be a spade, since
Wasicka already has the 8 of spades. If two events A and B
cannot both occur, i.e., if P(A and B) = 0, then we say that
the events are mutually exclusive. We usually use the
notation AB to denote the event A and B, so the condition for
mutual exclusivity can be written simply P(AB) = 0.
The collection of all possible outcomes is sometimes called
the sample space, and an event is a subset of elements of the
sample space. For instance, in the WSOP example described
in the beginning of Section 1.1, if one were to consider the
possibilities that might come on the turn, one might consider
the sample space to be all 52 cards. Of course, if we know
what cards have been dealt to the three players and which
cards appeared on the flop, then these nine cards have zero
probability of appearing on the turn, and one may consider the
remaining 43 cards equally likely. The event that the turn is
the 7♦ is an event consisting of a single element of the sample
space, and the event that the turn is a diamond is an event
consisting of 13 elements.
Given an event A, we use the notation Ac to mean the
complement of A, or in other words, the event that A does not
occur. If A is the event that the turn is a diamond, for instance,
then Ac is the event that the turn is a club, heart, or spade. For
any event A, the events A and Ac are always mutually
exclusive and exhaustive, meaning that together they cover
the entire sample space.
1.3 Axioms of Probability
Three basic rules or axioms must govern all probabilities:
1: P(A) ≥ 0.
2: P(A) + P(Ac) = 1.
3: If A1, A2, A3,… are mutually exclusive events, then
P(A1 or A2 or … An ) = P(A1 ) + P(A2 ) + … + P(An ),
where n may be a positive integer or ∞.
Axiom 1 says that all probabilities must be at least 0, and
because of this and Axiom 2, no probability can be greater
than 1. According to Axiom 2, if the probability that A will
not occur, is, say, 45%, then we know that the probability of A
occurring is 55%. It is sometimes convenient to compute the
probability of A not occurring en route to computing the
probability of A.
Axiom 3 is sometimes called the addition rule for mutually
exclusive events. Note that it is usually rather obvious when
events are mutually exclusive. For example, if you play a
single hand of Texas Hold’em, A1 might be the event that you
are dealt pocket aces, and A2 the event you are dealt pocket
kings. They cannot both occur, so these two events are
mutually exclusive, and according to Axiom 3, the probability
of your being dealt either pocket aces or pocket kings on this
single hand is equal to the probability of your being dealt
pocket aces plus the probability of your being dealt pocket
kings. This may seem obvious, but it is worth noting that the
fact that we may add these two probabilities together to find
the probability that one will occur is not something that can
be proven; it is a fundamental principle we agree to assume
when discussing probabilities.
Note also that Axiom 3 does not apply directly to the
question discussed in Section 1.1, involving the probability of
Wasicka making a flush or a straight. It is possible (if, for
instance, the turn and river are the 9♣ and Q♠) for both to
occur, so the event that Wasicka makes a straight and the
event that he makes a flush are not mutually exclusive.
A final comment worth noting about the axioms of
probability is that they imply that if there are n equally likely
events, exactly one of which must occur, then the probability
associated with any one of the events is simply 1/n. This
may again seem obvious, but note that it can be proven
directly from Axiom 3 (see Exercise 1.3). This fact is
important for many questions involving Texas Hold’em,
because in many cases the outcomes are equally likely. The
subject of equally likely events is taken up further in Section
2.1.
1.4 Venn Diagrams
Most readers have likely seen Venn diagrams so they will be
reviewed only briefly here. One question that is sometimes
overlooked in discussing Venn diagrams is: why can one use
shapes such as those shown in Figure1.1 to address
probability questions?
The answer has everything to do with the Axioms of
probability discussed in Section 1.3. All the rules of
probability discussed in this book follow from these three
simple Axioms. Note that they apply not only to
probabilities, but also to other constructs including areas.
Suppose you have a piece of paper of area 1, and that A1, A2,
A3, … represent shapes on the paper. Suppose also that P(A)
means the area occupied by shape A (instead of probability);
Ac is the portion of the paper not occupied by shape A; AB
denotes the intersection of A and B, i.e., the portion of the
paper occupied by both A and B; and A or B indicates the
union of A and B, i.e., the portion of the paper occupied by
shapes A or B or their intersection. Then the three axioms
would still be true: any imaginable shape on the paper would
necessarily have an area greater than or equal to zero; the
area covered by the shape plus the area not covered by the
shape would equal 1; and shapes would be mutually
exclusive if their area of intersection is 0, so if shapes A1, A2,
A3,… are mutually exclusive, then obviously the total area
covered by them is simply the area of A1 + the area of A2 +
the area of A3 + ….
Thus, the same axioms that govern probability also apply
to areas on a paper of total area 1. We will see the connection
between probability and area several times in this book, and it
is worth noting that in further studies of probability at the
graduate and research levels, the connections are made very
explicit between probability and measure theory, the latter of
which is essentially the study of principles related to areas,
volumes, and other means of measuring shapes or other
objects.
1.5 General Addition Rule
Axiom 3 in Section 1.3 is sometimes called the addition rule
for mutually exclusive events. How does one compute the
corresponding probability for events that may not be mutually
exclusive? In general, for any events A and B,
P(A or B) = P(A) + P(B) – P(AB).
The above may be called the general addition rule because it
applies to events A and B whether they are mutually exclusive
or not. Of course, when A and B are mutually exclusive, the
last term, P(AB), is equal to zero. The general addition rule
follows directly from the axioms of probability and may be
demonstrated quite easily by breaking up A or B into mutually
exclusive events. That is, observe that
A or B = ABc or AB or AcB,
and that these three events, ABc, AB, and AcB, are mutually
exclusive. Therefore,
P(A or B) = P(ABc or AB or AcB) = P(ABc) + P(AB) + P(AcB).
One can also form the partition A = ABc or AB, and since ABc
and AB are mutually exclusive, it follows from Axiom 3 that
P(A) = P(ABc) + P(AB). Similarly, B = AcB or AB, so P(B) =
P(AcB) + P(AB). Thus,
P(A) + P(B) - P(AB) = [P(ABc) + P(AB)] + [P(AcB) + P(AB)]
- P(AB)
c
= P(AB ) + P(AB) + P(AcB)
= P(A or B).
The following five examples refer to the WSOP hand
discussed previously. For each example, assume we have only
the information given in Section 1.1, i.e., that Wasicka had 8♠
7♠, Binger had A♥ 10♥, Gold had 4♠ 3♣, and the flop was 10♣
6♠ 5♠. Assume that the remaining 43 cards are all equally
■
likely to appear as turn or river cards and suppose that
Wasicka had called.
Example 1.5.1 — What is the probability of Wasicka making
a straight flush on the turn?
Answer — Wasicka needs the 4♠ or 9♠ to make a straight
flush, but Gold has the 4♠, so the only remaining possibility is
the 9♠. Since there are 43 equally likely possibilities for the
turn, exactly one of which must occur, the probability
associated with each of them appearing on the turn must be
1/43, and therefore the probability of the turn being the 9♠ is
1/43.
■
Example 1.5.2 — What is the probability of Wasicka making
a straight flush on the turn or river?
Answer — Note that, from the definition of the word or given
in Section 1.2, this question is really asking what the
probability is of Wasicka making a straight flush on the turn
or river or both. However, the only card that can give
Wasicka a straight flush is the 9♠, and it cannot appear both
on the turn and the river. Thus, if A is the event that the 9♠
comes on the turn and B is the event that the 9♠ comes on the
river, then A and B are mutually exclusive, and by Axiom 3,
the probability of A or B is P(A) + P(B), which is simply 1/43
+ 1/43 = 2/43.
■
Note that P(B) in the answer above is 1/43, not 1/42. It might
help to think of P(B) as the frequency with which the river
will be the 9♠, if we were to repeat this scenario over and over
again. Clearly, each of the 43 remaining cards is equally
likely to appear on the river, so the probability associated with
the 9♠ is simply 1/43.
Example 1.5.3 — What is the probability that the turn card
does not give Wasicka a straight flush and the river card also
does not give Wasicka a straight flush.
Answer — This is the complement of the event in Example
1.5.2, so by Axiom 2, the answer is 1 – 2/43 = 41/43.
■
Example 1.5.4 — What is the probability of Wasicka making
a flush on the turn?
Answer — Eight spades are left in the deck: A♠, K♠, Q♠, J♠,
10♠, 9♠, 3♠, and 2♠. As noted in Example 1.5.1, each of the 43
remaining cards has a probability of 1/43 of appearing on the
turn. Let A1 be the event that the turn is the A♠, A2 be the event
that the turn is the K♠, etc. Note that A1, A2, …, A8 are mutually
exclusive, since it is impossible for more than one of these cards
to appear on the turn. Thus by Axiom 3,
P(A1 or A2 or … or A8) = P(A1) + P(A2) + … + P(A8)
= 1/43 + 1/43 + … + 1/43
= 8/43
~ 18.6%.
Two simple approximations used frequently by poker players
are known as the Rule of Two and the Rule of Four. If you are
behind your opponent presently, we call an out a card that will
give you the lead. Let x be the number of outs left in the deck.
The Rule of Two says that your probability of an out coming
on the next card (e.g. the turn) is approximately 2x%. The
Rule of Four says that your probability of an out coming on
the turn or river (or both) is approximately 4x%. Of course,
the term out can be replaced by an element of any collection
of x cards. The idea behind the Rule of Two is that, if one is
interested in the probability of one of x possible cards coming
on the turn, given only one's hole cards and the 3 flop cards,
then there are 47 cards left in the deck, each equally likely to
appear, so the probability of one of these x cards coming is
x/47 ~ x/50 = 2x/100. For the Rule of Four, the idea is that,
■
again given only one's hole cards and the 3 flop cards,
P(out on turn or river) = P(out on turn) + P(out on river) –
P(both). If x is small, then this last term, the probability of an
out coming on both the turn and river, will be small and is
therefore neglected, and the other terms are each equal to x/47
~ 2x/100. Note that 2x/100 is a slight underestimate of x/47,
but this difference is offset by the exclusion of the term
P(both), resulting in a quite accurate estimate for small x. The
approximation becomes poor, however, when x is large, as
discussed further in Chapter 2.4.
In Example 1.5.4, x = 8, so the Rule of Two yields the
approximation 16%, whereas the actual answer was 18.6%.
However, in Example 1.5.4 we also assumed the cards of
Binger and Gold were known; otherwise, the correct answer
would be 8/47 ~ 17.0%.
Example 1.5.5 — What is the probability of Wasicka making
a flush on the turn or the river?
Answer — This one is not so simple. If A is the event that the
turn is a spade and B is the event that the river is a spade, then
A and B are not mutually exclusive. We will return to the
correct solution to this problem after discussing permutations
and combinations, in Example 2.4.3.
For now, we may apply the Rule of Four to obtain an
approximate answer. 8 spades remain, so x = 8, and the Rule
of Four yields 32% as the approximate solution.
■
Example 1.5.6 — Mark Newhouse, who became one of the
WSOP main event's final 9 players (called the “November 9”)
in 2013, made the November 9 again in 2014. Of the 6683
entrants in the 2014 main event, if each player uses the same
strategy, then what is the probability that any one given player
would become part of the November 9?
Answer — The given player would be equally likely to wind
up in 1st place, 2nd place, …, or 6683rd place, so the
probability of being in the top 9 is 9/6683 ~ 0.135%.
■
Example 1.5.7 — On day 5 in the 2015 WSOP Main Event,
with Andrew Moreno went all in with A♦K♦ and Dax Greene
called with A♠A♥. The flop came 5♦ K♠ K♥, giving Moreno 3
of a kind. Suddenly Greene needed the A♣ to come (and the
K♣ not to come) to win the hand. Given the players' cards and
the 3 cards on the flop, what is the probability of the A♣
coming on the turn or river?
Answer — Since the A♣ is Greene's only out, the Rule of
Four suggests an estimate of 4% for the A♣ to come on the
turn or river. Computed exactly,
P(A♣ on turn or river) = P(A♣ on turn) + P(A♣ on river)
– P(A♣ on both turn and river)
= P(A♣ on turn) + P(A♣ on river)
= 1/45 + 1/45 ~ 4.44%.
■
(In the actual hand, the turn and river were the 7♠ and 8♣, so
Moreno won the pot.)
Exercises
1.1 Sometimes it matters whose hand is second best. With
the Gold, Wasicka, and Binger hands, for instance, if
both Wasicka and Binger had called all-in and Binger
had won the hand overall but Wasicka’s hand had beaten
Gold’s hand, then Wasicka would not have been
eliminated since he had more chips than Binger: Binger
would have ended up with 33 million chips, Wasicka
with 14 million chips, and Gold with 42 million chips. If
all 3 players were all-in and Gold had won the hand, of
course, the tournament would have been over and Gold
would have won because he had more chips than his
opponents at the start of the hand; by convention in such
situations, the player who started the hand with more
chips, which in this case was Wasicka, would get the
prize money for second place.
Suppose all 3 players went all-in on the flop, and that
the probability that Gold wins the hand is 17.17%, the
probability that Wasicka wins the hand is 53.82%, the
probability that Binger wins the hand and Wasicka’s
hand beats Gold’s hand is 24.03%, and the probability
that Binger wins the hand and Wasicka and Gold split
the side-pot is 1.00%. What is the probability that
Binger wins the hand and Gold’s hand beats Wasicka’s
hand?
1.2 Suppose that just before the hand described in the
beginning of Section 1.1, Gold’s probability of winning
the tournament was 60/89, and Binger’s probability of
winning the tournament was 11/89. What is the probability
that Wasicka would win the tournament? Which Axiom of
probability are you using?
1.3 Suppose that events A1, A2,…, An are equally likely
events and that exactly one of them must occur. Using
Axiom 3, prove carefully that P(A1 ) = 1/n.
1.4 Suppose you are at the final table of the WSOP main
event and you feel that, based on the number of chips
you have and your experience, you have a 30% chance
of winning the tournament. Is this a Bayesian or
frequentist type of probability?
1.5 Find the general addition rule for three events. That is,
find a general formula for P(A or B or C), for any events
A, B, and C, in terms of P(A), P(B), P(C), P(AB), P(AC),
P(BC), and P(ABC).
1.6 Using the Axioms of probability, prove Boole’s
inequality which states that P(A1 or A2 or …An ) ≤ P(A1 )
+ P(A2 ) + … + P(An ), for any events A1, A2,… An, where
n may be a positive integer or ∞.
1.7 In a huge hand from the 2015 WSOP Main Event, with
just 14 players left, Justin Schwartz called with 3♠3♣,
Alex Turyansky raised with Q♥Q♣, Joe Mckeehen called
with 6♠6♣, and Schwartz called. The flop came 2♦ 3♦ 6♥,
giving both Mckeehen and Schwartz three of a kind.
Schwartz checked, Turyansky bet 700,000, Mckeehen
called, Schwartz went all in, Turyansky folded, and
Mckeehen called. Schwartz would need the 3♥ (and no
6♦) to take the lead. Given these 3 players' cards and the 3
cards on the flop, what is the probability of the 3♥
coming on the turn or river? (In the actual hand, the turn
and river were 7♣ 5♣. Schwartz was eliminated in 14th
place, Mckeehen won the huge 22.32 million chip pot and
went on the win the tournament and the $7.7 million first
place prize.)
1.8 Compare the answer to the previous exercise involving
Schwartz, Mckeehen and Turyansky in the 2015 WSOP
Main Event to the approximation obtained using the Rule
of Four. Why does the Rule of Four yield an
underestimate in this case?
1.9
1.10
♣♦♥♠