Survey							
                            
		                
		                * Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Lesson 3: A Survey of Probability Concepts Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-1 Outline Learning Exercises Definitions Basic rules of Probability Independence Tree Diagram Bayes’ Theorem Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-2 Learning exercise 1: University Demographics  Current enrollments by college and by sex appear in the following table. College Ag-For Arts-Sci Bus-Econ Educ Engr Law Undecl Totals Female 500 1500 400 1000 200 100 800 4500 Male 900 1200 500 500 1300 200 900 5500 Totals 1400 2700 900 1500 1500 300 1700 10000  If we select a student at random, what is the probability that the student is :  A female or male, i.e., P(Female or Male).  Not from Agricultural and Forestry, i.e., P(not-Ag-For)  A female given that the student is known to be from BusEcon, i.e., P(Female |BusEcon).  A female and from BusEcon, i.e., P(Female and BusEcon).  From BusEcon, i.e., P(BusEcon). Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-3 Learning exercise 1: University Demographics College Ag-For Arts-Sci Bus-Econ Educ Engr Female 500 1500 400 1000 200 100 800 4500 Male 900 1200 500 500 1300 200 900 5500 1400 2700 900 1500 1500 300 1700 10000 Totals Law Undecl Totals P(Female or Male) =(4500 + 5500)/10000 = 1 P(not-Ag-For) =(10000 – 1400) /10000 = 0.86 P(Female | BusEcon) = 400 /900 = 0.44 P(Female and BusEcon) = 400 /10000 = 0.04 P(BusEcon) = 900 /10000 = 0.09 P(Female and BusEcon) = P(BusEcon) P(Female | BusEcon) Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-4 Learning exercise 2: Predicting Sex of Babies  Many couples take advantage of ultrasound exams to determine the sex of their baby before it is born. Some couples prefer not to know beforehand. In any case, ultrasound examination is not always accurate. About 1 in 5 predictions are wrong.  In one medical group, the proportion of girls correctly identified is 9 out of 10, i.e., applying the test to 100 baby girls, 90 of the tests will indicate girls. and  the number of boys correctly identified is 3 out of 4. i.e., applying the test to 100 baby boys, 75 of the tests will indicate boys.  The proportion of girls born is 48 out of 100.  What is the probability that a baby predicted to be a girl actually turns out to be a girl? Formally, find P(girl | test says girl). Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-5 Learning exercise 2: Predicting Sex of Babies  P(girl | test says girl)  In one medical group, the proportion of girls correctly identified is 9 out of 10 and  the number of boys correctly identified is 3 out of 4.  The proportion of girls born is 48 out of 100.  Think about the next 1000 births handled by this medical group.  480 = 1000*0.48 are girls  520 = 1000*0.52 are boys  Of the girls, 432 (=480*0.9) tests indicate that they are girls.  Of the boys, 130 (=520*0.25) tests indicate that they are girls.  In total, 562 (=432+130) tests indicate girls. Out of these 562 babies, 432 are girls.  Thus P(girl | test says girl ) = 432/562 = 0.769 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-6 Learning exercise 2: Predicting Sex of Babies With the information given, we can fill in the following table in sequence from (1) to (9), with the initial assumption of 1000 babies in total. For the question at hand, i.e., P(girl | test says girl ), we only need to fill in the cells from (1) to (6). Test says girl Test says boy Totals Girl (4) 432 (7) 48 (2) 480 Boy (5) 130 (8) 390 (3) 520 Totals (6) 562 (9) 438 (1) 1000 P(girl | test says girl ) = 432/562 = 0.769 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-7 Learning exercise 2: Predicting Sex of Babies 1000*P(girls)  480 = 1000*0.48 are girls  520 = 1000*0.52 are boys 1000*P(boys)  Of the girls, 432 (=480*0.9) tests indicate that they are girls. 1000*P(girls)*P(test says girls|girls)  Of the boys, 130 (=520*0.25) tests indicate that they are girls. 1000*P(boys)*P(test says girls | boys)  In total, 562 tests indicate girls. 1000*[P(girls)*P(test says girls|girls) + P(boys)*P(test says girls|boys)]  Out of these 562 babies, 432 are girls.  Thus P(girls | test syas girls ) = 432/562 = 0.769 1000*P(girls)*P(test says girls|girls) 1000*[P(girls)*P(test says girls|girls) + P(boys)*P(test says girls|boys)] Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-8 Example 1 (to be used to illustrate the definitions)  A fair die is rolled once.  Peter is concerned with whether the resulted number is even, i.e., 2, 4, 6.  Paul is concerned with whether the resulted number is less than or equal to 3, i.e., 1, 2, 3.  Mary is concerned with whether the resulted number is 6.  Sonia is concerned with whether the resulted number is odd, i.e., 1, 3, 5.  A fair die is rolled twice.  John is concerned with whether the resulted number of first roll is even, i.e., 2, 4, 6.  Sarah is concerned with whether the resulted number of second roll is even, i.e., 2, 4, 6. Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Definitions: Experiment and outcome  A random experiment is the observation of some activity or the act of taking some measurement.  The experiment is rolling the one die in the first example, and rolling one die twice in the second example.  An outcome is the particular result of an experiment.  The possible outcomes are the numbers 1, 2, 3, 4, 5, and 6 in the first example.  The possible outcomes are number pairs (1,1), (1,2), …, (6,6), in the second example.  Sample Space – the collection of all possible outcomes of a random experiment. Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-10 Definition: Event  An event is the collection of one or more outcomes of an experiment.  For Peter: the occurrence of an even number, i.e., 2, 4, 6.  For Paul: the occurrence of a number less than or equal to 3, i.e., 1, 2, 3.  For Mary: the occurrence of a number 6.  For Sonia: the occurrence of an odd number, i.e., 1, 3, 5.  For John: the occurrence of (2,1), (2,2), (2,3),…, (2,6), (4,1),…,(4,6), (6,1),…,(6,6) [John does not care about the result of the second roll]. Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-11 Definition: Intersection of Events  Intersection of Events – If A and B are two events in a sample space S, then the intersection, A ∩ B, is the set of all outcomes in S that belong to both A and B S A     The intersection The intersection The intersection The intersection Ka-fu Wong © 2007 AB B of Peter’s event and Paul’s event contains 2. of Peter’s event and Mary’s event contains 6. of Paul’s event and Mary’s event contains nothing. of Peter’s event and Sonia’s event contains nothing. ECON1003: Analysis of Economic Data Lesson3-12 Definition: Mutually Exclusive events  A and B are Mutually Exclusive Events if they have no basic outcomes in common  i.e., the set A ∩ B is empty S A Ka-fu Wong © 2007 B ECON1003: Analysis of Economic Data Lesson3-13 Example 2: Intersections and Mutually Exclusive events  Peter’s event and Paul’s event are not mutually exclusive – both contains 2.  The intersection of Peter’s event and Paul’s event contains 2.  Peter’s event and Mary’s event are not mutually exclusive – both contains 6.  The intersection of Peter’s event and Mary’s event contains 6.  Paul’s event and Mary’s event are mutually exclusive – no common numbers.  The intersection of Paul’s event and Mary’s event contains nothing.  Peter’s event and Sonia’s event are mutually exclusive – no common numbers.  The intersection of Peter’s event and Sonia’s event contains nothing. Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-14 Definition: Union  Union of Events – If A and B are two events in a sample space S, then the union, A U B, is the set of all outcomes in S that belong to either A or B S A Ka-fu Wong © 2007 B The entire shaded area represents AUB ECON1003: Analysis of Economic Data Lesson3-15 Definition: Exhaustive events  Events are collectively exhaustive if at least one of the events must occur when an experiment is conducted.  Peter’s event (even numbers) and Sonia’s event (odd numbers) are collectively exhaustive.  Peter’s event (even numbers) and Mary’s event (number 6) are not collectively exhaustive.  Events E1, E2, … Ek are Collectively Exhaustive events if E1 U E2 U . . . U Ek = S  i.e., the events completely cover the sample space Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-16 Complement  The Complement of an event A is the set of all basic outcomes in the sample space that do not belong to A. The complement is denoted A S A Ka-fu Wong © 2007 A ECON1003: Analysis of Economic Data Lesson3-17 Example 3 Let the Sample Space be the collection of all possible outcomes of rolling one die: S = [1, 2, 3, 4, 5, 6] Let A be the event “Number rolled is even” Let B be the event “Number rolled is at least 4” Then the two events contains A = [2, 4, 6] Ka-fu Wong © 2007 B = [4, 5, 6] ECON1003: Analysis of Economic Data Lesson3-18 Example 3 S = [1, 2, 3, 4, 5, 6] A = [2, 4, 6] B = [4, 5, 6] Complements: B  [1, 2, 3] A  [1, 3, 5] Intersections: A  B  [4, 6] Unions: A  B  [5] A  B  [2, 4, 5, 6] A  A  [1, 2, 3, 4, 5, 6]  S Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-19 Example 3 S = [1, 2, 3, 4, 5, 6] A = [2, 4, 6] B = [4, 5, 6]  Mutually exclusive:  Are A and B mutually exclusive? No. The outcomes 4 and 6 are common to both  Collectively exhaustive:  Are A and B collectively exhaustive? No. A U B does not contain 1 and 3 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-20 Probability  Probability – the chance that an uncertain event will occur (always between 0 and 1) 0 ≤ P(A) ≤ 1 For any event A 1 .5 0 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Certain Impossible Lesson3-21 Assessing Probability  There are three approaches to assessing the probability of an uncertain event: 1. classical probability probability of event A  NA number of outcomes that satisfy the event  N total number of outcomes in the sample space (Assumes all outcomes in the sample space are equally likely to occur.) Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-22 Counting the Possible Outcomes  Use the Combinations formula to determine the number of combinations of n things taken k at a time C(n, k)  n! k!(n  k)!  where  n! = n(n-1)(n-2)…(1)  0! = 1 by definition Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-23 Example 4: Combination C(n, k)  n! k!(n  k)!  Suppose there are 2 stocks in our portfolio. We would like to select 1 stocks and sell them. What are all the possible combinations.  [1], [2]  C(2,1) = 2!/(1!(2-1)!)=2 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-24 Example 4: Combination n! C(n, k)  k!(n  k)!  Suppose there are 3 stocks in our portfolio. We would like to select 1 stocks and sell them. What are all the possible combinations.  [1], [2], [3]  C(3,1) = 3!/(1!(3-1)!)=3*2*1/[1*(2*1)]=3  Suppose there are 3 stocks in our portfolio. We would like to select 2 stocks and sell them. What are all the possible combinations.  [1,2], [1,3], [2,3]  C(3,2) = 3!/(2!(3-2)!)=3*2*1/[(2*1)*1]=3 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-25 Example 5: Combination n! C(n, k)  k!(n  k)!  Suppose there are 4 stocks in our portfolio. We would like to select 1 stocks and sell them. What are all the possible combinations.  [1], [2], [3], [4]  C(4,1) = 4!/(1!(4-1)!)=4*3*2*1/[1*(3*2*1)]=4  Suppose there are 4 stocks in our portfolio. We would like to select 2 stocks and sell them. What are all the possible combinations.  [1,2], [1,3], [1,4], [2,3], [2,4], [3,4]  C(4,2) = 4!/(2!(4-2)!)=4*3*2*1/[(2*1)*(2*1)]=6 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-26 Assessing Probability Three approaches (continued) 2. relative frequency probability probability of event A  nA number of events in the population that satisfy event A  n total number of events in the population the limit of the proportion of times that an event A occurs in a large number of trials, n 3. subjective probability an individual opinion or belief about the probability of occurrence Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-27 Probability Postulates 1. If A is any event in the sample space S, then 0  P(A)  1 2. Let A be an event in S, and let Oi denote the basic outcomes (mutually exclusive). Then P(A)   P(Oi ) A (the notation means that the summation is over all the basic outcomes in A) 3. P(S) = 1 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-28 Probability Rules  The Complement rule: P(A)  1 P(A) i.e., P(A)  P(A)  1  The Addition rule:  The probability of the union of two events is P(A  B)  P(A)  P(B)  P(A  B) Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-29 A Probability Table Probabilities and joint probabilities for two events A and B are summarized in this table: B Ka-fu Wong © 2007 B A P(A  B) P(A  B ) P(A) A P(A  B) P(A  B ) P(A) P(B) P(B ) P(S)  1.0 ECON1003: Analysis of Economic Data Lesson3-30 Example 5: Addition Rule P(A  B)  P(A)  P(B)  P(A  B) Consider a standard deck of 52 cards, with four suits: ♥♣♦♠ Let event A = card is an Ace Let event B = card is from a red suit Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-31 Example 5: Addition Rule P(A  B)  P(A)  P(B)  P(A  B) P(Red U Ace) = P(Red) + P(Ace) - P(Red ∩ Ace) = 26/52 + 4/52 - 2/52 = 28/52 Type Color Red Black Total Ace 2 2 4 Non-Ace 24 24 48 Total 26 26 52 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Don’t count the two red aces twice! Lesson3-32 Conditional Probability  A conditional probability is the probability of one event, given that another event has occurred: P(A  B) P(A | B)  P(B) The conditional probability of A given that B has occurred P(A  B) P(B | A)  P(A) The conditional probability of B given that A has occurred Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-33 Example 6: Conditional Probability P(A | B)  P(A  B) P(B)  Of the cars on a used car lot, 70% have air conditioning (AC) and 40% have a CD player (CD). 20% of the cars have both.  What is the probability that a car has a CD player, given that it has AC ? i.e., we want to find P(CD | AC) Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-34 Example 6: Conditional Probability P(A | B)  P(A  B) P(B)  Of the cars on a used car lot, 70% have air conditioning (AC) and 40% have a CD player (CD). 20% of the cars have both. Ka-fu Wong © 2007 CD No CD Total AC .2 .5 .7 No AC .2 .1 .3 Total .4 .6 1.0 ECON1003: Analysis of Economic Data Lesson3-35 Example 6: Conditional Probability P(A | B)  P(A  B) P(B)  Given AC, we only consider the top row (70% of the cars). Of these, 20% have a CD player. 20% of 70% is 28.57%. CD No CD Total AC .2 .5 .7 No AC .2 .1 .3 Total .4 .6 1.0 P(CD  AC) .2 P(CD | AC)    .2857 P(AC) .7 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-36 Multiplication Rule  Multiplication rule for two events A and B: P(A  B)  P(A | B)P(B)  also P(A  B)  P(B | A)P(A) Examples: 1. P(test says girl and girl) = P(girls) * P(test says girls | girls) 2. P(test says boy and boy) = P(boys) * P(test says boys | boys) Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-37 Example 7: Multiplication Rule P(A  B)  P(A | B)P(B) P(Red ∩ Ace) = P(Red| Ace)P(Ace)  2  4  2       4  52  52 number of cards that are red and ace 2   total number of cards 52 Type Ka-fu Wong © 2007 Color Red Black Total Ace 2 2 4 Non-Ace 24 24 48 Total 26 26 52 ECON1003: Analysis of Economic Data Lesson3-38 Example 8: Independence Should I go to a party without my girlfriend?  The probability of the my going to party is 0.7 (i.e., I go to 70 out of 100 parties on average).  If I tend to go to whichever party my girlfriend (Venus) goes, my party behavior depends on Venus’s. That is, my probability of going to a party conditional on Venus’s presence is larger than 0.7 (extreme case being 1.0).  If I tends to avoid going to whichever party Venus goes, my party behavior also depends on Venus. That is, my probability of going to a party conditional on Venus’s presence is less than 0.7 (extreme case being 0.0).  If in making the party decision, I never consider whether Venus is going to a party, my party behavior does not depends on Venus’s. That means, the probability of going to a party conditional on Venus’s presence is 0.7. Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-39 Example 8: Independence Should I go to a party without my girlfriend?  Define events:  A: a young man goes to a party  B: his girlfriend goes to the same party.  Assume P(A) =0.7  His party behavior does not depend on his girlfriend’s only if P(A|B) =P(A) = 0.7. And, event A is said to be independent of event B.  P(the young man and his girlfriends shows up in a party) = P(A & B) = P(B)*P(A|B).  If he always goes to whichever party his girlfriend goes, P(A|B) = 1. Hence, P(A & B) = P(B)*P(A|B) = P(B).  If he always avoid to whichever party his girlfriend goes, P(A|B) = 0. Hence, P(A & B) = P(B)*P(A|B) = 0.  If in making the party decision, he never considers whether his girlfriend is going to a party, P(A|B) = 0.7. Hence, P(A & B) = P(B)*P(A|B) = P(B)*P(A) = P(B)*0.7. Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-40 Statistical Independence  Two events are statistically independent if and only if: P(A  B)  P(A)P(B)  Events A and B are independent when the probability of one event is not affected by the other event  If A and B are independent, then Ka-fu Wong © 2007 P(A | B)  P(A) if P(B)>0 P(B | A)  P(B) if P(A)>0 ECON1003: Analysis of Economic Data Lesson3-41 Example 9: Statistical Independence  Of the cars on a used car lot, 70% have air conditioning (AC) and 40% have a CD player (CD). 20% of the cars have both. CD No CD Total AC .2 .5 .7 No AC .2 .1 .3 Total .4 .6 1.0  Are the events AC and CD statistically independent? Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-42 Example 9: Statistical Independence CD No CD Total AC .2 .5 .7 No AC .2 .1 .3 Total .4 .6 1.0 P(AC ∩ CD) = 0.2 P(AC) = 0.7 P(AC)P(CD) = (0.7)(0.4) = 0.28 P(CD) = 0.4 P(AC ∩ CD) = 0.2 ≠ P(AC)P(CD) = 0.28 So the two events are not statistically independent Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-43 Bivariate Probabilities Outcomes for bivariate events: B1 B2 ... Bk A1 P(A1B1) P(A1B2) ... P(A1Bk) A2 P(A2B1) P(A2B2) ... P(A2Bk) . . . . . . . . . . . . . . . Ah P(AhB1) P(AhB2) ... P(AhBk) Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-44 Joint and Marginal Probabilities  The probability of a joint event, A ∩ B: P(A  B)  number of outcomes satisfying A and B total number of elementary outcomes  Computing a marginal probability: P(A)  P(A  B1 )  P(A  B2 )  ...  P(A  Bk ) Where B1, B2, …, Bk are k mutually exclusive and collectively exhaustive events Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-45 Example 10: Marginal Probability P(Ace)  P(Ace  Red)  P(Ace  Black)  Type Ka-fu Wong © 2007 2 2 4   52 52 52 Color Red Black Total Ace 2 2 4 Non-Ace 24 24 48 Total 26 26 52 ECON1003: Analysis of Economic Data Lesson3-46 Tree Diagrams  A tree diagram is useful for portraying conditional and joint probabilities. It is particularly useful for analyzing business decisions involving several stages. Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-47 EXAMPLE 11: Tree Diagram In a bag containing 7 red chips and 5 blue chips you select 2 chips one after the other without replacement. Construct a tree diagram showing this information. 6/11 7/12 5/12 R2 R1 5/11 B2 7/11 R2 B1 4/11 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data B2 Lesson3-48 EXAMPLE 11: Tree Diagram The tree diagram is very illustrative about the relation between joint probability and conditional probability Let A (B) be the event of a red chip in the first (second) draw. 6/11 P(B|A) = 6/11 R2 P(A) = 7/12 7/12 5/12 R1 5/11 B2 7/11 R2 B1 4/11 Ka-fu Wong © 2007 P(A and B) = P(A)*P(B|A) = 6/11 * 7/12 B2 ECON1003: Analysis of Economic Data Lesson3-49 Example 12: Using a Tree Diagram Of the cars on a used car lot, 70% have air conditioning (AC) and 40% have a CD player (CD). 20% of the cars have both. P(AC ∩ CD) = .2 All Cars P(AC ∩ CD) = .5 P(AC ∩ CD) = .2 P(AC ∩ CD) = .1 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-50 Odds  The odds in favor of a particular event are given by the ratio of the probability of the event divided by the probability of its complement  The odds in favor of A are P(A) P(A) odds   1- P(A) P(A) Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-51 Example 13: Odds  Calculate the probability of winning if the odds of winning are 3 to 1: 3 P(A) odds   1 1- P(A)  Now multiply both sides by 1 – P(A) and solve for P(A): 3 x (1- P(A)) = P(A) 3 – 3P(A) = P(A) 3 = 4P(A) P(A) = 0.75 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-52 Overinvolvement Ratio  Does A1 depends on B1 and B2 differently?  The probability of event A1 conditional on event B1 divided by the probability of A1 conditional on activity B2 is defined as the overinvolvement ratio: P(A 1 | B1 ) P(A 1 | B 2 )  An overinvolvement ratio greater than 1 implies that event A1 increases the conditional odds ratio in favor of B1: P(B1 | A1 ) P(B1 )  P(B 2 | A1 ) P(B 2 ) Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-53 Example 14: Overinvolvement ratio  If the probability to get lung cancer is 0.5% for smokers and 0.1% for non-smokers, what is the overinvolvement ratio? Prob(lung cancer | smokers) = 0.005 Prob(lung cancer | non-smokers) = 0.001 Overinvolvement ratio = P(L|S)/P(L|N) = 5 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-54 Bayes’ Theorem P(E i | A)   P(A | E i )P(E i ) P(A) P(A | E i )P(E i ) P(A | E 1 )P(E 1 )  P(A | E 2 )P(E 2 )  ...  P(A | E k )P(E k )  where: Ei = ith event of k mutually exclusive and collectively exhaustive events A = new event that might impact P(Ei) Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-55 Bayes’ Theorem P(A1 | B)  P(A1 )P(B | A1 ) P(A1 )P(B | A1 )  P(A 2 )P(B | A 2 )  Bayes’ Theorem can be derived based on simple manipulation of the general multiplication rule. P(A1|B) = P(A1 & B) /P(B) = [P(A1) P(B|A1)] / P(B) = [P(A1) P(B|A1)] / [P(A1 & B) + P(A2 & B)] = [P(A1) P(B|A1) ]/ [P(A1) P(B|A1) + P(A2) P(B|A2)] Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-56 Example 15: Bayes’ Theorem  A drilling company has estimated a 40% chance of striking oil for their new well.  A detailed test has been scheduled for more information. Historically, 60% of successful wells have had detailed tests, and 20% of unsuccessful wells have had detailed tests.  Given that this well has been scheduled for a detailed test, what is the probability that the well will be successful? Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-57 Example 15: Bayes’ Theorem  Let S = successful well U = unsuccessful well  P(S) = .4 , P(U) = .6 (prior probabilities)  Define the detailed test event as D  Conditional probabilities: P(D|S) = .6 P(D|U) = .2  Goal is to find P(S|D) Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-58 Example 15: Bayes’ Theorem Apply Bayes’ Theorem: P(D | S)P(S) P(S | D)  P(D | S)P(S)  P(D | U)P(U) (.6)(. 4)  (.6)(. 4)  (.2)(. 6)  .24  .667 .24  .12 So the revised probability of success (from the original estimate of .4), given that this well has been scheduled for a detailed test, is .667 Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-59 Example 16: Bayes’ Theorem  Duff Cola Company recently received several complaints that their bottles are under-filled. A complaint was received today but the production manager is unable to identify which of the two Springfield plants (A or B) filled this bottle. The following table summarizes the Duff production experience. % of Total Production % of under-filled bottles A 55 3 B 45 4  What is the probability that the under-filled bottle came from plant A? Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-60 Example 16: Bayes’ Theorem % of Total Production % of under-filled bottles A 55 3 B 45 4  What is the probability that the under-filled bottle came from plant A? P(A)P(U | A) P(A/U)  P(A)P(U | A)  P(B)P(U | B) .55(.03)   .4783 .55(.03)  .45(.04) The likelihood the bottle was filled in Plant A is reduced from .55 to .4783. Without the information about U, the manager will say the under-filled bottle is likely from plant A. With the additional information about U, the manager will say the under-filled bottle is likely from plant B. Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-61 Lesson 3: A Survey of Probability Concepts - END - Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson3-62