Download Probability

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Gambler's fallacy wikipedia , lookup

Transcript
Probability Intro
Professor Jim Ritcey
EE 416
Revised Fall 2013
Please elaborate with your own sketches
Disclaimer
• These notes are not complete, but they should
help in organizing the class flow.
• Please augment these notes with your own
sketches and math. You need to actively
participate.
• It is virtually impossible to learn this from a
verbal description or these ppt bullet points. You
must create your own illustrations and actively
solve problems.
Random Experiments
• An experiment is prescribed with N outcomes
• finite {1 2 3} countable {1 2 3 …} uncountable {z>0}
• A chance mechanism selects outcome – uncertainty
• We have some knowledge of the likelihood of
occurrence – often measured empirically through
relative frequency in repeated independent trials
• Kolmogorov developed consistent axiomatic
framework for probability
Relationship to Set Theory
• Set Theory
Probability
• Universe
• Elements
• Algebra of subsets
Sample space
Outcomes
Events
• To each event the theory allows/assigns a number
• Probability (event) indicates its likelihood of
occurrence 0<= p <= 1
Set Operations – PPT Notation
• Complement
A^c = {c: c in S but c not in A}
• Union
(cup) A+B = {c: c in A or c in B }
• Intersection (cap) AB = {c: c in A and c in B }
• Often we use cap/cup/overbar (complement) in
traditional math notation
• MATLAB provides set operations as functions very useful when working with large sets
Algebra 0f Events
• Each experiment has outcomes in sample space S
• Subsets of outcomes are called events.
• The set of events must be an algebra –closure
•
•
•
•
For events A,B,C, … then
All complements are events A^c, B^c, …
All finite unions & intersections are events
All countable unions & intersections are events
• For finite sample spaces the events consist of the
set of all subsets. But too large for infinite S
Probability Triple
• (S, E, P) (Sample Space, Events, Probability)
• Sample space – set of all possible outcomes
• Events – family/collection of all subsets to which
we assign probabilities. Subject to closure under
set operations
• Probability – maps events to unit interval [0,1]
subject to 3 Axioms. For events A,B,C,…
• (1) P(A) >= 0 non-negative probability
• (2) P(S) = 1 something happens w probability 1
• (3) A, B disjoint P(A+B) = P(A) + P(B)
• (3’) Can extend (3) to countable unions
Examples
•
•
•
•
S = {T,H} = {0,1} 1 coin toss
E = { S^c = {}, {0}, {1}, S = {0,1} }
Given P(1) can compute probability of any e in E
In the model, p=P(1), is a parameter
• S = {TT,HT,TH,HH} = {00,10,01,11} 2 coin toss
• E = { S^c = {}, {0}, {1}, S = {0,1} }
• Given P(1) can compute probability of any e in E
• Each event can be given an intuitive name
S = {00,10,01,11}
{H=1,T=0}
e
0
1
2
3
outcomes
1
0
0
0
0
Empty set = nothing happens
2
0
0
0
1
Exactly {11} “2 heads”
3
0
0
1
0
Exactly {01} “tail then head”
4
0
0
1
1
{01} or {11} “head on second toss}
5
0
1
0
0
Exactly {10}
1
0
0
1
Exactly “2 of the same”
1
1
1
0
“Anything but 2 heads” complement of #2
6
7
10
14
15
16
S = something happens
Probability of an Event
• Event is a subset of outcomes
• An event is realized (occurs) if any of its members
occurs
• The outcomes of an experiment are by definition
disjoint
• P (event) is the sum of the probabilities of
outcomes that define it 
• P(event) = P(outcome_1) + …
• But events can contain many outcomes
How to we know P
• P( any event ) is known in the theory. But how?
• Relative frequency in independent repeated trials
• Uniformity considerations – insufficient reason to
suspect otherwise “at random”
• Subjective – expert opinion, must be consistent
• What is the probability that the sun will rise
tomorrow p=1? p<1?
• What is the probability that you will obtain above
3.4 in EE416?
• Engineering usage focuses on relative frequency
• Based on past ( but relevant) history
Theorems are derived from Axioms
• P(A^c) = 1 - P(A)
• P(S) = 1  P(empty set) = 0
• P(A+B) = P(A) + P(B) - P(AB)
• P(A+B) < = P(A) + P(B)
• A subset of B then P(A) <= P(B)
• Proofs are instructive usually they relate to
disjoint events
Partitions
•
•
•
•
If A us an event, and A in B in S (sample space)
Let B_1, …B_N partition B
Then
P(A) = P(AB_1) + … +P(AB_N)
• Simplifies more complicated events
General Problem of Computing Prob
• Given a combination of events, with knowing prob
of each separately, determine prob of combination
• Typical combinations “at least one” “k or more of”
• Always can be written as set operations
• Always can be enumerated, but can be a long list
1 Coin Toss
•
•
•
•
•
•
•
•
Experiment – toss a coin S –{T,H} |S| =2
Here let the events be the power set Pwr(S)
Set of all subsets with 2^2 -4 elements
S^c = Empty set, H, T, {H or T} =S closed family
Assign P(H) = p, p = 1/2 only for a fair coin
Consistent with axioms if p in [0,1] & P(T) = 1-p
Often insufficient reason to take p neq 1/2
Often p should be measured empirically, the
fraction of heads in many trials (how many?)
• The outcomes have prob {pT= 1-p, pH = p}
Geometric Probability
• In many board games spinners are used. You cut a
circular region into n congruent pizza slices, and
flip a spinner. It is obvious that the probability of
any outcome is p = 1/n.
• This can be argued to be the ratio of the arc
length of the wedge to arc length of the circle
• Geometric probabilities reduce to calculation of
relative length/area/volume and are based on an
“at random” assumption. This implies uniformity.
2 Coin Toss
• S = {TT, TH, HT, HH} ordered (T1, T2)
• Pwr(S) has 2^4 = 16 possible events
• It is not possible to combine outcomes in any
other way. Next assign a consistent probability to
• (p1, p2,p3,p4) p_i in [0,1] sum p_i = 1
• Insufficient reason takes p_i = ¼
• But it depends on the random experiment (toss)
• Allowable is any consistent set of probabilities.
• Then eg, P(T1=TorT2=T) = P(HT + TH + TT) =
=P(HT) + P(TH) + P(TT)
Independence
•
•
•
•
Two events A,B are independent when
P(AB) = P(A)P(B) multiply probabilities
This is a property of the probability assignment
Disjoint events are
• Dependent events – statistical relationship/linking
• This allows for prediction of one given the other
• Not a one-way causal relationship
2 Coin Toss
•
•
•
•
•
•
•
•
•
•
Label the outcomes (00, 01, 10, 11) where T=0,H=1
Suppose we measured
P(00) = 0.10 P(01) = 0.30
P(10) = 0.22 P(11) = 0.38
Are the tosses independent?
P(t1=0)=P(00)+P(01) = 0 .1+0.3 =0.40 P(t1=1)= 1-.4
P(t2=0)=P(00)+P(10) = 0.1+0.22=0.32 P(t2=1 = 1- .32
Independence says that
P(t1=0)P(t2=0) = P(00) but this does not hold!
Under this P, Events are not independent!
Are disjoint events independent? NO
• Let A,B mutually exclusive AB = empty set
• P(AB) = P(empty set) = 1-P(S) = 0
• They are not independent
• Disjoint events are highly dependent – if one event
occurs, the other cannot have occurred.
• This must be distinguished from independence
• P(AB)=P(A)P(B) which might be zero if P(A)=0
• A property of the Prob Assnment, not the events
Communications Networks
•
•
•
•
A network is made up of nodes and links (edges)
The link either work 1 or fail 0 with p=P(1) q=P(0)
Links can be connected in various ways
Find Probability P_N that a network works
• Series Net: all links work
P_N = p^N
• Parallel Net: at least 1 link works P_N = 1 - q^N
• Need to work through the derivation
• Networks can combine series and parallel
Target Detection
• A radar takes N looks at a target, target is
detected (when present) with p=0.9
• How does detection prob improve as we increase N
• P_N := 1 –(1-p)^N but we need to explore
numerically
• For small N,
• P = [ p, 2p-p^2, 3p -3p^2 +p^3, … ]
• Is this a series or parallel problem?
Birthday Problem
•
•
•
•
•
Birthdays occur at random on [1:365] no leap yr
Sample r people’s birthdays, n=365
find P(hit) =P( 2or more birthdays in sample)
P(no hit) = # ways to select wo rep/# ways w rep
1-P(hit)= (n)_r / n^r where (n)_r = n!/(n-r)!
• Numerical implications are interesting
Birthday Problem Numerics
• Place r balls at random in n bins. What is the probability that
2 or more land in any bin
• N=365, r=23 for a 50:50 chance of hits
• Randomness causes clumping!
• Table shows P(hit) vs r, number of balls placed
• Balls can be calls, bins can be channels/frequencies
20
0.4
23
0.507
30
0.706
35
0.814
40
0.89
Birthday Problem Analysis
•
•
•
•
•
•
•
Give a simple expression for
P(r,n) = 1 –(n)_r/n^r, when n is large
Write Q = 1-P as a product
Q=1-P(r,n) (n)_r/n^r = Prod_1^(r-1) [1-j/n]
Go to a log form exp( sum of logs)
Approximate the sum & simplify
This is computational probability
• Or just compute use Gamma functions
Birthday Prob (n=365)
Matlab Code
•
•
•
•
•
function birthdayprob( N, K);
%function birthdaypob( N, K);
%compute all birthday probabilities.
kall =[0:K-1];
%Pnohit =(N)_k/N^k;
•
•
•
•
•
•
Pnohit = exp( cumsum( log(1 -kall/N) ) );
Pnoapp = exp( -(kall+1).*(kall)/(2*N) );
% a very good approximation!
X = [1:K]'*[ 1 1];
P1 = [ Pnohit;1-Pnohit]';
P2 = [ Pnoapp; 1-Pnoapp]';
•
•
•
•
•
% note that we are plotting both tails
figure(1); semilogy( X, P1,'-*');
hold on; semilogy( X, P2,'-o'); hold off
title('Exact vs Approx hit Probabilities');
xlabel('Number of Balls Thrown');grid;
Ranking Problems
• You meet Alice, she has two brothers Bob and
Chris. You don’t know anything more.
• What is the probability that Alice is the oldest?
• Use Insufficient Reason to determine
• P(A oldest) = 1/3
• Now she tells you that Alice is older than Bob.
• Determine P(A is oldest).
• Hint: Consider the simplest sample space