Download Chapter 2

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Randomness wikipedia , lookup

Simulated annealing wikipedia , lookup

Pattern recognition wikipedia , lookup

Generalized linear model wikipedia , lookup

Birthday problem wikipedia , lookup

Probability amplitude wikipedia , lookup

Transcript
CHAPTER
2
REVIEW OF STATISTICS: PROBABILITY AND
PROBABILITY DISTRIBUTIONS
QUESTIONS
2.1.
See Sections 2.2, 2.4, 2.5, and 2.6.
2.2.
No. Notice that a pair of events, A and B, are mutually exclusive if they
cannot occur jointly, that is, P(AB) = 0. Independence, on the other hand,
means that P(AB) = P(A) P(B). Consider this example. Let A = the card is
a heart and B = the card is an ace. A card is drawn from a deck of 52 cards.
We know that P(A) = 1/4 and that P(B) = 1/13. The probability of the event
that a card is both a heart and an ace is P(AB) = 1/52 = P(A) P(B). Hence
the two events are independent.
But they are not mutually exclusive
because the ace of hearts could be drawn.
2.3.
(a) True, (b) True.
2.4.
(a) Yes, they are also collectively exhaustive.
(b) (i) Events E1 and A2 occur together, (ii) events E3 or A3 occur,
(iii) E1 or A1 occur and similarly for the other three combinations;
(iv) events E2 A1 , E3 A2 , E4 A3 occur (Each pair occurs together).
Note that forecasts and actual events need not coincide. It is possible that
E1 was predicted, but the actual growth was A 4 and vice versa.
2.5.
PDF relates to a continuous variable and PMF to a discrete variable.
2.6.
The CDF of a discrete variable is a step function, whereas that of a
continuous variable is a continuous curve.
2.7.
Making the substitution, we obtain P( A | B) 
P( B | A) P( A)
. This is simply
P( B)
Bayes’ formula. If we think of A as a possible hypothesis about some
phenomenon, Bayes’ theorem shows how opinions about this hypothesis
held a priori should be modified in light of actual experience. In Bayesian
9
language, P(A) is known as prior probability and P( A | B) is known as
posterior (or revised) probability.
PROBLEMS
4
2.8.
(a)
 xi 1 = x0 +
x + x2 + x3 (Note: x0 = 1).
i 1
(b)
(c)
6
6
i 2
i 2
 ayi  a yi  a( y2  y3  y4  y5  y6 )
2
2
2
i 1
i 1
i 1
 (2 xi  3 yi )  2 xi  3 yi  2( x1  x2 )  3( y1  y2 )
3
(d)
2
 xi yi  x1 y1  x2 y1  x3 y1  x1 y2  x2 y2  x3 y2
i 1 j 1
(e)
4
4
4
i 1
i 1
i 1
 i  4   i  4 = (1 + 2 + 3 + 4) + (4)(4) = 26
3
(f)
 3i  3  32  33 = 39
i 1
10
(g)  2 = (2)(10) = 20
i 1
3
3
3
x 1
x 1
x 1
(h)  (4 x 2  3)  4 x 2   3  4(12  22  32 )  (3)(3) = 47
5
2.9.
(a)
 xi
( i from 1 to 5)
i 1
5
(b)
 i xi (i from 1 to 5)
i 1
k
(c)
 ( xi2  yi2 ) (i from 1 to k)
i 1
2.10.
(a) [500 (500 + 1)] / 2 = 125,250
(b)
100
9
1
1
k k
= [100 (101)] / 2 – [9 (10)] / 2 = 5,005
100
(c) 3 k = 3(5,005) = 15,015, using (b) above.
10
10
2.11.
(a) [10 (11)(21)] / 6 = 385
(b)
(c)
20
9
1
1
19
10
1
1
20(21)(41) 9(10)(19)
= 2,585

6
6
k2  k2 
k2  k2 
19(20)(39) 10(11)(21)
= 2,085

6
6
10
(d) 4 k 2 = 4(385) = 1,540, using (a) above.
1
2.12.
(a) Since
 f( X ) = 1,
(b + 2b + 3b + 4b + 5b) = 15b = 1. Therefore, we
have b = 1/15.
(b) P(X 2) = 6/15;
2.13.
P(X 3) = 10/15;
P(2X 3) = 4/15
(a) Marginal distributions:
X
1
2
3
Y
1
2
3
4
f(X)
0.20
0.40
0.40
f(Y)
0.15
0.10
0.45
0.30
(b) Conditional distributions:
f(X|Y)
f(Y|X)
P(X = 1 | Y = 1) = 0.03 / 0.15 = 0.20 P(Y = 1 | X = 1) = 0.03 / 0.20 = 0.15
P(X = 2 | Y = 1) = 0.06 / 0.15 = 0.40 P(Y = 2 | X = 1) = 0.02 / 0.20 = 0.10
P(X = 3 | Y = 1) = 0.06 / 0.15 = 0.40 P(Y = 3 | X = 1) = 0.09 / 0.20 = 0.45
……….
P(Y = 4 | X = 1) = 0.06 / 0.20 = 0.30
……….
……….
The remaining conditional distributions can be derived similarly.
2.14.
Let B represent the event that a person reads the Wall Street Journal and let
A1, A2, and A3 denote, respectively, the events a Democrat, a Republican,
and an Independent. We want to find out P( A2 | B) :
11
P( A2 | B) =

P( B | A2 ) P( A2 )
P( B | A2 ) P( A2 )  P( B | A1 ) P( A1 )  P( B | A3 ) P( A3 )
(0.6)(0.4)
= 0.558
(0.6)(0.4)  (0.3)(0.5)  (0.4)(0.1)
Note that the prior probability of sampling a Republican is 0.4 or 40%. But
knowing that someone is found reading the Wall Street Journal, the
probability of sampling a Republican increases to 0.558 or 55.8%. This
makes sense, for it has been observed that proportionately more
Republicans than Democrats or Independents read the
Journal.
This
example is an illustration of Bayes’ Theorem.
2.15.
This is P ( A  B ) or P( A  B) = 0.9.
2.16.
(a) No, for the probability that this happens is 0.2 and not zero.
(b) Let A denote having children and B denote work outside home. If these
two events are to be independent, we must have P(AB) =P(A) P(B). In the
present case, P(AB) = 0.2 and P(A) = 0.5 and P(B) = 0.6. Since in this case
P(AB) P(A) P(B), the two events are not independent.
2.17.
From Table 2-9, it can be seen that
X Below poverty Above poverty
f(Y) 
Y
White
0.08
0.67
0.75
Black
0.03
0.09
0.12
Hispanic
0.03
0.10
0.13
0.14
0.86
1.00
f(X) 
(a) f (X | Y = White) =
f ( X ,Y  White) 0.08
0.1067

f (Y  White)
0.75
f (X | Y = Black) =
f ( X ,Y  Black ) 0.03

= 0.2500
f (Y  Black )
0.12
12
f (X | Y = Hispanic) =
f ( X ,Y  Hispanic ) 0.03
0.2308

f (Y  Hispanic )
0.13
(b) They are not. For them to be independent, f(X,Y) = f(X) f(Y) must hold
true for all combinations of X and Y values, which is not the case here.
2
2.18.
(a) For it to be a proper PDF, we must have c  (4 x  2 x 2 )dx  1. That is,
0
2
8
3
2 
 16 16 
4
c  x 2  x 3  =1. That is, c   =1, or c = 1, which gives c  .
3
8
3 0
 2 3
2
2
3
3 4
2 
3 4 1
(4 x  2 x 2 )dx   x 2  x3      .

81
8 2
3 1 8  3  2
2
(b)
2
(c) P(x > 2) = 1 – P(x < 2) = 1 
1
2.19.
(a) f(x) =
12
 5 (2 x  x
2
3
(4 x  2 x 2 )dx = 1 – 1 = 0 .

80
 xy)dy 
0
18 12 2
x x .
5
5
12 
 18
Therefore, P(x > 0.5) = 1 – P(x < 0.5) = 1    x  x 2  dx = 0.65.
5
5 
0
0.5
Following a similar procedure, you can verify that P(y < 0.5) = 0.65.
(b) f ( x | y ) 
f ( x , y)

f ( y)
f ( x , y)
1
x(2  x  y )
1
 f ( x , y)dx  x(2  x  y)dx
0
=

0
6 x(2  x  y )
.
4  3y
13

x(2  x  y )
2 y

3 2