Download Probability Distributions.

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
4.2 Probability Distributions
Definition. A random variable is a variable whose value is a numerical
outcome of a random phenomenon. The probability distribution of a
random variable tells us what the possible values of the variable are
and how probabilities are assigned to those values.
Discrete Random Variables
Definition. A discrete random variable X has a finite number of possible values. The probability distribution of X lists the values and their
probabilities:
Value of X x1 x2 x3 · · · xk
Probability p1 p2 p3 · · · pk
Example 4.9. A household is a group of people living together, regardless of their relationship to each other. Many sample surveys such
as the Current Population Survey select a random sample of households. Choose a household at random, and let the random variable X
be the number of people living there. Here is the distribution of X.
Household size
Probability
1
2
3
4
5
6
7
.251 .321 .171 .154 .067 .022 .014
The probability that a randomly chosen household has more than two
1
members is
P (X > 2) = P (X = 3) + P (X = 4) + P (X = 5) + P (X = 6) + P (X = 7)
= .171 + .154 + .067 + .022 + .014 = .428
Equally Likely Outcomes
Definition. If a random phenomenon has k possible outcomes, all
equally likely, then each individual outcome has probability 1/k. The
probability of any event A is
count of outcomes in A
count of all possible outcomes
count of outcomes in A
.
=
k
P (A) =
Example 4.10. Roll two dice and record the pips (dots) on each of the
two up-faces. Figure 4.8 (see TM-65) shows the 36 possible outcomes.
If the dice are carefully made, all 36 outcomes are equally likely. So
each has probability 1/36. Gamblers are often interested in the sum
of the pips on the up faces. What is the probability of rolling a 5?
The event “roll a 5” contains the four outcomes: (1,4), (2,3), (3,2),
(4,1). The probability is therefore 4/36 = 1/9 = 0.111. What about
the probability of rolling a 7? In Figure 4.8 (TM-65) you will find
six outcomes for which the sum of the pips is 7. The probability is
6/36 = 1/6 = 0.167.
2
The Mean and Standard Deviation of a Discrete Random Variable
Definition. Suppose that X is a discrete random variable whose distribution is:
Value of X x1 x2 x3 · · · xk
Probability p1 p2 p3 · · · pk
Find the mean of X by multiplying each possible value by its probability
and adding over all the values:
µ = x 1 p1 + x 2 p2 + · · · + x k pk =
n
i=1
x i pi .
Note. The mean of a random variable X is a single fixed number µ.
It gives the average value of X in several senses:
• The mean µ is the average of the possible values of X, each
weighted by how likely it is to occur. That’s what the definition
of µ says.
• The mean µ is the point at which the probability histogram of
the distribution of X would balance if made of solid material. See
Figure 4.9 (and TM-66). Recall that the mean µ of a density curve
has this same property.
• If we actually repeat the random phenomenon many times, record
the value of X each time, and average these observed values, this
average will get closer and closer to µ as we make more and more
repititions. This fact is called the law of large numbers.
3
Definition. Suppose that X is a discrete random variable whose distribution is:
Value of X x1 x2 x3 · · · xk
Probability p1 p2 p3 · · · pk
and that µ is the mean of X. The variance of X is
σ 2 = (x1 − µ)2 p1 + (x2 − µ)2 p2 + · · · + (xk − µ)2 pk =
n
(xi − µ)2pi .
i=1
The standard deviation σ is the square root of the variance.
Continuous Random Variables
Definition. A continuous random variable X takes all values in an
interval of numbers. The probability distribution of X is described by
a density curve. The probability of any event is the area under the
density curve and above the values of X that make up the event.
Note. The distribution of a continuous random variable assigns probabilities as areas under a density curve. See Figure 4.10 (and TM-67).
Definition (for those with some calculus background). Suppose
that X is a continuous random variable with probability distribution
P (X). The mean of X is
µ=
xP (x) dx
and the variance of X is
2
σ = (x − µ)2 P (x) dx,
4
where the integrals are taken over all possible values of X. The standard
deviation σ is the square root of the variance.
5
Related documents