Download Random Variable

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
Transcript
THE UNIVERSITY OF HONG KONG
DEPARTMENT OF STATISTICS AND ACTUARIAL SCIENCE
STAT1301 PROBABILITY AND STATISTICS I, FALL 2010
EXAMPLE CLASS 3
Random Variable
Elements of Theory
Random Variable as a (measurable) Function between a State Space and a Sample Space
A function (with some requirement)
random variable.
-
defined on the state space
{ } is called a
Domain is the state space
Range is usually a numbers set, e.g., or its subsets, for easy manipulation.
The range is called the sample space of the random variable. There is no intrinsic difference on the
nature between a sample space and a state space—they are just two sets with some requirement, called
“measurability.” They are just domain and range of a “function” with some requirement, called
“measurability.”
“Random Variable” is a “Function”
The variable perspective is adopted by an observer of a random experiment. The observer is only able to
observe/know/measure/obtain information based on the sample space. For the observer, all she could see is a
variable dancing (randomly) on the sample space. This is the perspective that we will primarily study in this course.
The function perspective is adopted by someone who would have a “divine” capacity in understanding (a
deterministic part of) the design of the random mechanism, in particular, her capacity in seeing the existence of an
underlying state space as the domain of a function sending elements of the domain to the state space. You will be
studying this perspective in an advanced course of probability.
Distribution: Law of A Random Variable’s Dance
There is a law governing how any random variable to be observed in its sample space. The law is a probabilistic one,
called the probability distribution of a random variable. There are two qualifications for any real-valued function
to be a probability density/mass function, aka, probability function:
1)
for any
,
or ∑
2) ∫
.
Real-valued Random Variables
Generally, we categorize all real-valued random variables in 3 groups: (1) Discrete Random Variables (its sample
space is a discrete subset in ); (2) Continuous Random Variables (its sample space is a “continuous” subset in ); (3)
Partially Discrete and Partially Continuous Random Variables. The Distribution of a Discrete/Continuous Random
Variable is called its Probability Mass/Density Function because of a superficial difference in mathematical
treatments and graphical representation.
Discrete
Random
Variable
Probability
Density
Function
Probability
Mass
Function
Cumulative
Distribution
Function
Expectation
Continuous
{
{
}
-
{
{ }
∑
{ }
}
∫
[
}
Problems
1. Suppose that you are invited to play a game with the following rules: One of the numbers 2, … , 12
is chosen at random by throwing a pair of dice and adding the numbers shown. You win 9 dollars in
case 2, 3, 11, or 12 comes out, or lose 10 dollars if the outcome is 7. Otherwise, you do not win or lose
anything. This defines a function on the set of all possible outcomes {2, … , 12}, the value of the
function being the corresponding gain (or loss if the value is negative). What is the probability that the
function takes positive values, i.e,, that you will win some money?
Solution.
As shown in the figure, the answer is
.
Remark.
The Choice of sample space is relative. As long as a set is the range of some random variable, it is a
sample space in effect, although it might also be serving as the state space of another random variable.
An example is the sample space 1 above. Further in this case, the composite function of random
variable 1 and 2 above becomes a new random variable sending elements directly from the state space
to sample space 2. Also note that the probability distribution on each of the three spaces above must
be “consistent.” This consistency makes it possible to define a random variable between any pairs from
the three sets.
2. An urn contains 7 white balls numbered 1, 2, …, 7 and 3 black balls numbered 8, 9, 10. Five balls
are randomly selected without replacement.
Now give the distribution:
I.
II.
III.
IV.
of the number of white balls in the sample;
of the minimum number in the sample;
of the maximum number in the sample;
of the minimum number of balls needed for selecting a white ball.
Solution.
I. As show in the figure (there we use 0 to denote 10 for neatness), the random variable is sending
elements from the state space {12345, 12346, ……, 67890} to the sample space {2,3,4,5}.
The P.M.F. on the sample space is tallied as the following:
=
2
3
4
5
( )( )
( )( )
( )( )
( )( )
(
)
(
)
(
)
(
)
II.
Only 1-6 can possibly be the minimum number in any 5 labels drawn. {1,2,…, 6} will be the sample
space.
1
2
3
4
5
6
( )
( )
( )
( )
( )
( )
(
(
(
(
(
(
)
)
)
)
)
)
III.
Only 5-10 can possibly be the maximum number in any 5 labels drawn. {5,6,7,8,9,10} will be the
sample space. This case is symmetric to II.
10
9
8
7
6
5
( )
( )
( )
( )
( )
( )
(
(
(
(
(
(
)
)
)
)
)
)
IV.
If you draw 4 balls there must be at least 1 white since there are only 3 blacks. If you draw 3 balls it is
possible that all the 3 are blacks. Therefore the sample space is the set {1,2,3,4}.
1
2
3
4
3. A random variable is called discrete whenever there is a countable set
A random variable is said to have the binomial distribution
, where
{ }
for
such that
and
[
.
, if
( )
. Verify that such a random variable is discrete.
Solution.
First be aware of the following identity from elementary algebra:
( )
Substitute
and
by
( )
and
( )
(
)
( )
respectively will give
∑( )
This means there exists a countable set {
discrete random variable.
}
such that
{
}
. Therefore
is a
4. The amount of bread (in hundreds of kilos) that a bakery sells in a day is a random variable with
density
{
a) Find the value of which makes a probability density function.
b) What is the probability that the number of kilos of bread that will sold in a day is, (i) more than
300 kilos? (ii) between 150 and 450 kilos?
c) Denote by and the events in (i) and (ii), respectively. Are and independent events?
Solution.
a) First,
must be
to make
. Second,
∫
b) Since
(i)
is continuous at
∫
∫
{
true:
∫
, therefore
∫
(ii)
c)
has to make ∫
}
,
∫
∫
.
.
{
}
{
Now that
,
.
}
,
∫
.
, they are independent events.
5. A number is randomly chosen from the interval (0,1). What is the probability that:
a) its first decimal digit will be a 1;
b) its second decimal digit will be a 5;
c) the first decimal digit of its square root will be a 3?
Solution.
First, some clarification. A “number” here means a point of on the axis of real numbers. The usual
representation of such a point is called the “decimal expansion” of the number, using digits 0~9. For
example, the mid-point of the interval (0,1) has decimal expansion
, or equivalently,
.
a) Since the manner of drawing a point is random, each of the possible first decimal digits will be
equiprobable, making
as the answer.
b) An unconscious reasoning to this part will be “there is no difference
between a) and b) except for the superficial difference in positions of
the digits”, leading to the same
as the answer.
A more analytical way of thinking about the two questions is to
consider two random variables
{
}
{
}
to be the (deterministic) mechanisms that tell you the first and second decimal digits, respectively, of
the number (randomly) drawn.
From the graphs, it is clear the answers derived from our previous unconscious reasoning are correct.
c) Define the random variable
{
√
}
{
}
6. Find an example of two different random variables
and
with the same distribution
.
Solution.
Question 1 already provide us such an example: is the random variable 2, and is the composite of
random variables 1 and 2. Thus they share the same sample space and distribution.
Another example is to consider tossing a fair coin with sample space {
} R.V.
7. (a) A project will bring $1M profit if completed. If
the probability of completion is 80%, what is the
expected profit?
(b) For every set of real numbers, we define the
indicator function by
{
Show that
[
∫
Solution.
(a) $0.8M.
(b) [
{
}
{
}
{
}
∫
Or, from the figure we have the equation:
[
(
[
{ } )
{ }
[
{ }
∫
is defined as: