Download 7.0 Continuous Random Variable - University of Northern Colorado

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Randomness wikipedia , lookup

Probability wikipedia , lookup

Probability interpretations wikipedia , lookup

Transcript
Continuous Random Variables
Page 1
Continuous Probability Distribution
Counting the Uncountable
Suppose we make n observations of a continuous variable between the values a and b. To
organize these data, we create a relative frequency histogram using m uniform intervals.
Let the function h(x) be defined such that h(xi) is the relative frequency of the ith interval.
0.30
y = f(x)
0.25
0.20
0.15
h(x4)
0.10
0.05
h(x3)
h(x1)
h(x2)
h(xm)
0.00
0
a x11
x22
x33
x44
5
6
.
7.
.
8
9
xm
10
b
11
The probability that the random variable X falls in the interval (a,b) is the sum of the
probabilities of the relative frequency histogram. That is,
P(a < X < b) = P(x1) + P(x2) + P(x3) + … + P(xm).
Notice that the probability P(xi) is the area of the bar in the relative frequency histogram. We
know the height of the bar, h(xi). Let the width of the bar be denoted
b−a
∆x =
m
where m is the number of intervals in the histogram. Thus,
m
P(a < X < b) = h(x1)∆x + h(x2)∆x + h(x3)∆x + … + h(xm)∆x =
∑ h( x )∆x
i =1
Robert A. Powers
i
University of Northern Colorado
Continuous Random Variables
Page 2
To infinity and beyond…
Let us now consider what happens when the number of observations n increases without bound,
and the width of the interval ∆x subsequently decreases without bound.
The first assumption is that the relative frequency histogram h(x) will approach a continuous
function f(x) that gives the true probability of selecting a continuous variable from a to b.
Given that is the case, then
0.3
m
0.25
i =1
0.2
P(a < X < b) = lim ∑ f ( xi )∆x
∆x →0
Hopefully, the right hand side of the equation looks
familiar. This is the definition of a definite integral
of f from a to b. Thus,
a
0.05
b
11
10
9
8
7
6
5
4
3
a
2
0
f ( x )dx .
1
∫
b
0.1
0
P(a < X < b) =
0.15
Continuous Random Variables
When dealing with a continuous random variable, a function f(x) must be known from either
theory or observation that determines the true probability within each subinterval of all possible
values. The function f(x) must be nonnegative for all possible values of the continuous random
variable. The total area between the function f(x) and the x-axis must be 1 for all possible values.
Finally, the probability of randomly selecting the variable X in the interval (a,b) is determined by
the area bounded by the function f(x), the x-axis, and the vertical lines at x = a and x = b.
The following definition states these ideas formally.
Definition: The probability density function (p.d.f.) of a continuous random variable X with
sample space S that is an interval or union of intervals is an integrable function f(x) satisfying
the following conditions:
a. f(x) ≥ 0, x ∈ S
b.
∫
S
f ( x )dx = 1
c. If (a,b) ⊆ S, then the probability of the event {a < X < b} is
P(a < X < b) =
Robert A. Powers
∫
b
a
f ( x )dx .
University of Northern Colorado
Continuous Random Variables
Page 3
1. The following graph represents a probability density function f(x) of the continuous random
variable X. Write an expression for the probability that 1 < X < 4. Identify on the graph the
probability that 1 < X < 4.
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
0
1
2
3
4
5
6
7
8
9
10
9
10
2. Let the continuous random variable X be modeled by the probability density function
g ( x) = e − x for values of x greater than 0. Determine the probability that 1 < X < 3.
Represent the probability using the graph of g(x).
1.5
1.4
1.3
1.2
1.1
1
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
0
Robert A. Powers
1
2
3
4
5
6
7
8
University of Northern Colorado