Download Document

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
Continuous Probability Distributions
Chapter Contents
Uniform Continuous Distribution
Exponential Distribution
Describing a Continuous Distribution
Normal Distribution
Standard Normal Distribution
Normal Approximations
7-1
Continuous Probability Distributions
Chapter Learning Objectives
Define a continuous random variable.
Calculate uniform probabilities.
Know the form and parameters of the normal distribution.
Find the normal probability for given z or x using tables or Excel.
Solve for z or x for a given normal probability using tables or Excel.
Use the normal approximation to a binomial or a Poisson
distribution.
Find the exponential probability for a given x.
Solve for x for given exponential probability.
7-2
Uniform Continuous Distribution
LO7-2
Characteristics of the Uniform
Distribution
If X is a random variable that is
uniformly distributed between
a and b, its PDF has
constant height.
•
•
Denoted U(a, b)
Area =
base x height =
(b-a) x 1/(b-a) = 1
7-3
Characteristics of the Uniform Distribution
7-4
Example: Anesthesia Effectiveness
•
•
•
•
An oral surgeon injects a painkiller prior to extracting a tooth. Given the
varying characteristics of patients, the dentist views the time for
anesthesia effectiveness as a uniform random variable that takes
between 15 minutes and 30 minutes.
X is U(15, 30)
a = 15, b = 30, find the mean and standard deviation.
Find the probability that the effectiveness anesthetic takes between
20 and 25 minutes.
7-5
Example: Anesthesia Effectiveness
P(20 < X < 25) = (25 – 20)/(30 – 15) = 5/15 = 0.3333 = 33.33%
7-6
1. The time that the customers at the “self serve” check out stations
at the Mejers store spend checking out follows a uniform distribution
between 0 and 3 minutes.
a.
Determine the height and draw this uniform distribution.
b. How long does the typical customer wait to check out?
c. Determine the standard deviation of the wait time.
d. What is the probability a particular customer will wait less than one
minute?
e. What is the probability a particular customer will wait between 1.5 and
2 minutes?
7-7
2. A statistics instructor collected data on the time it takes the students
to complete a test. The test taking time is uniformly distributed within a
range of 35 minutes to 55 minutes.
a. Determine the height.
b. How long does the typical test taking time?
c. Determine the standard deviation of the test taking time.
d. What is the probability a particular student will take
less than 45 minutes?
e. What is the probability a particular student will take
between 45 and 50 minutes?
7-8
3. A tube of Listerine Tartar Control toothpaste contains 4.2 ounces. As people
use the toothpaste, the amount remaining in any tube is random. Assume the
amount of toothpaste left in the tube follows a uniform distribution. From this
information, we can determine the following information about the amount
remaining in a toothpaste tube without invading anyone’s privacy.
a. How much toothpaste would you expect to be remaining in the tube?
b. What is the standard deviation of the amount remaining in the tube?
c. What is the likelihood there is less than 3.0 ounces remaining in the
tube?
d. What is the probability there is more than 1.5 ounces remaining in
the tube?
7-9
Characteristics of the Exponential Distribution
•
•
If events per unit of time follow a Poisson distribution, the time until the
next event follows the Exponential distribution.
The time until the next event is a continuous variable.
NOTE: Here
we will find
probabilities
> x or ≤ x.
7-10
Characteristics of the Exponential Distribution
Probability of waiting less than or
equal to x
Probability of waiting more than x
7-11
Example Customer Waiting Time
•
•
•
•
Between 2P.M. and 4P.M. on Wednesday, patient insurance
inquiries arrive at Blue Choice insurance at a mean rate of 2.2 calls
per minute.
What is the probability of waiting more than 30 seconds (i.e., 0.50
minutes) for the next call?
Set  = 2.2 events/min and x = 0.50 min
P(X > 0.50) = e–x = e–(2.2)(0.5) = .3329
or 33.29% chance of waiting more than 30 seconds for the next
call.
7-12
Example Customer Waiting Time
P(X > 0.50)
P(X ≤ 0.50)
7-13
Inverse Exponential
•
•
If the mean arrival rate is 2.2 calls per minute, we want the 90th
percentile for waiting time (the top 10% of waiting time).
Find the x-value
that defines the
upper 10%.
7-14
Inverse Exponential
7-15
Mean Time Between Events
7-16
5. If arrivals occur at a mean rate of 3.6 events per hour, the exponential
probability of:
waiting more than 0.5 hour for the next arrival is:
P(X > .50) = exp(-3.6 × 0.50) = .1653.
6. waiting less than 0.5 hour for the next arrival is:
P(X < .50) = 1 - exp(-3.6 × 0.50) = 1 - .1653 = .8347
7. If arrivals occur at a mean rate of 2.6 events per minute, the
exponential probability of waiting more than 1.5 minutes for the next
arrival is:
P(X > 1.5) = exp(-2.6 × 1.50) = .0202
7-17
8. If arrivals occur at a mean rate of 1.6 events per minute, the
exponential probability of waiting less than 1 minute for the next
arrival is:
(X < 1) = 1 - exp(-1.6 × 1) = 1 - .2019 = .7981.
9. On average, a major earthquake (Richter scale 6.0 or above)
occurs 3 times a decade in a certain California county. What is the
probability that less than six months will pass before the next
earthquake?
Set λ = 3/120 = 0.025 earthquake/month so
P(X < 6) = 1 - exp(-0.025 × 6) = 1 - .8607 = .1393
7-18
Events as Intervals
•
•
Discrete Variable – each value of X has its own probability P(X).
Continuous Variable – events are intervals and probabilities are
areas under continuous curves. A single point has no probability.
7-19
PDF – Probability Density Function
Continuous PDF’s:
•
Denoted f(x)
•
Must be nonnegative
•
Total area under
curve = 1
•
Mean, variance and
shape depend on
the PDF parameters
•
Reveals the shape
of the distribution
7-20
CDF – Cumulative Distribution Function
Continuous CDF’s:
•
•
•
Denoted F(x)
Shows P(X ≤ x), the
cumulative proportion
of scores
Useful for finding
probabilities
7-21
Probabilities as Areas
Continuous probability functions:
•
•
•
Unlike discrete
distributions, the
probability at any
single point = 0.
The entire area under
any PDF, by definition,
is set to 1.
Mean is the balance
point of the distribution.
7-22
Characteristics of the Normal Distribution
•
Defined by two parameters, µ and .
•
Domain is –  < X < +  (continuous scale).
•
Almost all (99.7%) of the area under the normal curve is included in the
range µ – 3 < X < µ + 3.
•
Symmetric and unimodal about the mean.
7-23
Characteristics of the Normal Distribution
7-24
Characteristics of the Normal Distribution
•
Normal PDF f(x) reaches a maximum at µ and has points of inflection at
µ ±
Bell-shaped curve
NOTE: All normal
distributions
have the same
shape but differ
in the axis scales.
7-25
Characteristics of the Normal Distribution
•
Normal CDF
7-26
Characteristics of the Standard Normal Distribution
•
Since for every value of µ and , there is a different normal distribution, we transform a normal
random variable to a standard normal distribution with µ = 0 and  = 1 using the formula.
7-27
Characteristics of the Standard Normal
•
Standard normal PDF f(x) reaches a maximum at z = 0 and has
points of inflection at +1.
•
Shape is unaffected by
the transformation.
It is still a bell-shaped
curve.
Figure 7.11
7-28
Characteristics of the Standard Normal
•
Standard normal CDF
•
•
•
•
A common scale
from -3 to +3 is used.
Entire area under the
curve is unity.
The probability of an
event P(z1 < Z < z2)
is a definite integral
of f(z).
However, standard
normal tables or
Excel functions can
be used to find the
desired probabilities.
7-29
Normal Areas from Table 1
•
The table allows you to find the area under the curve from 0 to z.
•
For example, find P(0 < Z < 1.96):
7-30
Normal Areas from Table 1
•
•
Now find P(-1.96 < Z < 1.96).
Due to symmetry, P(-1.96 < Z) is the same as P(Z < 1.96).
•
So, P(-1.96 < Z < 1.96) = .4750 + .4750 = .9500 or 95% of the
area under the curve.
7-31
Basis for the Empirical Rule
•
•
•
Approximately 68% of the area under the curve is between + 1
Approximately 95% of the area under the curve is between + 2
Approximately 99.7% of the area under the curve is between + 3
7-32
Normal Areas from Cumulative Table 2
•
Cumulative table allows you to find the area under the curve from the left
of z (similar to Excel).
•
For example,
P(Z < 1.96)
P(Z < -1.96)
P(-1.96 < Z < 1.96)
7-33
Normal Areas from Tables
•
•
Both tables yield identical results.
Use whichever table is easiest.
Finding z for a Given Area
•
•
•
Both tables can be used to find the
z-value corresponding to a given probability.
For example, what z-value defines the top 1% of a normal
distribution?
This implies that 49% of the area lies between 0 and z which
gives z = 2.33 by looking for an area of 0.4900 in Table1.
7-34
Finding Areas by using Standardized Variables
•
Suppose John took an economics exam and scored 86 points. The class
mean was 75 with a standard deviation of 7. What percentile is John in?
That is, what is P(X < 86) where X represents the exam scores?
•
So John’s score is 1.57 standard deviations about the mean.
•
P(X < 86) = P(Z < 1.57) = .9418 (from Appendix C-2)
•
So, John is approximately in the 94th percentile.
7-35
The net sales and the number of employees for aluminum fabricators with
similar characteristics are organized into frequency distributions. Both are
normally distributed. For the net sales, the mean is $180 million and the
standard deviation is $25 million. For the number of employees, the mean is
1,500 and the standard deviation is 120. Clarion Fabricators had sales of $170
million and 1,850 employees.
a. Convert Clarion’s sales and number of employees to z values.
b. Locate the two z values.
c. Compare Clarion’s sales and number of employees with those of the other
fabricators.
7-36
•
Finding Areas by using Standardized Variables
NOTE: You can use Excel, Minitab, TI83/84 etc. to compute these
probabilities directly.
7-37
LO7-5: Solve for z or x for a normal probability using tables or Excel.
•
Inverse Normal
• How can we find the various normal percentiles (5th, 10th, 25th, 75th,
90th, 95th, etc.) known as the inverse normal? That is, how can we
find X for a given area? We simply turn the standardizing
transformation around:
Solving for x in z = (x − μ)/ gives x = μ + zσ
7-38
•
Inverse Normal
• For example, suppose that John’s economics professor has decided
that any student who scores below the 10th percentile must retake the
exam.
• The exam scores are normal with μ = 75 and σ = 7.
• What is the score that would require a student to retake the exam?
• We need to find the value of x that satisfies P(X < x) = .10.
• The z-score for with the 10th percentile is z = −1.28.
7-39
•
Inverse Normal
• The steps to solve the problem are:
• Use Appendix C or Excel to find z = −1.28 to satisfy P(Z < −1.28) = .10.
• Substitute the given information into z = (x − μ)/σ to get
−1.28 = (x − 75)/7
• Solve for x to get x = 75 − (1.28)(7) = 66.03 (or 66 after rounding)
• Students who score below 66 points on the economics exam will be
required to retake the exam.
7-40
•
Inverse Normal
7-41
Normal Approximation to the Binomial
•
•
•
Binomial probabilities are difficult to calculate when n is large.
Use a normal approximation to the binomial distribution.
As n becomes large, the binomial bars become smaller and continuity is
approached.
7-42
Normal Approximation to the Binomial
•
•
Rule of thumb: when n ≥ 10 and n(1- ) ≥ 10, then it is appropriate to use
the normal approximation to the binomial distribution.
In this case, the mean and standard deviation for the binomial distribution
will be equal to the normal µ and , respectively.
Example Coin Flips
•
If we were to flip a coin n = 32 times and  = .50, are the
requirements for a normal approximation to the binomial distribution
met?
7-43
Example Coin Flips
•
•
•
•
•
•
n = 32 x .50 = 16
n(1- ) = 32 x (1 - .50) = 16
So, a normal approximation can be used.
When translating a discrete scale into a continuous scale,
care must be taken about individual points.
For example, find the probability of more than 17 heads in
32 flips of a fair coin.
This can be written as P(X  18).
However, “more than 17” actually falls between 17 and 18
on a discrete scale.
7-44
Example Coin Flips
•
•
•
Since the cutoff point for “more than 17” is halfway between 17 and 18, we
add 0.5 to the lower limit and find P(X > 17.5).
This addition to X is called the Continuity Correction.
At this point, the problem can be completed as any normal distribution
problem.
7-45
Example Coin Flips
P(X > 17) = P(X ≥ 18)  P(X ≥ 17.5)
= P(Z > 0.53) = 0.2981
7-46