Download Chapter 5: The Normal Distribution

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
Chapter 5: The Normal Distribution
MODELING CONTINUOUS VARIABLES
Histogram 6.1
Proportion
5
0.08
0.06
0.04
0.02
30
35
40
45
50
55
Age
If we draw a curve through the tops of the bars in Histogram 6.1 and require the smoothed
curve to have total area under it equal to 1, we would have what is called a density
function, also called a density curve. The key idea when working with density functions
is that area under the curve, above an interval, corresponds to the proportion of units with
values in the interval.
NOTATION...
Since we will be discussing models for populations, the mean and standard deviation for a
density curve or model will be represented by
 (mu) and 
(sigma), respectively.
DEFINITION:
A density function is a (nonnegative) function or curve that describes the overall shape of
a distribution. The total area under the entire curve is equal to 1, and proportions are
measured as areas under the density function.
1
Normal Distributions
General Notation
X is N(  , ) means
that
the
variable
or
characteristic X is normally distributed with mean


Threeand
members
of the family
of normal .distributions
standard
deviation
Distribution #3:
Normal with a mean of 80
and a standard deviation of 5
Distribution #1:
Normal with a mean of 50
and a standard deviation of 10
20
30
40
50
Distribution #2:
Normal with a mean of 80
and a standard deviation of 10
60
70
80
90
100
2
APPLY THE EMPIRICAL RULE
Example 1 IQ Scores
Let the variable X represent IQ scores of 12-year-olds. Suppose that the distribution of X is
normal with a mean of 100 and a standard deviation of 16—that is, X is
N(100, 16). Jessica is a 12-year-old and has an IQ score of 132.We would like to determine
the proportion of 12-year olds that have IQ scores less than Jessica’s score of 132.
Since the area under the density curve corresponds to proportion, we want to find the area
to the left of 132 under an N(100, 16) curve. Sketch this curve and show the corresponding
area that represents this proportion.
IQ Scores have a
normal distribution
with mean 100 and
standard deviation 16

area to the left of 132 = ?
68
84
100
116
132
IQ Score
Let's Do It!
A physical fitness association is including the mile run in its secondary-school fitness test.
The time for this event for boys in secondary school is known to possess a normal
distribution with a mean of 460 seconds and a standard deviation of 50 seconds. Between
what times do we expect approximately 95% of the boys to run the mile?
3
STANDARD NORMAL DISTRIBUTION AND Z-SCORES
DEFINITION:
If
X is N(  , ) ,
the standardized normal variable
Z
X

is
N 0 ,1.
DEFINITION:
The z-score or standard score for an observed value tells us how many standard
deviations the observed value is from the mean – that is, it tells us how far the observed
value is from the mean in standard-deviation units. It is computed as follows:
Z
X

= number of standard deviations that
X differs from the mean

If Z > 0, then the value of X is above (greater than) its mean.
If Z < 0, then the value of X is below (less than) its mean.
If Z = 0, then the value of X is equal to its mean.
Example 2 Standard IQ Score
Recall the distribution of IQ scores for 12-year-olds—normally distributed with a mean of
100 and a standard deviation of 16.
(a)
Jessica had a score of 132. Compute Jessica’s standardized score.
(b)
Suppose Jessica has an older brother, Mike, who is 20 years old and has an IQ
score of 144. It wouldn’t make sense to directly compare Mike’s score of 144 to
Jessica’s score of 132. The two scores come from different distributions due to the
age difference. Assume that the distribution of IQ scores for 20-year-olds is normal
with a mean of 120 and a standard deviation of 20. Compute Mike’s standardized
score.
(c)
Relative to their respective age group, who had the higher IQ score—Jessica or
Mike?
Solution
Jessica's standard score =
Mike's standard score =
132  100
 2.
16
144  120
 12
. .
20
Thus, relative to their respective age groups, Jessica has a higher IQ score than Mike

4
Method of Z-scores for Identifying Outliers
Rules for detecting outliers:
Box-Plot: outliers are data points beyond the lower and upper fences
Z-score: outliers have a z-score that is more than 3 standard deviations from the
mean, |z| > 3.
Let's Do It! 6
At the U.S. Open Tennis Championship a statistician keeps track of every serve that a
player hits during the tournament. The statistician reported that the mean serve speed of a
particular player was 103 miles per hour (mph) and the standard deviation of the serve
speeds was 12 mph. Using the z-score approach for detecting outliers, which of the
following serve speeds would represent outliers in the distribution of the player's serve
speeds?
61 mph:
115 mph:
and 127 mph:
Let's Do It!
Suppose female bank employee believes that her salary is low as a result of sex
discrimination. To substantiate her belief, she collects information on salaries of her male
counterparts in the banking business. She finds that their salary is approximately normal
with an average of $54,000 and a standard deviation of $2000. Her salary is $47,000. Does
this information support her claim of sex discrimination?
5
How to Calculate Areas under a Normal Distribution
Finding Proportions for the Standard Normal Distribution
Finding proportions under a normal distribution involves standardization and then finding
the corresponding proportion (area) under the standard normal distribution. Let’s first work
on finding areas under a standard normal N(0, 1) distribution.
(a)
Find the area under the standard normal distribution to the left of z = 1.22. Sketch
a picture of the corresponding area and use your TI to find the area.
Solution
The area to the left of is described below. z = 1.22

area to the
left of z=1.22
is 0.8888
0
z=1.22
Z
Using TI:
(b)
Find the area under the standard normal distribution to the right of z = 1.22.
Solution
We already know the area to the left of z = 1.22, and we know that the total area
under any density to he right of 0 is equal to 0.5. So the area to the right of z =
1.22 must be 0.5 - 0.3888 = 0.1112. We could also find the area to the left of z =
-1.22. By the symmetry of the standard normal distribution, the area to the right
of z = 1.22 is equal to the area to the left of z = -1.22.
6
Let's Do It!More Standard Normal Areas
a. Find the area under the normal distribution curve to
the right of z = 1.38. Round to four decimal places.
b. Find the area under the normal distribution curve between z 1 = 2.00 and z 2 = 2.55.
Round your answer to four decimal places.
c. Use the standard normal distribution to find P(-2.25 < z < 0).
d. For a standard normal random variable, find the probability that z exceeds the value
-1.65.
e. Obtain the shaded area under the standard normal curve.
Calculate Areas under a General Normal Distribution( Non-Standard)
Find the indicated probability or percentage for the normally distributed variable.
The variable X is normally distributed. The mean is μ = 60.0 and the standard deviation is σ
= 4.0. Find P(X < 53.0).
Let's Do It!
Physical fitness association is including the mile run in its secondary-school fitness test.
The time for this event for boys in secondary school is known to possess a normal
distribution with a mean of 440 seconds and a standard deviation of 50 seconds. Find the
probability that a randomly selected boy in secondary school can run the mile in less than
325 seconds.
Let's Do It!
The weight of corn chips dispensed into a 24-ounce bag by the dispensing machine has
been identified as possessing a normal distribution with a mean of 24.5 ounces and a
standard deviation of 0.2 ounce. What proportion of the 24-ounce bags contain more than
the advertised 24 ounces of chips?
7
Let's Do It!
A bank's loan officer rates applicants for credit. The ratings are normally distributed with a
mean of 200 and a standard deviation of 50. If an applicant is randomly selected, find the
probability of a rating that is between 200 and 275.
Finding Percentile scores, top and bottom scores of a the Normal Distribution
Example 
The Top 1% of the IQ Distribution


Recall the N 100,16 model for IQ score of 12-year-olds. What IQ score must a 12-yearold have to place in the top 1% of the distribution of IQ scores? Find the percentile using
your calculator.
Again it may be helpful to draw a picture:

The area to the left is 0.99
100
?
IQ Score
Many calculators have the ability to find various percentiles of a normal distribution. The TI has a
built-in function called invNorm under the DIST menu. You must first specify the desired are to the
left, then the mean and the standard deviation for the normal distribution. The steps for finding the
99th percentile of our N(100,16) distribution are as follows:
8
Let's Do It!
a. The lifetimes of lightbulbs of a particular type are normally distributed with a mean of
235 hours and a standard deviation of 7 hours. Find the first quartile.
b. The weights of certain machine components are normally distributed with a mean of
8.83 g and a standard deviation of 0.1 g. find the 97th percentile.
c. The board of examiners that administers the real estate broker's examination in a
certain state found that the mean score on the test was 435 and the standard
deviation was 72. If the board wants to set the passing score so that only the best
10% of all applicants pass, what is the passing score? Assume that the scores are
normally distributed.
Let's Do It!
a. Find a value of the standard normal random variable z, called z0, such that P(z ≤z0)
= 0.70.
b. Find a value of the standard normal random variable z, called z0, such that
P(-z0≤ z ≤ z0) = 0.9234.
9