Download The Normal Curve

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
REVIEW PROBLEM
Residents of upstate New York are
accustomed to large amounts of snow with
snowfalls often exceeding 6 inches in one
day. In one city, such snowfalls were
recorded for two seasons and are as follows
(in inches):
8.6, 9.5, 14.1, 11.5, 7.0, 8.4, 9.0, 6.7, 21.5,
7.7, 6.8, 6.1, 8.5, 14.4, 6.1, 8.0, 9.2, 7.1
What are the mean and the population
standard deviation for this data, to the
nearest hundredth?
The mean = 9.46 inches.
Standard deviation = 3.74
Density Curves &
Normal Distribution
Some Call it a “Bell Curve”
DENSITY CURVES
Density curves give us a mathematical
model for distributions.
Density curves:
are always on or above horizontal axis
have an area of exactly 1 underneath it
Centers of Density Curves
MEDIAN—equal-areas point
MEAN—balance point
The two are equal for a
symmetric density curve.
The mean is pulled towards
the tail in a skewed curve.
In a skewed curve
Median = Equal Areas
Mean = Balance Point
NORMAL DISTRIBUTIONS
We use (mu) to represent the mean
and (sigma) to represent the standard
deviation of a normal distribution.
“Empirical Rule” 68-95-99.7 Rule
Normal Distribution
A normal distribution is a continuous, symmetrical,
bell-shaped distribution of a variable.
Characteristics of Normal Distribution
1. A normal distribution curve is bell-shaped.
2. The mean, median, and mode are equal and
are located at the center of the distribution.
3. The curve is unimodal (i.e., it has only one
mode)
4. The curve is symmetric about the mean (its
shape is the same on both sides of a vertical
line passing through the center)
5. The curve is continuous. There are no gaps
or holes. For each value of X, there is a
corresponding value of Y.
Characteristics of Normal Distribution
6. The curve never touches the x axis. No
matter how far in either direction the curve
extends, it never meets the x axis – but it
gets increasingly closer.
7. The total area under a normal distribution
curve is equal to 1.00, or 100%.
The area under the part of a normal curve that lies
within 1 standard deviation of the mean is
approximately 0.68, or 68%; within 2 standard
deviations, about 0.95, or 95%; and within 3
standard deviations, about 0.997, or 99.7%.
The Empirical Rule
(a.k.a. the “68-95-99.7 Rule”)
In a normal distribution, almost all data will
lie within 3 standard deviations of the
mean.
o About 68% of all data lies within 1 standard
deviation of the mean.
o About 95% of all data lies within 2 standard
deviations of the mean.
o About 99.7% of all data lies within 3 standard
deviations of the mean.
Why Do We Need It?
The Empirical Rule is most often used in
statistics for forecasting or predicting final
outcomes.
After a standard deviation is calculated, and
before exact data can be collected, the Empirical
Rule can be used to estimate impending data.
The probability based on the Empirical Rule can
be used if gathering appropriate data may be
time consuming, or even impossible to obtain.
Tips for Using the Empirical Rule
Before applying the Empirical Rule it is a
good idea to identify the data being described
and the value of the mean and standard
deviation.
Sketch a graph summarizing the information
provided by the empirical rule and identify the
percentages for each region of the graph
(+/- 1, +/- 2, +/- 3)
Remember that data must be normally
distributed for the Empirical Rule to apply.
The Empirical Rule
for a Normal Distribution
STEPS FOR FINDING
NORMAL PROPORTION
1. State the problem in terms of
x, your variable of concern.
2. Standardize the variable.
3. Draw a picture & shade area
of concern.
4. Use the table to determine
proportion.
From Proportion to Real
Value
GOING BACKWARDS!
1.
2.
3.
4.
State the problem.
Use the table.
Draw & shade a picture.
Unstandardize to obtain your x.
The amount of mustard dispensed from
a machine at The Hotdog Emporium
is normally distributed with a mean
of 0.9 ounce and a standard
deviation of 0.1 ounce. If the
machine is used 500 times,
approximately how many times will it
be expected to dispense 1 or more
ounces of mustard.
Choose:
5
16
80
100
The mean is 0.9 and the standard deviation is 0.1. If one standard deviation is
added to the mean, the result is 1.0 ounce. Therefore, dispensing 1 or more
ounces falls into the category above one standard deviation to the right of the
mean. Using the Empirical Rule, 16% of data falls at or above 1 standard
deviation.
16% x 500 = 80 times to dispense one or more ounces of mustard.
A machine is used to fill soda bottles. The amount of soda dispensed
into each bottle varies slightly. Suppose the amount of soda dispensed
into the bottles is normally distributed. If at least 99% of the bottles
must have between 585 and 595 milliliters of soda, find the greatest
standard deviation, to the nearest hundredth, that can be allowed.
The 99% implies a distribution within 3 standard deviations of the mean. The
difference from 585 milliliters to 595 milliliters is 10 milliliters. Symmetrically
divided, there are 5 milliliters used to create 3 standard deviations on one
side of the mean. Dividing 5 by 3, we get the standard deviation to be 1.67
milliliters, to the nearest hundredth.
Battery lifetime is normally distributed for large
samples. The mean lifetime is 500 days and
the standard deviation is 61 days. To the
nearest percent, what percent of batteries have
lifetimes less than 439 days?
Subtracting, we see that 1 s.d. below is 439 days, an exact match to
our question and an indication that the Empirical Rule can be used to
find the answer. The question is asking what percent of a distribution is
beyond one standard deviation to the left of the mean.
Answer: 16%
A shoe manufacturer collected data
regarding men's shoe sizes and found
that the distribution of sizes exactly fits
the normal curve. If the mean shoe size
is 11 and the standard deviation is 1.5,
find:
a. the probability that a man's shoe size
is greater than or equal to 11.
b. the probability that a man's shoe size
is greater than or equal to 14.
a. 50% In a normal distribution, the mean divides the data into two equal
areas. Since 11 is the mean, 50% of the data is above 11 and 50% is below 11.
b. 14 is exactly two standard deviations above the mean. Using the Empirical Rule
we see that 2.5% will fall above two standard deviations. Probability is 0.025.
Five hundred values are normally distributed with a mean of 125 and a standard
deviation of 10.
a. What percent of the values lies in the interval 115 - 135,
to the nearest percent?
b. What interval about the mean includes 95% of the data?
a. What percent of the values is in the interval 115 - 135?
mean + one standard deviation = 135
mean - one standard deviation = 115
Percent within one standard deviation of the mean = 68%
b. 2 standard deviations about the mean for a total interval size
of 40, with the mean in the center.
mean + 2 standard deviations = 145
mean - 2 standard deviations = 105
Interval: [105,145]
Example:
Estimating with the Empirical Rule
You have purchased fluorescent light
bulbs for your home. The average bulb life
is 500 hours with a standard deviation of
24. The data is normally distributed.
One of your bulbs burns out at 450 hours.
Would you send the bulb back for a
refund?
Problem: You have purchased fluorescent light bulbs for your home. The
average bulb life is 500 hours with a standard deviation of 24. The data is
normally distributed. One of your bulbs burns out at 450 hours. Would you send
the bulb back for a refund?
Solution:
According to the Empirical Rule:
o 68% of the light bulbs should last between
500 ± 24 or between 476 to 524 hours.
o 95% of the light bulbs should last between
500 ± 2(24) or between 452 to 548 hours.
o 99.7% of the light bulbs should last between
500 ± 3(24) or between 428 to 572 hours.
If the light bulb only lasted 450 hours, I
would consider it a defective bulb. Less
than 2.5% should last less than 452 hours.
Your Turn
The scores for all high school seniors taking the
verbal section of the Scholastic Aptitude Test
(SAT) in a particular year had a mean of 490 and
a standard deviation of 100. The distribution of
SAT scores is normal.
1. What percentage of seniors scored between 390
and 590 on this SAT test?
2. One student scored 795 on this test. How did this
student do compared to the rest of the scores?
3. A rather exclusive university only admits students
who were among the highest 16% of the scores on
this test. What score would a student need on this
test to be qualified for admittance to this
university?
1. What percentage of seniors scored
between 390 and 590 on this SAT test?
The mean = 490 and the standard deviation = 100. So,
we can draw the Normal Curve to depict the data.
Calculate the difference
between the mean and
the test scores:
|390-490| = 100  1
s.d.
|590-490| = 100  1
s.d.
So, about 68% of
seniors would score
between 390 and 590
on the SAT test.
2. One student scored 795 on this test. How did
this student do compared to the other scores?
Calculate the difference between the mean and the
student’s test score:795-390 = 405
This student’s score is more than 3 std. deviations from
the mean
(3 standard deviations = 3 * 100)
Since about 99.7% of seniors
would score between 190 and
790 on the SAT test, this is an
exceptionally high score.
Only 0.15% of students would
have scores above 790. This
is an example of using the
Empirical Rule to estimate an
answer.
3 . A rather exclusive university only admits students
who were among the highest 16% of the scores on this
test. What score would a student need on this test to be
qualified for admittance to this university?
About 68% of the scores are between 390 and 590, so
this leaves 32% of the scores outside this interval.
Since a bell-shaped curve is symmetrical, one-half of the
scores (16%) are on each end of the distribution.
• 16% of the students scored above 590 on this SAT test.
So, to qualify for admittance to
this university, a student
would need to score 590 or
above on the SAT test.
What if you want to be more precise?
The Empirical Rule works well if we are dealing
with values that are exactly 1, 2, or 3 standard
deviations away from the mean.
It also works well if we simply want to estimate
the probability or percentage for a particular
value.
However, we can be more precise using
NORMAL CALCULATIONS which involves
standardizing values to obtain z-scores.
STANDARD NORMAL
DISTRIBUTION
Standardizing yields z-scores.
Use this equation to find the
standardized value of x:
z
x x
Standard Normal Curve
x has normal distribution of N( , )
z has a standard normal distribution N(0, 1)
The scores for all high school seniors taking the verbal
section of the Scholastic Aptitude Test (SAT) in a
particular year had a mean of 490 and a standard
deviation of 100. The distribution of SAT scores is
normal.
One student scored 795 on this test. How did this student
do compared to the rest of the scores?
We estimated before that only 0.15% of students would
score above 790, but what percent would score above
795?
First, we need to standardize the score of 795.
795 490
z
100
305
100
3.05
In doing so, we see that this student scored 3.05 standard
deviations above the mean of 490.
The Standard Normal Distribution
This is part of a standard Normal table that you should be able to read. Since our
particular problem deals with a z-score of 3.05, we should go to the last row (3.0) and
over to the appropriate column (0.05). We should see that they meet at 0.9989. This is
the proportion TO THE LEFT of our score (of 795). Our question really wants us to
consider the proportion to the right. Since the total area under the curve is 1, we can
subtract 0.9989 from 1 to yield 0.0011. That means that only 0.11% (compared to
estimate of 0.15% from Empirical Rule) of all students scored higher than a 795.
Z
0.00
0.01
0.02
0.03
0.04
0.05
0.06
0.07
0.08
0.09
2.0
0.9772
0.9778
0.9783
0.9788
0.9793
0.9798
0.9803
0.9808
0.9812
0.9817
2.1
0.9821
0.9826
0.9830
0.9834
0.9838
0.9842
0.9846
0.9850
0.9854
0.9857
2.2
0.9861
0.9864
0.9868
0.9871
0.9875
0.9878
0.9881
0.9884
0.9887
0.9890
2.3
0.9893
0.9896
0.9898
0.9901
0.9904
0.9906
0.9909
0.9911
0.9913
0.9916
2.4
0.9918
0.9920
0.9922
0.9925
0.9927
0.9929
0.9931
0.9932
0.9934
0.9936
2.5
0.9938
0.9940
0.9941
0.9943
0.9945
0.9946
0.9948
0.9949
0.9951
0.9952
2.6
0.9953
0.9955
0.9956
0.9957
0.9959
0.9960
0.9961
0.9962
0.9963
0.9964
2.7
0.9965
0.9966
0.9967
0.9968
0.9969
0.9970
0.9971
0.9972
0.9973
0.9974
2.8
0.9974
0.9975
0.9976
0.9977
0.9977
0.9978
0.9979
0.9979
0.9980
0.9981
2.9
0.9981
0.9982
0.9982
0.9983
0.9984
0.9984
0.9985
0.9985
0.9986
0.9986
3.0
0.9987
0.9987
0.9987
0.9988
0.9988
0.9989
0.9989
0.9989
0.9990
0.9990
A group of 625 students has a mean age
of 15.8 years with a standard deviation of
0.6 years. The ages are normally
distributed. How many students are
younger than 16.2 years? Express
answer to the nearest student?
16.2 15.8
z
0.6
0.4
0.6
0.6667
This z-score yields a percentage less than 16.2 as
74.86%. When we convert that to a number of students
we get 466.75 or approximately 467.
Z
0.00
0.01
0.02
0.03
0.04
0.05
0.06
0.07
0.08
0.09
0.0
0.5000
0.5040
0.5080
0.5120
0.5160
0.5199
0.5239
0.5279
0.5319
0.5359
0.1
0.5398
0.5438
0.5478
0.5517
0.5557
0.5596
0.5636
0.5675
0.5714
0.5753
0.2
0.5793
0.5832
0.5871
0.5910
0.5948
0.5987
0.6026
0.6064
0.6103
0.6141
0.3
0.6179
0.6217
0.6255
0.6293
0.6331
0.6368
0.6406
0.6443
0.6480
0.6517
0.4
0.6554
0.6591
0.6628
0.6664
0.6700
0.6736
0.6772
0.6808
0.6844
0.6879
0.5
0.6915
0.6950
0.6985
0.7019
0.7054
0.7088
0.7123
0.7157
0.7190
0.7224
0.6
0.7257
0.7291
0.7324
0.7357
0.7389
0.7422
0.7454
0.7486
0.7517
0.7549
0.7
0.7580
0.7611
0.7642
0.7673
0.7704
0.7734
0.7764
0.7794
0.7823
0.7852
Professor Halen has 184 students in his
college mathematics lecture class. The scores
on the midterm exam are normally distributed
with a mean of 72.3 and a standard deviation
of 8.9. How many students in the class can be
expected to receive a score between 82 and
90? Express answer to the nearest student.
90 72.3
z
8.9
82 72.3
z
8.9
17.7
1.9888
8.9
9.7
1.0899
8.9
We get 0.9767 and 0.8621 as the probability associated with these 2 z-scores.
Z
0.00
0.01
0.02
0.03
0.04
0.05
0.06
0.07
0.08
0.09
0.9
0.8159
0.8186
0.8212
0.8238
0.8264
0.8289
0.8315
0.8340
0.8365
0.8389
1.0
0.8413
0.8438
0.8461
0.8485
0.8508
0.8531
0.8554
0.8577
0.8599
0.8621
1.1
0.8643
0.8665
0.8686
0.8708
0.8729
0.8749
0.8770
0.8790
0.8810
0.8830
1.2
0.8849
0.8869
0.8888
0.8907
0.8925
0.8944
0.8962
0.8980
0.8997
0.9015
1.3
0.9032
0.9049
0.9066
0.9082
0.9099
0.9115
0.9131
0.9147
0.9162
0.9177
1.4
0.9192
0.9207
0.9222
0.9236
0.9251
0.9265
0.9279
0.9292
0.9306
0.9319
1.5
0.9332
0.9345
0.9357
0.9370
0.9382
0.9394
0.9406
0.9418
0.9429
0.9441
1.6
0.9452
0.9463
0.9474
0.9484
0.9495
0.9505
0.9515
0.9525
0.9535
0.9545
1.7
0.9554
0.9564
0.9573
0.9582
0.9591
0.9599
0.9608
0.9616
0.9625
0.9633
1.8
0.9641
0.9649
0.9656
0.9664
0.9671
0.9678
0.9686
0.9693
0.9699
0.9706
1.9
0.9713
0.9719
0.9726
0.9732
0.9738
0.9744
0.9750
0.9756
0.9761
0.9767
Professor Halen has 184 students in his college mathematics lecture
class. The scores on the midterm exam are normally distributed with a
mean of 72.3 and a standard deviation of 8.9. How many students in
the class can be expected to receive a score between 82 and
90? Express answer to the nearest student.
We get 0.9767 and 0.8621 as the probability associated with these 2 z-scores.
So, we can subtract 0.9767-0.8621 to yield a
proportion of 0.1146.
This means that 11.46% of all students scored
between an 82 and 90.
Therefore, roughly 21 (21.0864) of the 184
students scored between an 82 and 90.
Aliens have come to visit Earth. We quickly establish that they
are shorter than humans. In fact, their heights vary normally
with a mean of 50 cm and standard deviation of 10 cm.
1. Aliens place emphasis on height. For an alien to be a “leader” it would
need to be at least 75 cm tall. What proportion of aliens would qualify?
2. Short aliens are often sent to the front lines in battle because they are
smaller targets. All aliens that are under 37 cm tall can expect to be called
into duty. What percent of aliens fall into this category?
3. What percent of all aliens are between 41 and 71 cm tall?
Aliens have come to visit Earth. We quickly establish that they
are shorter than humans. In fact, their heights vary normally
with a mean of 50 cm and standard deviation of 10 cm.
1. Aliens place emphasis on height. For an alien to be a “leader” it would
need to be at least 75 cm tall. What proportion of aliens would qualify?
75 50
z
10
25
10
2.5
This yields a proportion of 0.9938
to the left (or 0.0062 to the right).
So, only a proportion
of 0.0062 of all aliens
would qualify for a
leadership positions.
Aliens have come to visit Earth. We quickly establish that they
are shorter than humans. In fact, their heights vary normally
with a mean of 50 cm and standard deviation of 10 cm.
2. Short aliens are often sent to the front lines in battle because they are
smaller targets. All aliens that are under 37 cm tall can expect to be called
into duty. What percent of aliens fall into this category?
37 50
10
13
1.3
10
This yields a proportion of
0.0968 to the left.
z
Therefore, 9.68% of
all aliens should be
expected to serve
their duty on the front
lines of battle.
Aliens have come to visit Earth. We quickly establish that they
are shorter than humans. In fact, their heights vary normally
with a mean of 50 cm and standard deviation of 10 cm.
3. What percent of all aliens are between 41 and 71 cm tall?
z
71 50
10
21
2.1
10
This yields proportions to the left of 0.9821 and 0.1841.
z
41 50
10
9
10
0.9
So, when we subtract these
proportions, we find that the
proportion of aliens between
41 and 71 cm tall is 0.7980.
In other words, approximately
79.80% of all aliens are
between 41 and 71 cm tall.