Download Chapter 13: Normal Distributions

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
Chapter 13: Normal Distributions
Thought Question
Numerical data can distinguish different types of writing, and sometimes even individual authors. Here are data collected by students on the percentages of words of 1 to 15 letters used
in articles in Popular Science magazine:
Length:
Proportion:
1
.036
2
.148
3
.187
4
.160
5
.125
6
.082
7
.081
Make a histogram of this distribution. Then
• Draw a smooth curve around your histogram.
• Indicate the location of the median word length.
• Indicate the location of the mean word length.
1
8
.059
9
.044
10
.036
11
.021
12
.009
13
.006
14
.004
15
.002
Density Curves
How can we represent distributions by a smooth curve?
Terminology
• Density Curve: a curve where the total area underneath it is equal to 1.
• Median of a Density Curve: the point that divides the area of the curve in half.
• Mean of a Density Curve: the point at which the curve would balance.
Histogram vs. Density Curve
Histogram:
• Shows the counts in each class by the height/area of the bars.
• Is a plot of data from a sample.
Density Curve:
• Shows the proportion of observations in any region by areas under the curve.
• Reflects the idealized shape of the population distribution.
How are the mean and median of a symmetric density curve related?
They are equal.
2
Characteristics of Normal Distributions
• Any value can occur.
• The fraction of values in the population which lie in an interval is given by the area under
the curve inside the interval.
• The total area under the curve is 1.
• The mean µ determines the center of the distribution. The curve is symmetric about its
mean.
• The standard deviation σ determines the spread of the distribution. It is the distance
from the mean to the change-of-curvature point on either side.
µ−σ
µ
µ+σ
How does increasing the mean affect the normal curve?
The center is shifted to the right.
How does increasing the standard deviation affect the normal curve?
The curve is more spread out.
3
The 68-95-99.7 Rule
µ − 3σ
µ − 2σ
µ−σ
µ
µ+σ
µ + 2σ
µ + 3σ
In Any Normal Distribution
• 68.2% of the observations fall within σ of the mean µ.
• 95.4% of the observations fall within 2σ of the mean µ.
• 99.7% of the observations fall within 3σ of the mean µ.
Example: ACT Scores
The distribution of ACT scores are approximately normally distributed with mean µ = 18 and
standard deviation σ = 6. Between what values will
68% of the scores lie? 12 and 24
95% of the scores lie? 6 and 30
99.7% of the scores lie? 0 and 36
4
Standard Scores
How can we compare values from different distributions?
Terminology
• Standard Scores: observations expressed in standard deviations above or below the
mean.
How to Calculate the Standard Score
standard score =
observation − mean
standard deviation
If the observation is x, then the standard score =
5
x−µ
.
σ
Example
Suppose X takes values from a normal distribution with mean µ = 5 and standard deviation
σ = 1.5. Calculate the corresponding standard scores Z:
(a) X > 6
Values of X
6
5
Z=
X−µ
σ
?
Values of Z
0
(b) 3.21 < X < 4.76
Values of X
5
?
Values of Z
0
6
Example: SAT and ACT Scores
The distribution of SAT scores are approximately normally distributed with mean µ = 500 and
standard deviation σ = 100. Jennie scored a 600 on the SAT math portion. Gerald scored a 21
on the ACT math portion. Who did better?
To determine this, calculate Jennie and Gerald’s standard scores.
Jennie’s standard score is 600−500
=
100
So Jennie did better than Gerald.
100
100
= 1 and Gerald’s standard score is
How many standard deviations above the mean SAT score is Jennie’s score?
1
How many standard deviations above the mean ACT score is Gerald’s score?
0.5
7
21−18
6
=
3
6
= 0.5.
Percentiles
Terminology
• cth Percentile: value such that c percent of the observations lie below it and the rest
lie above it.
What percentile is the median? 50th percentile
What percentile is the first quartile? 25th percentile
What percentile is the third quartile? 75th percentile
Example: SAT and ACT Scores
What percentile is Jennie’s SAT score?
84.13 percentile.
What percent of people scored below a 600 on the SAT math portion?
84.13 %.
What percentile is Gerald’s ACT score?
69.15 percentile.
What percent of people score below a 21 on the ACT math portion?
69.15 %.
8
Example
For a normal distribution with mean 0 and standard devation 1, find the number z such that
(a) 95% of the values lie between −z and z
0.95
−z
0
z
(b) 90% of the values lie between −z and z
0
(c) 1% of the values are greater than z
0
9
Chapter 13 Exercises
1. A company that sells annuities must base the annual payout on the distribution of the
lifetimes of the participants in the plan. Suppose the lifetimes of participants are approximately
normally distributed with µ = 77 years and σ = 7.5 years. What proportion of the plan
participants would receive payments beyond age 70?
Distribution of Participants’ Lifetimes
77
10