Download Lecture 3 - Department of Mathematics and Statistics

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
MATH 2560 C F03
Elementary Statistics I
LECTURE 3: The Normal Distributions.
1
Outline.
⇒ density curves;
⇒ center & spread for density curve;
⇒ normal distributions;
⇒ the 68 − 95 − 99.7 rule;
2
Density Curves
We have already known about graphical (Lecture 1) and numerical (Lecture 2) tools for describing distributions. We also know how to explore data
on a single quantitative variable.
Sometimes the overall pattern of a distribution can often be described
compactly by a smooth curve.
The smooth curve drawn through the histogram (Figure 1.19) gives a
compact discription of the overall pattern. We draw a smooth curve through
a histogram to describe the shape of the distribution without the lumpiness
of the histogram.
The curve is a mathematical model (idealized mathematical description) for the distribution.
⇒Area of the Bars in Histograms=Proportions of the
Observations
⇒ To make such curves easier to compare for different distributions, we adjust the sacle so that the total area under the curve is
exactly 1.
Areas under the curve then represent proportions of the observations.
⇒ Area=Relative Frequency.
⇒ The curve is then a Density Curve
Figure 1.20 (a) and Figure 1.20 (b) are copies of Figure 1.19 with the
leftmost bars shaded.
The area of the shaded bars in Figure 1.20(a) represents the students
with vocabulary scores 6.0 or lower. There are 287 such students, who make
up the proportion 287/947 = 0.303 of all Gary seventh graders.
The shaded area in Figure 1.20 (b) under the smooth curve represents
the students with vocabulary scores 6.0 or lower. This area is 0.293, only
0.010 away from the histogram result.
Density Curve
A density curve is a curve:
1. Is always on or above the horizontal axis;
2. Has area exactly 1 underneath it.
3. The area under the curve and above any range of values
is the relative frequency of observations that fall in that range.
Remarks
1. The density curve in Figures 1.19 and 1.20 is Normal Curve.
2. Density curves, like distributions, come in many
shapes.
Figure 1.21 (a) shows symmetric density curve.
Figure 1.21 (b) shows skewed density curve.
3
3.1
Measuring Center and Spread for Density
Curve
Skewed Distributions
⇒ MODE of distribution described by density curve is a peak
point of the curve, the location where the curve is highest.
⇒ MEDIAN of distribution
is the point with half the total area on each side.
⇒ QUARTILES
divide the area under the curve into quarters as accurately as possible by eye.
⇒ IQR
is the distance between the first and third quartiles.
3.2
Symmetric Distribution
⇒ MEAN
is the point at which the curve would balance if it were made out of solid
material.
(See Figure 1.22)
⇒ STANDARD DEVIATION
can not be located by eye on most density curves.
Remarks.
1. The median and mean are the same for a symmetric
density curve.
They both lie at the center of the curve.
2. The mean of the of the skewed curve is pulled away from the median
in the direction of the long tail.
3. The density curve is an idealized mathematical model for a distribution
of data.
4. The usual notation for the mean of an idealized distribution is µ.
The usual notation for the standard deviation is σ.
5. Abbreviation for the normal distribution with mean µ and standard
deviation σ is N (µ, σ).
4
Normal Distributions (ND)
⇒ Normal Curves: symmetric, unimodal, and bell-shaped.
The Noraml Curves describe Normal Distributions.
Examples of Normal Curves: Figures 1.20 (a), (b) and 1.21 (a) are
the examples of normal curves.
⇒ exact normal curve is determined completely by mean µ and
standard deviation σ.
⇒ µ is located at the center of the symmetric curve and is the same as
the median.
Changing µ without changing σ moves the normal curve along the horizontal
axis without changing the spread.
⇒ σ controls the spread of a normal curve.
Figure 1.23 shows two normal curves with different values of σ.
The height of the normal density curve at any point z is given by
1 z−µ 2
1
√ e− 2 ( σ ) .
σ 2π
Here: −∞ < µ < +∞ and σ > 0. We can see that the expression of the
normal density curve is completely determined by µ and σ.
Why are the Noramal Distributions (ND)
Important in Statistics
⇒ ND are good descriptions for some distributions of real data;
⇒ ND are good approximations to the results of many kinds of
chance outcomes;
⇒ ND are bases for many statistical inference procedures;
⇐ However, most income distributions are skewed to the right and so are
not normal.
5
The 68 − 95 − 99.7 Rule
All the normal curves have common properties.
The 68 − 95 − 99.7 Rule
In the Normal Distribution with mean µ and standard deviation σ:
⇒ 68 persents of the observations fall within σ of the mean µ,
i.e., in [µ − σ, µ + σ];
⇒ 95 percents of the observations fall within 2σ of µ,
i.e., in [µ − 2σ, µ + 2σ];
⇒ 99.7 percents of the observations fall within 3σ of µ,
i.e., in [µ − 3σ, µ + 3σ].
Figure 1.24 illustrate the 68 − 95 − 99.7 rule. Here: µ = 0 and σ = 1.
68 percents are in [−1, 1], 95 percents are in [−2, 2], and 99.7 percents are in
[−3, 3].
6
Example 1.22
We consider an example to demonstrate the 68 − 95 − 99.7 rule.
Distribution of Heights of Young Women
The distribution of heights of young women aged 18 to 24 is approvimately
normal with mean µ = 64.5 inches and standard deviation σ = 2.5 inches.
Figure 1.25 shows what the 68 − 95 − 99.7 rule says about this distribution.
⇒ 68 rule: 68 percents of heights of the young women fall in the interval:
[µ − σ, µ + σ] = [64.5 − 2.5, 64.5 + 2.5] = [62, 67];
⇒ 95 rule: 95 percents of heights of the young women fall in the interval:
[µ − 2σ, µ + 2σ] = [64.5 − 5, 64.5 + 5] = [59.5, 69.5];
⇒ 99.7 rule: 99.7 percents of heights of the young women fall in the
interval:
[µ − 3σ, µ + 3σ] = [64.5 − 7.5, 64.5 + 7.5] = [57, 72].
7
Summary
The mean µ (balance point), the median (equal-areas point), and the quartiles can be approximately located by eye on a density curve.
The standard deviation σ cannot be located by eye on most density
curve.
The mean and median are equal for symmetric density curves, but the
mean of a skewed curve is located farther toward the long tail than is the
meadian.
The normal distributions N (µ, σ) are described by bell-shaped, symmetric
unimodal density curves.
The mean µ and standard deviation σ completely specify the normal
distribution N (µ, σ).
The mean µ is the center of symmetry and σ is the distance from µ to
the change-of-curvature points on either side.