Download standard normal distribution

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
CHAPTER 6
The Normal Distribution
© Copyright McGraw-Hill 2004
6-1
Objectives

Identify distributions as symmetrical or
skewed.

Identify the properties of the normal
distribution.

Find the area under the standard normal
distribution, given various z values.

Find the probabilities for a normally
distributed variable by transforming it into a
standard normal variable.
© Copyright McGraw-Hill 2004
6-2
Objectives (cont’d.)

Find specific data values for given percentages
using the standard normal distribution.

Use the central limit theorem to solve
problems involving sample means for large
samples.

Use the normal approximation to compute
probabilities for a binomial variable.
© Copyright McGraw-Hill 2004
6-3
Introduction

Many continuous variables have distributions
that are bell-shaped and are called
approximately normally distributed variables.

A normal distribution is also known as the
bell curve or the Gaussian distribution.
© Copyright McGraw-Hill 2004
6-4
Normal and Skewed Distributions

The normal distribution is a continuous, bellshaped distribution of a variable.

If the data values are evenly distributed about
the mean, the distribution is said to be
symmetrical.

If the majority of the data values fall to the left
or right of the mean, the distribution is said
to be skewed.
© Copyright McGraw-Hill 2004
6-5
Left Skewed Distributions

When the majority of the data values fall to
the right of the mean, the distribution is said
to be negatively or left skewed. The mean is to
the left of the median, and the mean and the
median are to the left of the mode.
© Copyright McGraw-Hill 2004
6-6
Right Skewed Distributions

When the majority of the data values fall to
the left of the mean, the distribution is said to
be positively or right skewed. The mean falls
to the right of the median and both the mean
and the median fall to the right of the mode.
© Copyright McGraw-Hill 2004
6-7
Equation for a Normal Distribution

The mathematical equation for the normal
distribution is:
y
e
 ( X   )2 (2 2 )
 2
where
e  2.718
  3.14
 = population mean
 = population
standard deviation
© Copyright McGraw-Hill 2004
6-8
Properties of the Normal Distribution

The shape and position of the normal
distribution curve depend on two parameters,
the mean and the standard deviation.

Each normally distributed variable has its
own normal distribution curve, which
depends on the values of the variable’s mean
and standard deviation.
© Copyright McGraw-Hill 2004
6-9
Normal Distribution Properties

The normal distribution curve is bell-shaped.

The mean, median, and mode are equal and
located at the center of the distribution.

The normal distribution curve is unimodal
(i.e., it has only one mode).

The curve is symmetrical about the mean,
which is equivalent to saying that its shape is
the same on both sides of a vertical line
passing through the center.
© Copyright McGraw-Hill 2004
6-10
Normal Distribution Properties (cont’d.)

The curve is continuous—i.e., there are no
gaps or holes. For each value of X, here is a
corresponding value of Y.

The curve never touches the x axis.
Theoretically, no matter how far in either
direction the curve extends, it never meets the
x axis—but it gets increasingly closer.
© Copyright McGraw-Hill 2004
6-11
Normal Distribution Properties (cont’d.)

The total area under the normal distribution
curve is equal to 1.00 or 100%.

The area under the normal curve that lies
within one standard deviation of the mean is
approximately 0.68, or 68%; within two
standard deviations, about 0.95, or 95%; and
within three standard deviations, about 0.997
or 99.7%.
© Copyright McGraw-Hill 2004
6-12
Standard Normal Distribution

Since each normally distributed variable has
its own mean and standard deviation, the
shape and location of these curves will vary.
In practical applications, one would have to
have a table of areas under the curve for each
variable. To simplify this, statisticians use the
standard normal distribution.

The standard normal distribution is a normal
distribution with a mean of 0 and a standard
deviation of 1.
© Copyright McGraw-Hill 2004
6-13
z Values

The z value is the number of standard
deviations that a particular X value is away
from the mean. The formula for finding the z
value is:
value  mean
X 
z
or z 
standard deviation

© Copyright McGraw-Hill 2004
6-14
Area Between 0 and z

To find the area between 0 and any z value:
Look up the z value in the table.
0
z
© Copyright McGraw-Hill 2004
6-15
Area in Any Tail

Look up the z value to get the area.

Subtract the area from 0.5000.
–z
0
© Copyright McGraw-Hill 2004
6-16
Area Between Two z Values

Look up both z values to get the areas.

Subtract the smaller area from the larger
area.
0
z1 z2
© Copyright McGraw-Hill 2004
6-17
Area Between z Values—Opposite Sides

Look up both z values to get the areas.

Add the areas.
–z1
0
z2
© Copyright McGraw-Hill 2004
6-18
Area To the Left of Any z Value

Look up the z value to get the area.

Add 0.5000 to the area.
0
z
© Copyright McGraw-Hill 2004
6-19
Area To the Right of Any z Value

Look up the z value in the table to get the
area.

Add 0.5000 to the area.
-z
0
© Copyright McGraw-Hill 2004
6-20
Area Under the Curve

The area under the curve is more important
than the frequencies because the area
corresponds to the probability!
© Copyright McGraw-Hill 2004
6-21
Calculating the Value of X

When one must find the value of X, the
following formula can be used:
X  z   
© Copyright McGraw-Hill 2004
6-22
Distribution of Sample Means

A sampling distribution of sample means is a
distribution obtained by using the means
computed from random samples of a specific
size taken from a population.

Sampling error is the difference between the
sample measure and the corresponding
population measure due to the fact that the
sample is not a perfect representation of the
population.
© Copyright McGraw-Hill 2004
6-23
Properties of Distribution of Sample Means

The mean of the sample means will be the
same as the population mean.

The standard deviation of the sample means
will be smaller than the standard deviation of
the population, and will be equal to the
population standard deviation divided by the
square root of the sample size.
© Copyright McGraw-Hill 2004
6-24
The Central Limit Theorem

As the sample size n increases, the shape of
the distribution of the sample means taken
with replacement from a population with
mean  and standard deviation  will
approach a normal distribution.
© Copyright McGraw-Hill 2004
6-25
Central Limit Theorem (cont’d.)

If all possible samples of size n are taken with
replacement from the same population, the
mean of the sample means equals the
population mean or:  X   .

The standard deviation of the sample means
equals:    and is called the standard
X
n
error of the mean.
© Copyright McGraw-Hill 2004
6-26
Central Limit Theorem (cont’d.)

The central limit theorem can be used to
answer questions about sample means in the
same manner that the normal distribution
can be used to answer questions about
individual values.

A new formula must be used for the z values:
X 
z
 n
© Copyright McGraw-Hill 2004
6-27
Finite Population Correction Factor

The formula for standard error of the mean is
accurate when the samples are drawn with
replacement or are drawn without
replacement from a very large or infinite
population.

A correction factor is necessary for computing
the standard error of the mean for samples
drawn without replacement from a finite
population.
© Copyright McGraw-Hill 2004
6-28
Finite Population Correction Factor

The correction factor is computed using the
following formula:
N n
N 1
where N is the population size and n is the
sample size.
© Copyright McGraw-Hill 2004
6-29
Correction Factor Applied to Standard Error

The standard error of the mean must be
multiplied by the correction factor to adjust it
for large samples taken from a small
population.

N n
X 

n
N 1
© Copyright McGraw-Hill 2004
6-30
Correction Factor Applied to z Value

The standard error for the mean must be
adjusted when it is included in the formula
for calculating the z values.
X 
z

N n

n
N 1
© Copyright McGraw-Hill 2004
6-31
A Correction for Continuity

A correction for continuity is a correction
employed when a continuous distribution is
used to approximate a discrete distribution.
© Copyright McGraw-Hill 2004
6-32
Characteristics of a Binomial Distribution

There must be a fixed number of trials.

The outcome of each trial must be
independent.

Each experiment can have only two outcomes
or be reduced to two outcomes.

The probability of a success must remain the
same for each trial.
© Copyright McGraw-Hill 2004
6-33
Normal Approximation to Binomial Distribution
Binomial
Normal
When finding
Use
P( X  a )
P(a  0.5  X  a  0.5)
P( X  a )
P( X  a  0.5)
P( X  a )
P( X  a  0.5)
P( X  a )
P( X  a  0.5)
P( X  a )
P( X  a  0.5)
© Copyright McGraw-Hill 2004
6-34
Procedure for Normal Approximation

Step 1
Check to see whether the normal
approximation can be used.

Step 2
Find the mean  and the standard
deviation .

Step 3
Write the problem in probability
notation, using X.
© Copyright McGraw-Hill 2004
6-35
Procedure for Normal Approximation (cont’d.)

Step 4
Rewrite the problem using the
continuity correction factor, and
show the corresponding area under
the normal distribution.

Step 5
Find the corresponding z values.

Step 6
Find the solution.
© Copyright McGraw-Hill 2004
6-36
Summary

The normal distribution can be used to
describe a variety of variables, such as
heights, weights, and temperatures.

The normal distribution is bell-shaped,
unimodal, symmetric, and continuous; its
mean, median, and mode are equal.

Mathematicians use the standard normal
distribution which has a mean of 0 and a
standard deviation of 1.
© Copyright McGraw-Hill 2004
6-37
Summary (cont’d.)

The normal distribution can be used to
describe a sampling distribution of sample
means.

These samples must be of the same size and
randomly selected with replacement from the
population.

The central limit theorem states that as the
size of the samples increases, the distribution
of sample means will be approximately
normal.
© Copyright McGraw-Hill 2004
6-38
Summary (cont’d.)

The normal distribution can be used to
approximate other distributions, such as the
binomial distribution.

For the normal distribution to be used as an
approximation to the binomial distribution,
the conditions np  5 and nq  5 must be met.

A correction for continuity may be used for
more accurate results.
© Copyright McGraw-Hill 2004
6-39
Conclusions

The normal distribution can be used to
approximate other distributions to simplify
the data analysis for a variety of applications.
© Copyright McGraw-Hill 2004
6-40