Download Hypothesis Testing of Single Mean and Single Proportion

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
Connexions module: m17002
1
Hypothesis Testing of Single Mean
and Single Proportion: Assumptions
∗
Susan Dean
Barbara Illowsky, Ph.D.
This work is produced by The Connexions Project and licensed under the
Creative Commons Attribution License †
hypothesis test of a single population mean µ using a Student-t distribu-
When you perform a
tion
(often called a t-test), there are fundamental assumptions that need to be met in order for the test
to work properly.
is approximately
Your data should be a
normally distributed.
simple random sample that comes from a population that
You use the sample standard deviation to approximate the
population standard deviation. (Note that if the sample size is larger than 30, a t-test will work even if the
population is not approximately normally distributed).
When you perform a
hypothesis test of a single population mean
µ
using a normal distribution
(often called a z-test), you take a simple random sample from the population. The population you are testing
is normally distributed or your sample size is larger than 30 or both. You know the value of the population
standard deviation.
When you perform a
hypothesis test of a single population proportion p, you take a simple random
binomial distribution which are there
sample from the population. You must meet the conditions for a
are a certain number
n
of independent trials, the outcomes of any trial are success or failure, and each trial
has the same probability of a success
p.
The shape of the binomial distribution needs to be similar to the
shape of the normal distribution. To ensure this, the quantities
(np
> 5 and nq > 5).
np
and
nq
must both be greater than ve
Then the binomial distribution of sample (estimated) proportion can be approximated
by the normal distribution with
µ=p
and
σ=
q
p·q
n . Remember that
q = 1 − p.
Glossary
Denition 1: Binomial Distribution
A discrete random variable (RV) which arises from Bernoulli trials. There are a xed number,
n,
of independent trials. Independent means that the result of any trial (for example, trial 1) does
not aect the results of the following trials, and all trials are conducted under the same conditions.
X is dened as the number of successes in n trials.
√
npq. The
X∼B (n, p). The mean is µ = np and the standard
deviation is σ =
x n−x
n
exactly x successes in n trials is P (X = x) =
.
x p q
Under these circumstances the binomial RV
The notation is:
probability of
Denition 2: Condence Interval
An interval estimate for an unknown population parameter. This depends on:
∗ Version
1.8: Jan 4, 2011 7:10 pm US/Central
† http://creativecommons.org/licenses/by/3.0/
Source URL: http://cnx.org/content/col10522/latest/
Saylor URL: http://www.saylor.org/courses/ma121/
http://cnx.org/content/m17002/1.8/
Attributed to: Barbara Illowsky and Susan Dean
Saylor.org
Page 1 of 2
Connexions module: m17002
•
•
•
2
The desired condence level.
What is known for the distribution information (for example, known standard deviation).
The sample and its size.
Denition 3: Condence Level
The percent expression for the probability that the condence interval contains the true population
parameter. That is, for example, if CL=90%, then in 90 out of 100 samples the interval estimate
will enclose the true population parameter.
Denition 4: Normal Distribution
√1 e−(x−µ)2 /2σ 2 , where
σ 2π
is its standard deviation. Notation: X ∼ N (µ, σ). If µ =
A continuous random variable (RV) with pdf f(x)
the distribution and
σ
=
standard normal distribution.
Denition 5: Standard Deviation
µ is the mean of
0 and σ = 1, the
RV is called the
A number that is equal to the square root of the variance and measures how far data values are from
their mean. Notation: s for sample standard deviation and
σ
for population standard deviation.
Denition 6: Student-t Distribution
Investigated and reported by William S. Gossett in 1908 and published under the pseudonym
Student. The major characteristics of the random variable (RV) are:
•
•
It is continuous and assumes any real values.
The pdf is symmetrical about its mean of zero. However, the graph is more spread out and
atter at the apex than the normal distribution.
•
•
It approaches the standard normal distribution as n gets larger.
There is a "family" of t distributions: every representative of the family is completely dened
by the number of degrees of freedom which is one less than the number of data.
Source URL: http://cnx.org/content/col10522/latest/
Saylor URL: http://www.saylor.org/courses/ma121/
http://cnx.org/content/m17002/1.8/
Attributed to: Barbara Illowsky and Susan Dean
Saylor.org
Page 2 of 2