Download Chp 10 Ppt - Wylie ISD

Document related concepts

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Transcript
Chapter 10
Hypothesis Testing Using
a Single Sample
Sharing prescription drugs with others can
be dangerous. Is this a common
occurrence among teens?
OR
How do we answers questions
like these using sample data?
The National Association of Colleges and
In Chapter
9, wethat
usedthe
sample
datastarting
to
Employers
stated
average
In this
chapter,
we
will
use
sample
data
estimate
the value
of an unknown
salary
for
students
graduating
with
a bachelor’s
to test some
claim
or
hypothesis
about
population characteristic.
degree
in 2010characteristic
is $48,351. Istothis
the
population
seetrue
if for
your college?it is plausible.
To do this, we use a test of hypotheses
or test procedure.
What is a test of hypotheses?
A test of hypotheses is a method that
Is it one of the
uses sample data to decide
between
two
values of
the
competing claims (hypotheses)
about the
sample statistic
population characteristic.
that are likely to
occur?
Is theIsvalue
of that
the sample
statistic . . .
it one
isn’t likely
to
– a random
occurrence
due to natural
occur?
variation?
OR
– a value that would
be considered
surprising?
Hypothesis statements:
YouHare
usually
The null hypothesis, denoted by
0, is a
trying to
claim about a population characteristic
determine if
that is initially assumed to be true.
this claim is
The hypothesis statements are ALWAYS
about the population believable.
– NEVER about a sample!
The alternative hypothesis, denoted by
Ha, is the competing claim.
To determine what the alternative
hypothesis should be, you need to keep the
research objectives in mind.
Let’s consider a murder trial . . .
To determine which hypothesis is
What
is the
null
hypothesis?
correct,
the
jury
will listen to the You are trying to
is what you
if the
So we
will make
one This
ofdetermine
two
evidence. Only
if there
is “evidence
is true
Hbeyond
is innocent
evidence
reasonable decisions:
doubt”
would assume
0: the adefendant
before
you begin.
the null hypothesis be rejected in
supports
this
favor of the alternative hypothesis.
claim.
• Reject the null hypothesis
What is
the
alternative
hypothesis?
• Fail
to
reject
null hypothesis
If there
is not the
convincing
evidence, then
we would “fail to reject” the null
Ha: the hypothesis.
defendantRemember
is guilty that the actually
verdict that is returned is “GUILTY” or
“NOT GUILTY”. We never end up
determining the null hypothesis is true –
only that there is not enough evidence to
say it’s not true.
The Form of Hypotheses:
Null hypothesis
H0: population characteristic = hypothesized value
This one is considered a
two-tailed test because
you are interested
in both
Alternative
hypothesis
The null
hypothesis always
direction.
includes the equal
case.
Ha: population
characteristic
> hypothesized
value
This hypothesized
valuevalue
is
Ha: population characteristic
< hypothesized
a specific number
Ha: population characteristic
≠ hypothesized
This
signby
is the value
determined
Notice
that the alternative
These are
considered
onedetermined
by
the
context
of
the
problem
hypothesis
usesyou
the same
tailed tests
because
context
of the
population
characteristic
and
the
Let’s
practice
writing
are only interested in one
problem.
same hypothesized
value
as
the
hypothesis statements.
direction.
null hypothesis.
Sharing prescription drugs with others can be
dangerous. A survey of a representative sample
of 592 U.S. teens age 12 to 17 reported that 118
of those surveyed admitted to having shared a
prescription drug with a friend. Is this sufficient
evidence that more than 10% of teens have
shared prescription medication with friends?
State the hypotheses :
What is the
What words indicate
What
the
is the
population
hypothesized
direction of theThe
characteristic
ofp of
value?
true
proportion
H
:
p
=
.1
0
alternative hypothesis?
teensinterest?
who have shared
Ha: p > .1
prescription medication
with friends
Compact florescent (cfl) lightbulbs are
much more energy efficient than regular
incandescent lightbulbs. Ecobulb brand
60-watt cfl lightbulbs state on the package
“Average life 8000 hours”. People who purchase
this brand would be unhappy if the bulbs lasted
less than 8000 hours. A sample of these bulbs
will be selected and tested.
What is the
StateWhat
the words
hypotheses
: theis the
indicate
What
population
hypothesized
direction of thecharacteristic
of
value?
H : m alternative
= 8000 hypothesis?interest?
0
Ha: m < 8000
The true mean (m) life of
the cfl lightbulbs
Because in variation of the manufacturing
process, tennis balls produced by a particular
machine do not have the same diameters.
Suppose the machine was initially calibrated to
achieve the specification of m = 3 inches.
However, the manager is now concerned that the
diameters no longer conform to this specification.
If the mean diameter is not 3 inches, production
will have to be halted.
State the hypotheses :
What words indicateWhat
the is the population
direction
of
The true mean m
H0: m =of3 the characteristic
alternative hypothesis? diameter
interest?
of tennis
Ha: m ≠ 3
balls
For each pair of hypotheses, indicate which are
Must useand
a population
characteristic - x
not legitimate
explain why
is a statistics (sample)
a) H0 : m  15 ; Ha : m  15
greater
than!
b) H0 : xMust
 4be
; Honly
:
x

15
a
Must use same
c) H0number
: p  .1as; H
:
p
a
in H0!
 .1
d) H0 :Hm0 MUST
 2.3 ;be
Ha“=“
: m!  3.2
e) H0 : p  .5 ; Ha : p  .5
When you perform a hypothesis
test you make a decision:
reject H0 or fail to reject H0
When
one of
Each could possibly
beyoua make
wrong
these decisions, there is
decision; therefore,
there are
a possibility
thattwo
you
types of errors. could be wrong!
That you made an error!
Type I error
• The error of rejecting H0 when H0 is
true
• The probability of a Type I error is
denoted by a.
a is called the significance level of the
This is the lower-case
test
Greek letter “alpha”.
Type II error
• The error of failing to reject H0
when H0 is false
• The probability of a Type II error
is denoted by b
This is the lower-case
Greek letter “beta”.
Here is another way to look at the
types of errors:
Suppose
H0H
is is
true
Suppose
Suppose
H
is
true
is
Suppose
0H
00
and
wereject
fail
to
and
false
weand
wewefail
it,
false
and
reject
it,
what
toreject
what
rejecttype
it,what
what
of
it,
type
of decision
decision
type
decision
of
decision
was
type
of
was
made?
was
made?
made?
was
made?
H0 is
true
H0 is
false
Reject
H0
Type I
error
Correct
Fail to
reject
H0
Type II
Correct
error
The U.S. Bureau of Transportation Statistics
reports that for 2009 72% of all domestic
passenger flights arrived on time (meaning within
15 minutes of its scheduled arrival time). Suppose
that an airline with a poor on-time record decides
to offer its employees a bonus if, in an upcoming
month, the airline’s proportion of on-time flights
exceeds the overall 2009 industry rate of .72.
State the hypotheses. Type I error – the airline
reward the
H0: p = .72
State a decides
Type Ito
error
employees when the
in
context.
H : p > .72
a
proportion of on-time
flights doesn’t exceeds .72
State a Type II error
Type II error – the airline
in context.
employees do not receive the
bonus when they deserve it.
In 2004, Vertex Pharmaceuticals, a biotechnology
company, issued a press release announcing that it
had filed an application with the FDA to begin clinical
trials on an experimental drug VX-680 that had been
found to reduce State
the growth
rate
of pancreatic
a Type
I error
in the and
colon cancer tumors
in animal
context
of studies.
this problem.
A potential consequence of making a Type I
What
is
athe
potential
error
would
be
that the
company
would
Data resulting
from
planned
clinical
trials
can be
tothis
devote
resources to the
error?
used toconsequence
test: continueof
development of the drug when it really is
Let m = the true mean growth
rate of tumors for patients
not effective.
taking the experimental drug
H0: m = mean growth rate of tumors for patients not
taking the experimental drug
Ha: m < mean growth rate of tumors for patients not
taking the experimental drug
A Type I error would be to incorrectly conclude
that the experimental drug is effective in
slowing the growth rate of tumors
In 2004, Vertex Pharmaceuticals, a biotechnology
company, issued a press release announcing that it
had filed an application with the FDA to begin
clinical trials on an experimental drug VX-680
Statetoa reduce
Type IIthe
error
in the
that had been found
growth
rate of
context
this
problem.
pancreaticAand
colon
canceroftumors
in animal
potential
consequence
of making
a Type
What
a potential
erroriswould
be that the company might
studies. II
abandon development
of a drug that was
consequence
of this error?
Data resulting from theeffective.
planned clinical trials can
be used to test:
H0: m = mean growth rate of tumors for patients not
taking the experimental drug
Ha: m < mean growth rate of tumors for patients not
taking the experimental drug
A Type II error would be to conclude that the
drug is ineffective when in fact the mean
growth rate of tumors is reduced
The relationship between a and b
The ideal test procedure would result in both
Selecting a of
significance
level aand
= .05
a = 0 (probability
a Type I error)
b=0
results inof
a test
procedure
that, used over
(probability
a Type
II error).
and over with different samples, rejects a
This isSo
impossible
to
achieve
since
we
must
why
choose
ainsmall
a base
truenot
H0always
about
5
times
100.
a = .05data.
or a = .01)?
our decision(like
on sample
Standard test procedures allow us to select a,
the significance level of the test, but we have no
direct control over b.
The relationship between a and b
Suppose
thisisnormal
curve
If
the
null
hypothesis
false
and
the
Let’s consider the represents the sampling
alternative hypothesis is true, then the
tail distribution
would
represent
the the
following This
hypotheses:
pb,when
This
is thefor
part
of
true proportion
is believed
to
be the
greater
probabilitycurve
of failing
to
reject
a a
null
hypothesis
is
true.
that
represents
than .5 – so the curve should really be
false
H0Type
.
or
the
I error.
shifted to the right.
H0: p = .5
Ha: p > .5
Let a = .05
.5
The relationship between a and b
If the null hypothesis is false and the
Let’s consider
the hypothesis is true, then the
alternative
This
tailthat
would
b, the
following
hypotheses:
true
proportion
to
be bgreater
Notice
asisrepresent
abelieved
gets smaller,
probability
of
failing
to
reject
a
than .5 – sogets
the
curve
should
really
be
larger!
false to
H0the
.
shifted
right.
H0: p = .5
Ha: p > .5
Let a = .01
How does one decide what a
level to use?
After assessing the consequences of
type I and type II errors, identify the
largest a that is tolerable for the
problem. Then employ a test procedure
that uses this maximum acceptable value
–rather than anything smaller – as the
level of significance.
Remember, using a smaller a increases b.
The EPA has adopted what is known as the Lead and
Copper Rule, which defines drinking water as unsafe if the
concentration of lead is 15 parts per billion (ppb) or
greater or if the concentration of copper is 1.3 ppb or
greater.
aa Type
III
error
inmight
Which
type of
error
has
a more
serious
The manager
of aState
community
water
system
use lead
State
Type
error
incontext.
context.
level measurements
from aconsequence?
sample
water specimens
Since
most
people
wouldof
consider
theI? to
What
is
a
consequence
of
a
Type
What is
a consequence
ofIaerror
Type II?
test the following
hypotheses:
consequence
of the Type
H0: m = 15we
versus
Hawant
: m < 15to keep
more serious,
would
A Type I error
theselect
conclusion
that a water source
a leads
smallto
– so
a smaller
meets EPA standards
when the
water
significance
level
of ais=really
.01. unsafe.
There are possible health risks to the community
A Type II error leads to the conclusion that a
water source does NOT meet EPA standards
when the water is really safe.
The community might lose a good water source.
Large-Sample Hypothesis Test
for a Population Proportion
The fundamental idea behind hypothesis
testing is:
We reject H0 if the observed sample is very
unlikely to occur if H0 is true.
Recall the General Properties for
Sampling Distributions of p
These three properties imply that the standardized
variable
pˆ  p
z 
1. ˆ
p 1  p 
p
n
As
long normal
as the distribution
sample size is
has an approximately
p (1  p )standard
less than 10% of the population
 pˆn is
 large.
when
2.
m p
n
3. When n is large, the sampling
distribution of p is approximately
normal.
In June 2006, an Associated Press survey
was conducted to investigate how people
use the nutritional information provided
on food packages. Interviews were
conducted with 1003 randomly selected
adult Americans, and each participant was
asked a series of questions, including the following two:
Based on these data, is it reasonable to
Question
1: When
purchasing
packaged
food,Americans
how often do
conclude
that
a majority
of adult
you
check thecheck
nutritional
labeling on
the package?
frequently
nutritional
labels
when
purchasing
packaged
Question 2: How
often do you
purchasefoods?
food that is bad
for you, even after you’ve checked the nutrition
labels?
It was reported that 582 responded “frequently” to the
question about checking labels and 441 responded “very
often” or “somewhat often” to the question about
purchasing bad foods even after checking the labels.
Nutritional Labels Continued . . .
H0: p = .5
Ha:We
p > .5
will create a test statistic using:
p = true proportion of adult
pˆ  p Americans who
z  nutritional labels
frequently check
p 1  p 
n
We use p > .5 to
test for a majority
582
pˆ 
who
.58 frequently
of adult Americans
For this sample:
1003
check nutritional
labels.
A test statistic indicates how many standard
This
observed
is is
greater
than
deviations
thesample
sampleproportion
statistic (p)
from the
.5..58
Isit.5plausible
a characteristic
sample proportion
of p = .58
population
(p).
z occurred

 5a.08
as
result of chance variation, or is it
.5.5
unusual to observe a sample proportion this large
1003
when p = .5?
Nutritional Labels Continued . . .
H0: p = .5
Ha: p > .5
p = true proportion of adult Americans who
frequently
check
nutritional
labels
the
normal
NextIn
we
findstandard
the P-value
for curve,
this seeing
a value
5.08 or larger is unlikely.
testofstatistic.
It’s probability
582 is approximately 0.
pˆ 
 .58
For this sample:
1003
Since the P-value
is so small,
.58P-value
 .5
The
is the probability
of obtaining
a we
z 
 5.08
reject
H0. There is convincing
test.5statistic
at least
as inconsistent
with H0
.5
evidence
to suggest
that the
as
was
observed,
assuming
H
is
true.
0
1003
P-value ≈ 0
majority of adult Americans
frequently check the nutritional
labels on packaged0foods.
Computing P-values
The calculation of the P-value depends on the form
of the inequality in the alternative hypothesis.
• Ha: p > hypothesize value
z curve
P-value = area in upper tail
Calculated z
Computing P-values
The calculation of the P-value depends on the form
of the inequality in the alternative hypothesis.
• Ha: p < hypothesize value
z curve
P-value = area in lower tail
Calculated z
Computing P-values
The calculation of the P-value depends on the form
of the inequality in the alternative hypothesis.
• Ha: p ≠ hypothesize value
P-value = sum of area in two tails
z curve
Calculated z and –z
Using P-values to make a decision:
To decide whether or not to reject
H0, we compare the P-value to the
significance level a
If the P-value > a, we “fail to reject”
the null hypothesis.
If the P-value < a, we “reject” the null
hypothesis.
Summary of the Large-Sample z
Test for p
Null hypothesis:
Test Statistic:
H0: p = hypothesized value
pˆ  p
z 
p (1  P )
n
Alternative Hypothesis:
P-value:
Ha: p > hypothesized value Area to the right of calculated z
Ha: p < hypothesized value Area to the left of calculated z
Ha: p ≠ hypothesized value 2(Area to the right of z) of +z
or 2(Area to the left of z) of -z
Summary of the Large-Sample z
Test for p Continued . . .
Assumptions:
1.
p is a sample proportion from a random sample
2. The sample size n is large. (np > 10 and
n(1 - p) > 10)
3. If sampling is without replacement, the sample
size is no more than 10% of the population size
A report states that nationwide, 61% of high
school graduates go on to attend a two-year or
four-year college the year after graduation.
Suppose a random sample of 1500 high school
graduates in 2009 from a particular state
estimated the proportion of high school graduates
that attend college the year after graduation to be
58%. Can we reasonably conclude that the
proportion of this state’s high school graduates in
2009 who attended college the year after
graduation is different from the national figure?
Use a = .01.
H0: p = .61
Ha: p ≠ .61
Where p is the proportion of all
State
hypotheses.
2009
highthe
school
graduates in
this state who attended college
the year after graduation
College Attendance Continued . . .
H0: p = .61
Ha: p ≠ .61
Where p is the proportion of all 2009 high
school graduates in this state who
attended college the year after graduation
Assumptions:
• Given a random sample of 1500 high school
graduates
• Since 1500(.61) > 10 and 1500(.39) > 10, sample
size is large enough.
• Population size is much larger than the sample
size.
College Attendance Continued . . .
H0: p = .61
Ha: p ≠ .61
Where p is the proportion of all 2009 high
school graduates in this state who attended
college the year after graduation
Test statistic:
z 
.58  .61
 2.38
.61(.39)
What potential error could you
1500
have made?
Type II
The= area
P-value = 2(.0087)
.0174to the left
Useof
a -2.38
= .01 is
approximately .0087
Since P-value > a, we fail to reject H0. The evidence does
not suggest that the proportion of 2009 high
school graduates in this state who attended
college the year after graduation differs from
the national value.
In December 2009, a county-wide water
conservation campaign was conducted in a
particular county. In 2010, a random
sample of 500 homes was selected and water
usage was recorded for each home in the sample.
Suppose the sample results were that 220
households had reduced water consumption. The
county supervisors wanted to know if their data
supported the claim that fewer than half the
households in the county reduced water
consumption.
H0: p = .5
Ha: p < .5
State
the hypotheses.
where p is the
proportion
of all households in
Calculate
the county with reduced
water p.
usage
pˆ 
220
 .44
500
Water Usage Continued . . .
H0: p = .5
Ha: p < .5
where p is the proportion of all households in
the county with reduced water usage
Verify assumptions
1. p is from a random sample of households
2. Sample size n is large because np = 250 >10 and n(1-p) =
250 > 10
3. It is reasonable that there are more than 5000 (10n)
households in the county.
Water Usage Continued . . .
H0: p = .5
Ha: p < .5
where p is the proportion of all households in
the county with reduced water usage
Calculate
the up
test
Look
this
value
in the
.44  .5
statistic
P-value
z 
 2.What
68
potential
error
could
you
table
of z and
curve
areas
.5(.5)
Type I
have made?
500
P-value = .0037
Use a = .01
Since P-value < a, we reject H0. There is convincing
evidence that the proportion of households with
reduced water usage is less than half.
Water Usage Continued . . .
H0: p = .5
Ha: p < .5
where p is the proportion of all households in
the county with reduced water usage
Since P-value < a, we reject H0.
Confidence intervals are twoUsed a = .01
Compute
98%
confidence
tailed,
soain
we
need
to put
in
With .01
each
tail,
that.01
puts
interval:
the
upper
tail
(since
the –curve is
.98
in
the
middle
Notice that the
Let’s create
a confidence
symmetrical).
are is
testing
Ha: p < .05, a
hypothesizedSince
valueweinterval
this
the
appropriate
with
this
data.


.
44
(.
56
)
would
also
be
in
the
lower
tail.
 level

of .5 is NOT in the
.
44

2
.
326
confidence
What is the appropriate


500
.98
98% confidence
 to use? 
confidence level
interval and that we
(.
388
,
.
492
)
“rejected”
H0!
.5
College Attendance Revisited . . .
H0: p = .61
Ha: p ≠ .61
Where p is the proportion of all 2009 high
school graduates in this state who attended
college the year after graduation
Since P-value > a, we fail to reject H0.
Use a = .01
Let’s compute
a a
This is a two-tailed
test so
for
Notice that the gets splitconfidence
evenly intointerval
both tails,
this
problem.
.58
(.42) 
hypothesized value leaving 99%
in the
middle.
.58  2.576

of .5 IS
in
the
98%

.99
1500


confidence interval
and that we “failed
(.547, .613)
to reject” H0!
Hypothesis Tests
for a Population Mean
Let’s review the assumptions for a
confidence interval for a population mean
The assumptions are the same for a
1) x is the large-sample
sample mean from
a random
hypothesis
testsample,
for a
2) the sample size n population
is large (n mean.
> 30), and
3) , the population standard deviation, is known
or unknown
This
Thisisisthe
thetest
teststatistic
statistic
when
when isisunknown.
known.
x m
z 

n
P-value is area under
the z curve
x m
t 
s
n
P-value is area under
the t curve with df=n-1
The One-Sample t-test for a
Population Mean
Null hypothesis:
Test Statistic:
H0: m = hypothesized value
x m
t 
s
n
Alternative Hypothesis:
P-value:
Ha: m > hypothesized value Area to the right of calculated t
with df = n-1
Ha: m < hypothesized value Area to the left of calculated t
with df = n-1
Ha: m ≠ hypothesized value 2(Area to the right of t) of +t
or 2(Area to the left of t) of -t
The One-Sample t-test for a
Population Mean Continued . . .
Assumptions:
1.
x and s are the sample mean and sample
standard deviation from a random sample
2. The sample size n is large (n > 30) or the
population distribution is at least approximately
normal.
A study conducted by researchers at Pennsylvania
State University investigated whether time
perception, an indication of a person’s ability to
concentrate, is impaired during nicotine withdrawal.
After a 24-hour smoking abstinence, 20 smokers were
asked to estimate how much time had elapsed during a
45-second period. Researchers wanted to see whether
smoking abstinence had a negative impact on time
perception, causing elapsed time to be overestimated.
Suppose the resulting data on perceived elapsed time
(in seconds) were as follows:
69 65 72 73 59 55 39 52 67 57
56 50 70 47 56 45 70 64 67 53
What is the mean and standard
x = 59.30 s = 9.84
n = 20
deviation
of the sample?
Smoking Abstinence Continued . . .
69 65 72 73 59 55 39 52 67 57
56 50 70 47 56 45 70 64 67 53
x = 59.30 s = 9.84 n = 20
Where m is the true mean perceived elapsed
H0: m = 45
time for smokers who haveState
abstained
from
the
Ha: m > 45
smoking for 24-hours
Since the boxplot is approximately hypotheses.
Assumptions:
symmetrical, it is plausible that the
1) It is reasonable
todistribution
believe thatisthe sampleVerify
of smokers
population
To do this, we need to
graph the
assumptions.
is representative
of
all
smokers.
approximately normal.
data using a boxplot or normal
2) Since the sample size is probability
not
plot
at least 30, we must
determine if it is plausible
that the population
distribution is approximately
40 50 60 70
normal.
Smoking Abstinence Continued . . .
69 65 72 73 59 55 39 52 67 57
56 50 70 47 56 45 70 64 67 53
x = 59.30 s = 9.84 n = 20
H0: m = 45
Ha: m > 45
Where m is the true mean perceived elapsed
time for smokers whoCompute
have abstained
from
the test
smoking for 24-hours
Test statistic:
P-value ≈ 0
statistic and P-value.
59.30  45
t 
 6.50
9.84
20
a = .05
Since P-value < a, we reject H0. There is convincing
evidence that the mean perceived elapsed time is
greater than the actual elapsed time of 45 seconds.
Smoking Abstinence Continued . . .
69 65 72 73 59 55 39 52 67 57
56 50 70 47 56 45 70 64 67 53
x = 59.30 s = 9.84 n = 20
Compute
the appropriate
Where
m
is
the
true
mean
perceived elapsed
H0: m = 45
confidence
time for smokers
who have interval.
abstained from
Ha: m > 45
smoking for 24-hours
Since P-value < a, we reject H0.
Notice that the
a = .05
hypothesized value
 9.84
 
59
.
30

1
.
729
of 45 is NOT in the
20 

90%
confidence
Since
this is a one-tailed
test,
a )
(55.497
, 63.103
interval
wetail. .05 goes in
goes inand
thethat
upper
“rejected”
H0! leaving .90 in the
the lower tail,
middle.
A growing concern of employers is time spent in activities
like surfing the Internet and emailing friends during work
hours. The San Luis Obispo Tribune summarized the
findings of a large survey of workers in an article that
ran under the headline “Who Goofs Off More than 2
Hours a Day? Most Workers, Survey Says” (August 3,
2006). Suppose that the CEO of a large company wants
to determine whether the average amount of wasted time
during an 8-hour day for employees of her company is less
than the reported 120 minutes. Each person in a random
sample of 10 employees was contrasted and asked about
daily wasted time at work. The resulting data are the
following:
108 112 117 130 111
131 113 113 105 128
What is the mean and standard
x = 116.80 s = 9.45 n = 10
deviation of the sample?
Surfing Internet Continued . . .
108 112 117 130 111 131 113 113 105 128
x = 116.80 s = 9.45 n = 10
H0: m = 120
Ha: m < 120
Where m is the true mean daily wasted
time for employees of this company
The boxplot reveals some skewness,
State
the
Verify
the
but
there
is
no
outliers.
It
is
plausible
Assumptions:
hypotheses.
assumptions.
that
the
population
distribution
is
1) The given sample was a random sample of employees
approximately normal.
2) Since the sample size is not
at least 30, we must
determine if it is plausible
that the population
110
120
130
distribution is approximately
normal.
Surfing Internet Continued . . .
108 112 117 130 111 131 113 113 105 128
x = 116.80 s = 9.45 n = 10
H0: m = 120
Ha: m < 120
Where m is the true mean daily wasted
time for employees of this company
the
testwe
116potential
.80 Compute
120 error
What
could

 and
1.07
Test Statistic: thave
statistic
TypeP-value.
II
9made?
.45
P-value =.150
10
a = .05
Since p-value > a, we fail to reject H0. There is not
sufficient evidence to conclude that the mean daily
wasted time for employees of this company is less than
120 minutes.
Power and the Probability
of Type II Error
Recall Type I and Type II Errors:
Suppose
H0H
is is
true
Suppose
Suppose
H
is
true
is
Suppose
0H
00
and
wereject
fail
to
and
false
weand
wewefail
it,
false
and
reject
it,
what
toreject
what
rejecttype
it,what
what
of
it,
type
of decision
decision
type
decision
of
decision
was
type
of
was
made?
was
made?
made?
was
made?
Reject
H0
H0 is
true
H0 is
false
Type I
error
Correct
Power
Fail to
Type II
Correct
reject that
The probability
we correctly reject H0 is
error
called the power of the test.
H
0
Suppose that the student body president at a
university is interested in studying the amount of
money that students spend on textbooks each
semester. The director of financial aid services
Thethat
power
of a test
depends
true value of the
believes
average
amount
spentononthe
textbooks
mean!
Becauseand
the uses
actual
value
m is unknown,
is $500
each
semester,
this
to of
However,
if
the
true
mean
was
$525,
it
is
less
likely
If
the
true
mean
is
greater
than
$500,
then
we of
we
cannot
know
the
power
for
the
actual
value
determine
the
amount
of
financial
aid
for
which
a
that
the reject
sampleHwould
be mistaken
for
a sample
from
should
true
mean
is ONLY
a
m
.
0. BUT, if the
student
isgreater,
eligible.if
The
student
body
president
the
population
the
meanthen
were
$500.
So,mean
it is might
more
little
say
$505,
the
sample
BUT,
wethat
can we
gain
insight
to the
power
m by
plans to
ask
each
student
in
acorrectly
random
sample
how
likely
will
reject
H0of
.$500.
look
like
we
expect
if
the
true
mean
were
investigating
somethis
what
if scenarios
...
muchThus
he or
she
spent
on
books
semester
and
we wouldn’t have convincing evidence to reject
use the data to test (using a H
=0.05)
the following
.
hypotheses:
H0: m = 500 versus Ha: m > 500
Let’s consider a one-sided, upper tail test.
Fail to Reject H0
Reject H0
Power = 1 - b
b
m0
a
ma
If the null hypothesis is false, then
m > hypothesized value
Textbooks Continued . . .
H0: m = 500
Ha: m > 500
Suppose that  = $85 and
n = 100. (Since n is large, the
sampling distribution of x is
approximately normal.)
What is the probability of committing a Type I
error?
a = .05
This is the z value with
area
to its of
left.
= 500 is true, for .95
what
values
the sample
If m
mean would you reject the null hypothesis?
Rejection Region
x  500
Use:
1.645 
What is the value of this x?
.95
We would reject H0
for x > 513.98.
85
100
Textbooks Continued . . .
Suppose that  = $85 and
n = 100.
Ha: m > 500 We would reject H
0
for x > 513.98.
If the null hypothesis
is false, then m > 500.
What is theThis
probability
of a Type
II
error (isb)?
area (to the left
of
x
=
513.98)
What if m =520?b.
H0: m = 500
513.98  520
z 
 .708
85
100
Rejection Region
b = .239Look this up in the
table of areas for z
curves
Textbooks Continued . . .
H0: m = 500
Ha: m > 500
Suppose that  = $85 and
n = 100.
We would reject H0
for x > 513.98.
What is the power of the test if m = 520?
b = .239
Power
is
the
probability
of
power is in the
Power = Notice
1 -correctly
.239that
= .761
rejecting H .
SAME curve as b 0
Rejection Region
Power = 1 – b
Textbooks Continued . . .
Suppose that  = $85 and
n = 100.
Ha: m > 500 We would reject H
0
for x > 513.98.
If we reject H0, then m
Notice that, as the distance
between
> 500.
Find b and
power.
the
null hypothesized
valueiffor
and
What
m =m530?
our alternative value for m increases, b
decreases AND power increases.
b = .03
power = .97
H0: m = 500
Rejection Region
Textbooks Continued . . .
Suppose that  = $85 and
n = 100.
Ha: m > 500 We would reject H
0
for x > 513.98.
If the null hypothesis
Notice that, as theisdistance
between
false, then
m > 500.
Find b and
power.
the
null hypothesizedWhat
valueiffor
and
m =m 510?
our alternative value for m decreases, b
increases AND power decreases.
b = .68
power = .32
H0: m = 500
Rejection Region
Textbooks Continued . . .
Suppose that  = $85 and
n = 100.
H0: m = 500
Ha: m > 500
b will increase and power will
decrease.
What happens if we use a = .01?
Rejection Region
b
Rejection Region
Power
b
Power
What happens to a, b, & power when
the sample size is increased?
Fail to Reject H0
a
The standard
b decreases
and
The
significance
deviation
will
power
increases!
Reject
levelH(a)
remains
0
decrease
making
thecurve
same taller
– so the
the
valueskinnier.
where the
and
rejection region
begins must move.
m0
b
Power
ma
Effects of Various Factors on
the Power of a Test
• The larger the size of the discrepancy
between the hypothesized value and the
actual value of the population
characteristic, the higher the power.
• The larger the significance level a, the
higher the power of a test.
• The larger the sample size, the higher
the power of a test
b and Power for a t Test
When using a t-test, the population standard
deviation  is unknown. b not only depends on a,
n, and the actual value of m, but b also depends 
so we must have an estimate of .
The b curves (on the next slide) can be used to
estimate b and the power of a test based upon
the value of d.
d 
alternativ e value - hypothesiz ed value
σ
b curves
Consider testing
H0: m = 100 versus Ha: m > 100
and
focus
onpopulation
the alternative
value mis= normal,
110.
When
the
distribution
Suppose
= 10,
n = 7,hypotheses
and a = .01.about m
the t that
test for
testing
has smaller b than does any other test
110

100
d procedure that
 1has the same significance
level a.
10
Calculate d.
Use the df = 6
Power ≈ .4 curve to
estimate b
b ≈ .6