Download Type II error - Gattoni Math

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Probability wikipedia , lookup

Statistics wikipedia , lookup

Transcript
+
Chapter 9: Testing a Claim
Type I and II Error
+
Nonstatistical Hypothesis Testing
A criminal trial is an example of hypothesis testing without the
statistics.
In a trial a jury must decide between two hypotheses. The null
hypothesis is
H0: The defendant is innocent
The alternative hypothesis or research hypothesis is
Ha: The defendant is guilty
The jury does not know which hypothesis is true. They must
make a decision on the basis of evidence presented.
+
Nonstatistical Hypothesis Testing
There are two possible errors.
A Type I error occurs when we reject a true null hypothesis. That
is, a Type I error occurs when the jury convicts an innocent
person.
A Type II error occurs when we don’t reject a false null
hypothesis [accept the null hypothesis]. That occurs when a guilty
defendant is acquitted.
I and Type II Errors
+
 Type
Definition:
If we reject H0 when H0 is true, we have committed a Type I error.
If we fail to reject H0 when H0 is false, we have committed a Type II
error.
Truth about the population
Conclusion
based on
sample
H0 true
H0 false
(Ha true)
Reject H0
Type I error
Correct
conclusion
Fail to reject
H0
Correct
conclusion
Type II error
On-Time Arrivals
The U.S. Department of Transportation reports that for 2008, 65.3% of all domestic
passenger flights arrived within 15 minutes of the scheduled arrival time. Suppose
that an airline with a poor on-time record decides to offer its employees a bonus if the
airline’s proportion of on-time flights exceeds the overall industry rate of 0.653 in an
upcoming month. A random sample of flights could be selected and used as a basis
for choosing between
H0 : p = 0.653
Ha : p > 0.653
where p is the actual proportion of the airlines flights that are on-time during the
month of interest.
Describe a Type I and a Type II error in this setting, and explain the
consequences of each.
• A Type I error would be concluding that the airline on-time rate exceeds
0.653, when in fact the airline does not have a better on-time record.
Consequence: The airline would reward its employees when the proportion
of on-time flights was not actually greater than 0.653.
•A Type II error is not concluding that the airline’s on-time proportion is
greater than 0.653 when it really is doing better.
•Consequence: The employees would not receive a reward that they had
earned.
+
 Example:
Perfect Potatoes
A potato chip producer and its main supplier agree that each shipment of potatoes
must meet certain quality standards. If the producer determines that more than 8% of
the potatoes in the shipment have “blemishes,” the truck will be sent away to get
another load of potatoes from the supplier. Otherwise, the entire truckload will be
used to make potato chips. To make the decision, a supervisor will inspect a random
sample of potatoes from the shipment. The producer will then perform a significance
test using the hypotheses
H0 : p = 0.08
Ha : p > 0.08
where p is the actual proportion of potatoes with blemishes in a given truckload.
Describe a Type I and a Type II error in this setting, and explain the
consequences of each.
• A Type I error would occur if the producer concludes that the proportion of
potatoes with blemishes is greater than 0.08 when the actual proportion is
0.08 (or less). Consequence: The potato-chip producer sends the truckload
of acceptable potatoes away, which may result in lost revenue for the
supplier.
• A Type II error would occur if the producer does not send the truck away
when more than 8% of the potatoes in the shipment have blemishes.
Consequence: The producer uses the truckload of potatoes to make potato
chips. More chips will be made with blemished potatoes, which may upset
consumers.
+
 Example:
Probabilities
We can assess the performance of a significance test by looking at the
probabilities of the two types of error. That’s because statistical
inference is based on asking, “What would happen if I did this many
times?”
+
 Error
For the truckload of potatoes in the previous example, we were testing
H0 : p = 0.08
Ha : p > 0.08
where p is the actual proportion of potatoes with blemishes. Suppose that the
potato-chip producer decides to carry out this test based on a random sample of
500 potatoes using a 5% significance level (α = 0.05).
Assuming H 0 : p = 0.08 is true, the sampling distribution of pˆ will have :
Shape : Approximately Normal because 500(0.08) = 40 and
500(0.92) = 460 are both at least 10.
Center : m pˆ = p = 0.08
Spread : s pˆ =
p(1- p)
=
n
The shaded area in the right tail is 5%.
Sample proportion values to the right of
0.08(0.92) the green line at 0.0999 will cause us to
=reject
0.0121
H0 even though H0 is true. This will
500
happen in 5% of all possible samples.
That is, P(making a Type I error) = 0.05.
Probabilities
The probability of a Type I error is the probability of rejecting H0 when it is
really true. As we can see from the previous example, this is exactly the
significance level of the test.
Significance and Type I Error
The significance level α of any fixed level test is the probability of a Type I
error. That is, α is the probability that the test will reject the null
hypothesis H0 when H0 is in fact true. Consider the consequences of a
Type I error before choosing a significance level.
What about Type II errors? A significance test makes a Type II error when it
fails to reject a null hypothesis that really is false.
The probability of a type II error is β (Greek letter beta).
We will talk more about β in the next section!!!
The two probabilities are inversely related. Decreasing one increases the
other, for a fixed sample size.
+
 Error