Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
+ Chapter 9: Testing a Claim Type I and II Error + Nonstatistical Hypothesis Testing A criminal trial is an example of hypothesis testing without the statistics. In a trial a jury must decide between two hypotheses. The null hypothesis is H0: The defendant is innocent The alternative hypothesis or research hypothesis is Ha: The defendant is guilty The jury does not know which hypothesis is true. They must make a decision on the basis of evidence presented. + Nonstatistical Hypothesis Testing There are two possible errors. A Type I error occurs when we reject a true null hypothesis. That is, a Type I error occurs when the jury convicts an innocent person. A Type II error occurs when we don’t reject a false null hypothesis [accept the null hypothesis]. That occurs when a guilty defendant is acquitted. I and Type II Errors + Type Definition: If we reject H0 when H0 is true, we have committed a Type I error. If we fail to reject H0 when H0 is false, we have committed a Type II error. Truth about the population Conclusion based on sample H0 true H0 false (Ha true) Reject H0 Type I error Correct conclusion Fail to reject H0 Correct conclusion Type II error On-Time Arrivals The U.S. Department of Transportation reports that for 2008, 65.3% of all domestic passenger flights arrived within 15 minutes of the scheduled arrival time. Suppose that an airline with a poor on-time record decides to offer its employees a bonus if the airline’s proportion of on-time flights exceeds the overall industry rate of 0.653 in an upcoming month. A random sample of flights could be selected and used as a basis for choosing between H0 : p = 0.653 Ha : p > 0.653 where p is the actual proportion of the airlines flights that are on-time during the month of interest. Describe a Type I and a Type II error in this setting, and explain the consequences of each. • A Type I error would be concluding that the airline on-time rate exceeds 0.653, when in fact the airline does not have a better on-time record. Consequence: The airline would reward its employees when the proportion of on-time flights was not actually greater than 0.653. •A Type II error is not concluding that the airline’s on-time proportion is greater than 0.653 when it really is doing better. •Consequence: The employees would not receive a reward that they had earned. + Example: Perfect Potatoes A potato chip producer and its main supplier agree that each shipment of potatoes must meet certain quality standards. If the producer determines that more than 8% of the potatoes in the shipment have “blemishes,” the truck will be sent away to get another load of potatoes from the supplier. Otherwise, the entire truckload will be used to make potato chips. To make the decision, a supervisor will inspect a random sample of potatoes from the shipment. The producer will then perform a significance test using the hypotheses H0 : p = 0.08 Ha : p > 0.08 where p is the actual proportion of potatoes with blemishes in a given truckload. Describe a Type I and a Type II error in this setting, and explain the consequences of each. • A Type I error would occur if the producer concludes that the proportion of potatoes with blemishes is greater than 0.08 when the actual proportion is 0.08 (or less). Consequence: The potato-chip producer sends the truckload of acceptable potatoes away, which may result in lost revenue for the supplier. • A Type II error would occur if the producer does not send the truck away when more than 8% of the potatoes in the shipment have blemishes. Consequence: The producer uses the truckload of potatoes to make potato chips. More chips will be made with blemished potatoes, which may upset consumers. + Example: Probabilities We can assess the performance of a significance test by looking at the probabilities of the two types of error. That’s because statistical inference is based on asking, “What would happen if I did this many times?” + Error For the truckload of potatoes in the previous example, we were testing H0 : p = 0.08 Ha : p > 0.08 where p is the actual proportion of potatoes with blemishes. Suppose that the potato-chip producer decides to carry out this test based on a random sample of 500 potatoes using a 5% significance level (α = 0.05). Assuming H 0 : p = 0.08 is true, the sampling distribution of pˆ will have : Shape : Approximately Normal because 500(0.08) = 40 and 500(0.92) = 460 are both at least 10. Center : m pˆ = p = 0.08 Spread : s pˆ = p(1- p) = n The shaded area in the right tail is 5%. Sample proportion values to the right of 0.08(0.92) the green line at 0.0999 will cause us to =reject 0.0121 H0 even though H0 is true. This will 500 happen in 5% of all possible samples. That is, P(making a Type I error) = 0.05. Probabilities The probability of a Type I error is the probability of rejecting H0 when it is really true. As we can see from the previous example, this is exactly the significance level of the test. Significance and Type I Error The significance level α of any fixed level test is the probability of a Type I error. That is, α is the probability that the test will reject the null hypothesis H0 when H0 is in fact true. Consider the consequences of a Type I error before choosing a significance level. What about Type II errors? A significance test makes a Type II error when it fails to reject a null hypothesis that really is false. The probability of a type II error is β (Greek letter beta). We will talk more about β in the next section!!! The two probabilities are inversely related. Decreasing one increases the other, for a fixed sample size. + Error