Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
8 Tests of Hypotheses Based on a Single Sample Copyright © Cengage Learning. All rights reserved. 8.2 Tests About a Population Mean Copyright © Cengage Learning. All rights reserved. z Tests for Hypotheses about a Population Mean Recall from the previous section that a conclusion in a hypothesis testing analysis is reached by proceeding as follows: 3 z Tests for Hypotheses about a Population Mean Determination of the P-value depends on the distribution of the test statistic when π»0 is true. In this section we describe z tests for testing hypotheses about a single population mean π. By βz test,β we mean that the test statistic has at least approximately a standard normal distribution when π»0 is true. The P-value will then be a z curve area which depends on whether the inequality in π»π is >, <, ππ β . 4 z Tests for Hypotheses about a Population Mean In the development of confidence intervals for π in Chapter 7, we first considered the case in which the population distribution is normal with known π, then relaxed the normality and known s assumptions when the sample size n is large, and finally described the one-sample t CI for the mean of a normal population. In this section we discuss the first two cases, and then present the one-sample t test in Section 8.3. 5 A Normal Population with Known ο³ 6 A Normal Population with Known ο³ Although the assumption that the value of ο³ is known is rarely met in practice, this case provides a good starting point because of the ease with which general procedures and their properties can be developed. The null hypothesis in all three cases will state that ο has a particular numerical value, the null value. We denote by ο0, so the null hypothesis has the form π»0 : π = π0 . Let X1,β¦, Xn represent a random sample of size n from the normal population. 7 A Normal Population with Known ο³ Then the sample mean has a normal distribution with expected value and standard deviation When H0 is true, Consider now the statistic Z obtained by standardizing under the assumption that H0 is true: 8 A Normal Population with Known ο³ Substitution of the computed sample mean gives z, the distance between and ο0 expressed in βstandard deviation units.β For example, if the null hypothesis is , and , then the test statistic value is z = (103 β 100)/2.0 = 1.5. That is, the observed value of is 1.5 standard Deviations (of ) larger than what we expect it to be when H0 is true. 9 A Normal Population with Known ο³ The statistic Z is a natural measure of the distance between , the estimator of ο, and its expected value when H0 is true. If this distance is too great in a direction consistent with Ha, there is substantial evidence that π»0 is false. Suppose first that the alternative hypothesis has the form Ha : ο > ο0. Then an value that considerable exceeds ο0 provides evidence against H0 . Such an value corresponds to a large positive value z. This in turn implies that any value exceeding the calculated z is more contradictory to H0 than z itself. 10 A Normal Population with Known ο³ It follows that Now here is the key point: when π»0 is true, the test statistic Z has a standard normal distributionβbecause we created Z by standardizing π assuming that π»0 is true (i.e., by subtracting π0 ). The implication is that in this case, the P-value is just the area under the standard normal curve to the right of z. Because of this, the test is referred to as upper-tailed. 11 A Normal Population with Known ο³ For example, in the previous paragraph we calculated z = 1.5. If in the alternative hypothesis there is π»π§ : π > 100, then P-value = area under the z curve to the right of 1.5 = 1 - Ξ¦(1.50) 5 .0668. At significance level .05 we would not be able to reject the null hypothesis because the P-value exceeds πΌ. Now consider an alternative hypothesis of the form π»π : π < π0 . In this case any value of the sample mean smaller than our π₯is even more contradictory to the null hypothesis. 12 A Normal Population with Known ο³ Thus any test statistic value smaller than the calculated z is more contradictory to π»0 than is z itself. It follows that The test in this case is customarily referred to as lowertailed. If, for example, the alternative hypothesis is π»π : π < 100 and z = 22.75, then P-value = Ξ¦(22.75) = .0030. This is small enough to justify rejection of π»0 at a significance level of either .05 or .01, but not .001. 13 A Normal Population with Known ο³ The third possible alternative,π»π : π β π0 , requires a bit more careful thought. Suppose, for example, that the null value is 100 and that x = 103 results in z = 1.5. Then any π₯ value exceeding 103 is more contradictory to π»0 than is 103 itself. So any z exceeding 1.5 is likewise more contradictory to π»0 than is 1.5. However, 97 is just as contradictory to the null hypothesis as is 103, since it is the same distance below 100 as 103 is above 100. Thus z =21.5 is just as contradictory to π»0 as is z = 1.5. 14 A Normal Population with Known ο³ Therefore any z smaller than 21.5 is more contradictory to π»0 than is 1.5 or 21.5. It follows that This would also be the P-value if x = 97 results in z = 21.5. The important point is that because of the inequality β in π»π , the P-value is the sum of an upper-tail area and a lower-tail area. By symmetry of the standard normal distribution, this becomes twice the area captured in the tail in which z falls. 15 A Normal Population with Known ο³ Equivalently, it is twice the area captured in the upper tail by |z|, i.e., 2[1 - Ξ¦(|z|)]. It is natural to refer to this test as being two-tailed because z values far out in either tail of the z curve argue for rejection of π»0 . The test procedure is summarized in the accompanying box, and the P-value for each of the possible alternative hypotheses is illustrated in Figure 8.4. 16 A Normal Population with Known ο³ 17 A Normal Population with Known ο³ 18 A Normal Population with Known ο³ Use of the following sequence of steps is recommended when testing hypotheses about a parameter. The plausibility of any assumptions underlying use of the selected test procedure should of course be checked before carrying out the test. 19 A Normal Population with Known ο³ The formulation of hypotheses (Steps 2 and 3) should be done before examining the data, and the significance level a should be chosen prior to determination of the P-value. 20 Example 8.6 A manufacturer of sprinkler systems used for fire protection in office buildings claims that the true average system-activation temperature is 130°. A sample of n = 9 systems, when tested, yields a sample average activation temperature of 131.08°F. If the distribution of activation times is normal with standard deviation 1.5°F, does the data contradict the manufacturerβs claim at significance level ο‘ = .01? 21 Example 8.6 contβd 1. Parameter of interest: ο = true average activation temperature. 2. Null hypothesis: H0: ο = 130 (null value = ο0 = 130). 3. Alternative hypothesis: Ha: ο β 130 (a departure from the claimed value in either direction is of concern). 4. Test statistic value: 22 Example 8.6 contβd 5. Substituting n = 9 and π₯ = 131.08, That is, the observed sample mean is a bit more than 2 standard deviations above what would have been expected were H0 true. 6. The inequality in π»π implies that the test is two-tailed, so the P- value results from doubling the captured tail area: 23 Example 8.6 contβd 7. Because P-value = .0308 > .01 = πΌ, π»0 cannot be rejected at significance level .01. The data does not give strong support to the claim that the true average differs from the design value of 130. 24 A Normal Population with Known ο³ ο’ and Sample Size Determination The z tests with known π are among the few in statistics for which there are simple formulas available for ο’, the probability of a type II error. Consider first the alternative π»π : π > π0 . The null hypothesis is rejected if P-value β€ πΌ, and the P-value is the area under the standard normal curve to the right of z. Suppose that πΌ = .05. The z critical value that captures an upper-tail area of .05 is π§.05 =1.645 (look for a cumulative area of .95 in Table A.3). 25 A Normal Population with Known ο³ Thus if the calculated test statistic value z is smaller than 1.645, the area to the right of z will be larger than .05 and the null hypothesis will then not be rejected. Now substitute π₯ β π0 /π/ π in place of z in the inequality z < 1.645 and manipulate to isolate x on the left (multiply both sides by π/ π and then add π0 to both sides). This gives the equivalent inequality π₯ < π0 + ππ β π/ π. 26 A Normal Population with Known ο³ Now let ο ο’ denote a particular value of ο that exceeds the null value ο0. Then, 27 A Normal Population with Known ο³ As ο ο’ increases, ο0 β ο ο’ becomes more negative, so ο’ (ο ο’) will be small when ο ο’ greatly exceeds ο0 (because the value at which F is evaluated will then be quite negative). Error probabilities for the lower-tailed and two-tailed tests are derived in an analogous manner. If ο³ is large, the probability of a type II error can be large at an alternative value ο ο’ that is of particular concern to an investigator. 28 A Normal Population with Known ο³ Suppose we fix ο‘ and also specify ο’ for such an alternative value. In the sprinkler example, company officials might view ο ο’ = 132 as a very substantial departure from H0: ο = 130 and therefore wish ο’(132) = .10 in addition to ο‘ = .01. More generally, consider the two restrictions P(type I error) = ο‘ and ο’(ο ο’) = ο’ for specified ο‘, ο ο’ and ο’. 29 A Normal Population with Known ο³ Then for an upper-tailed test, the sample size n should be chosen to satisfy This implies that 30 A Normal Population with Known ο³ This equation is easily solved for the desired n. A parallel argument yields the necessary sample size for lower- and two-tailed tests as summarized in the next box. 31 Example 8.7 Let ο denote the true average tread life of a certain type of tire. Consider testing H0: ο = 30,000 versus Ha: ο > 30,000 based on a sample of size n = 16 from a normal population distribution with ο³ = 1500. A test with ο‘ = .01 requires zο‘ = z.01 = 2.33. 32 Example 8.7 contβd The probability of making a type II error when ο = 31,000 is 33 Example 8.7 contβd Since z.1 = 1.28, the requirement that the level .01 test also have ο’(31,000) = .1 necessitates The sample size must be an integer, so n = 30 tires should be used. 34 Large-Sample Tests 35 Large-Sample Tests When the sample size is large, the foregoing z tests are easily modified to yield valid test procedures without requiring either a normal population distribution or known ο³. The key result to justify large-sample confidence intervals was used in Chapter 7 to justify large sample confidence intervals: A large n implies that the standardized variable has approximately a standard normal distribution. 36 Large-Sample Tests Substitution of the null value ο0 in place of ο yields the test statistic which has approximately a standard normal distribution when H0 is true. 37 Large-Sample Tests The P-value is then determined exactly as was previously described in this section (e.g., F(z) when the alternative hypothesis isπ»π : π < π0 ). Rejecting π»0 when P-value β€ πΌ gives a test with approximate significance level a. The rule of thumb n > 40 will again be used to characterize a large sample size. 38 Example 8.8 A dynamic cone penetrometer (DCP) is used for measuring material resistance to penetration (mm/blow) as a cone is driven into pavement or subgrade. Suppose that for a particular application it is required that the true average DCP value for a certain type of pavement be less than 30. The pavement will not be used unless there is conclusive evidence that the specification has been met. 39 Example 8.8 contβd Letβs state and test the appropriate hypotheses using the following data (βProbabilistic Model for the Analysis of Dynamic Cone Penetrometer Test Values in Pavement Structure Evaluation,β J. of Testing and Evaluation, 1999: 7β14): 40 Example 8.8 contβd Figure 8.5 shows a descriptive summary obtained from Minitab. Descriptive Statistics Minitab descriptive summary for the DCP data of Example 8 Figure 8.5 41 Example 8.8 contβd The sample mean DCP is less than 30. However, there is a substantial amount of variation in the data (sample coefficient of variation = = . 4265). The fact that the mean is less than the design specification cutoff may be a consequence just of sampling variability. Notice that the histogram does not resemble at all a normal curve (and a normal probability plot does not exhibit a linear pattern), but the large-sample z tests do not require a normal population distribution. 42 Example 8.8 contβd 1. ο = true average DCP value 2. H0: ο = 30 3. Ha: ο < 30(so the pavement will not be used unless the null hypothesis is rejected) 4. 43 Example 8.8 contβd 5. A test with significance level .05 rejects H0 when z ο£ β1.645 (a lower-tailed test). 6. With n = 52, = 28.76, and s = 12.2647, 7. Since β.73 > β1.645, H0 cannot be rejected. We do not have compelling evidence for concluding that ο < 30; use of the pavement is not justified. 44 Large-Sample Tests Determination of ο’ and the necessary sample size for these large-sample tests can be based either on specifying a plausible value of ο³ and using the pervious formulas (even though s is used in the test) or on using the methodology to be introduced shortly in connection with the one-sample t tests discussed in Section 8.3 45