Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Presentation 11 More About Significance Tests Inference Techniques Confidence Intervals: 1. 1-Proportion 2. 1-Mean 3. Difference between 2-Proportions 4. Difference between 2-Means Hypothesis Tests: 1. 1-Proportion 2. 1-Mean 3. Difference between 2-Means 4. Difference between 2-Proportions 5. Difference between Paired Means. 6. Chi-Square (Relationship between 2 Categorical Variables) 7. Regression (Relationship between 2 Quantitative Variables) 8. ANOVA (Difference between 3 or More Means) 9. Median Tests General Steps for Hypothesis Tests Step 1: Determine the H0 and Ha. The alternative hypothesis, Ha, is the claim regarding the population parameter that we want to test. There are three possible Ha's - the parameter is not equal to a null value (two-sided), less than a null value (one-sided) or greater than a null value (one-sided). The null hypothesis, H0, claims that nothing is happening. H0 can be the opposite of Ha or just that the parameter is equal to the null value. Step 2: Verify necessary data conditions, if met summarize the data into an appropriate test statistic. The conditions to be verified are the same conditions we have seen in Chapter 12 for creating C.I's. The test statistic is a standardized statistic, i.e. is of the form Sample statistic - Null value Null s.e (sample statistic) Under H0 the test statistic follows either a normal distribution (proportion cases) or a t-distribution with some d.f (mean cases). General Steps for Hypothesis Tests Step 3: Find the p-value. The p-value is the probability that the test statistic is as large or larger in the direction(s) specified from Ha assuming that H0 is true. This, to get the p-value you need the form of Ha, the value of the test statistic, and the distribution of the test statistic under the H0. Step 4: Decide whether or not the result is statistically significant. The results are statistically significant if the p-value is less than alpha, where alpha is the significance level (usually 0.05). Based on the pvalue we have two possible conclusions If p-value < α we reject the null and we claim the alternative is true. If p-value > α we fail to reject the null, we claim that there isn't enough evidence to report Ha is true. In this case, we do not claim that the null is true! Step 5: Report the conclusion in the context of the situation. Finally, write one or two sentences explaining what is the conclusion in terms of the problem. One sample t-test (one mean or paired data) In this case we consider the following hypotheses: H a : µ µ0 H0 : µ = µ0 vs one of the following Ha : µ > µ0 Ha : µ < µ0 We have the following t-test statistic t Sample estimate - Null value x 0 s Null Std. Error n Assuming that H0 is true, the test statistic has a t-distribution with (n1) degrees of freedom, if one of the following conditions is true: 1. the random variable of interest is bell-shaped (in practice, for small samples the data should show no extreme skewness or outliers). 2. the random variable is not bell-shaped, but a large random sample is measured, n ≥ 30. One sample t-test (one mean or paired data) If we denote with T a random variable with t-distribution with (n-1) d.f, and t is the value of out test statistic, then Ha µ < µ0 µ > µ0 µ µ0 p-value P( T < t ) P( T > t ) 2P( T > |t| ) We can find the p-value using the t-table (table A3), and draw a conclusion about the hypotheses in the usual manner. Two-Sample t-test (difference between two means). There are two populations (Population 1 and 2) of interest having unknown means µ1 and µ2 respectively. In this case we consider the following hypotheses: H a : µ1 - µ2 < 0 H0 : µ1 - µ2 = 0 (i.e. no difference) vs one of the following Ha : µ1 - µ2 > 0 H a : µ1 - µ2 0 We have the following t-test statistic (x x ) 0 t 1 2 2 2 s1 s2 n1 n2 Assuming H0 is true, the test statistic has a t-distribution with approximate d.f. equal to the minimum of n1-1 and n2-1 if the following conditions are true: 1. 2. The two samples are independent. Each sample is either coming from a bell shaped population or the sample size is ≥30. Using the appropriate d.f, we can get the p-value from table A.3 in the same way as in the one-mean case. Difference between two proportions Now we will consider two populations having some unknown proportions of interest, p1 and p2 respectively. In this case we consider the following hypotheses: Ha : p1 - p2 < 0 H0 : p1 - p2 = 0 (i.e. no difference) vs one of the following Ha : p1 - p2 > 0 Ha : p1 - p2 0 ˆ1 pˆ 2 . To get the null standard The sample statistic we will use is p error of the statistic we assume that H0 is true, so let p1 = p2 =p. Thus, instead of using p̂1 and p̂2 in the s.e formula, we will substitute them with an estimate of the common population proportion. That is, we use the proportion in all available data n1 pˆ 1 n2 pˆ 2 pˆ n1 n2 and we have the null s.e. pˆ (1 pˆ ) pˆ (1 pˆ ) n1 n2 Difference between two proportions Thus, we have the following t-test statistic z pˆ1 pˆ 2 pˆ (1 pˆ ) pˆ (1 pˆ ) n1 n2 pˆ1 pˆ 2 1 1 pˆ (1 pˆ ) n1 n2 Assuming H0 is true, the test statistic has a standard normal distribution if the following conditions are true: 1. The two samples are independent. 2. All the quantities n1 pˆ 1, n1(1 pˆ 1), n 2 pˆ 2 and n 2(1 pˆ 2) are at least 5 and preferably at least 10. We can obtain the p-value using the standard normal table A1. If we denote with Z a random variable with standard normal distribution, and z is the value of out test statistic, then Ha p-value p 1 - p2 < 0 P( Z < z ) p 1 - p2 > 0 P( Z > z ) p1 - p2 0 2P( Z > |z| ) Table For Hypothesis Testing Type One Mean (or Paired mean) . = Difference Between Means One Proportion Parameter Statistic µ or µd x or d µ1- µ2 x1 x 2 p p̂ Null Std. Error s or n sd n s12 s 2 2 n1 n2 po(1 po) n pˆ 1 pˆ 2 pˆ (1 pˆ ) pˆ (1 pˆ ) n1 n2 Test Statistic t df = n-1 t df=min(n1-1,n2-1) z Difference Between Proportions p1-p2 n1 pˆ 1 n2 pˆ 2 p̂ = n1 n2 This is the overall proportion-like if we had one big sample! pˆ 1 pˆ 2 z Detailed Example 1 A box of corn flakes is advertised as containing 16 oz. of cereal. John works for consumer reports and is interesting in verifying the companies claim. He suspects that the average amount of cereal is less than 16oz. He takes a random sample of 100 boxes of cereal and records the weight of each box. The sample mean is 15.7oz and the sample standard deviation is 1.2oz. Example 1: Overview Before you carry out the hypothesis test it is a good idea to label the information you have. Since there is 1 quantitative variable we are doing a hypothesis test of 1-mean. The parameter of interest is µ, the mean weight of cereal in all corn flakes boxes. The statistic of interest is x = 15.7oz, the sample mean weight. Other information: The sample size n = 100, and the sample standard deviation s = 1.2oz Example 1: Hypotheses and Conditions 1. Define the null and alternative hypotheses: Ho: µ = 16oz Ha: μ < 16oz 2. Check conditions: Either A or B A) The distribution of weights is normal. B) The sample size is ≥ 30. Example 1: Test Statistic and P-Value 3. Calculate the test statistic. Test Statistic Sample Statistic Null Value Null Standard Error Null Value = 16, Sample Statistic = 15.7 s Null Std. Error = = 1.2/10 = .12 n Test Statistic = 15.7 16 2.5 .12 P-Value = P(T<-2.5) Use Table A.3! degrees of freedom = df = n-1 = 100-1 = 99 Example 1: P-value & Conclusion T-Distribution, df = 99 Density 0.2 0.1 P-Value = P(T<-2.5) 0.0 Look up the absolute value of the test statistic and df in the table. If they do not have the exact test statistic, then take your best guess OR use the next smallest test statistic. Using the next larger and next smallest value for tstatistic on the table you can get an interval for the p-value. 0.3 Table A.3 gives the one-sided p-values for t-tests. For 2-sided hypothesis tests make sure you multiply the p-value by 2! -4 -2 0 2 4 T In our case, df = 99 and t-statistic = -2.5. Since 2.5 is not in the table we can use 2.33, and 90 df (since 99 is not in the table). We get that the pvalue is less than .011. The p-value is < .05 so we REJECT the null hypothesis and we conclude that the average weight IS less than 16oz. Detailed Example 2 A potential presidential candidate in 2002 election wished to know if men and women had equal preferences for her candidacy versus this not being the case (she suspected that men were more likely to favor her). Her pollster polled a random sample of 400 men and 400 women, with the following results. Preference: Would vote for her? No Yes Total Men 164 236 400 Women 180 220 400 Total 344 556 800 Example 2: Overview Here there are 2 categorical variables, both with 2 levels so we are interested in a test of 2-proportions! Is the proportion of men who favor the candidate greater than the proportion of women who favor the candidate? The parameter of interest is pm-pw. The sample statistic of interest is pˆ m pˆ w =.59-.55 = .04 Example 2: 1. Calculate the null and alternative hypotheses: Ho: pm = pf or pm-pf = 0 Ha: pm > pf or pm-pf > 0 Let’s denote with pm with p1 and pf with p2. 2. Check the conditions: All quantities, n1 pˆ 1, n1(1 pˆ 1) and are greater than or equal to 10. n 2 pˆ 2, n 2(1 pˆ 2) Example 2: Test Statistic and P-Value 3. Calculate the test statistic Test Statistic z Sample Statistic Null Value Null Standard Error Null Value 0 (since the paramete is p1 p2 ) Statistic pˆ 1 pˆ 2 236/400 – 220/400 0.04 n1 pˆ 1 n2 pˆ 2 236 220 pˆ 0.57 n1 n2 80 Null Std. Error pˆ (1 pˆ )1 n1 1 n2 .57(1 .57)1 400 1 400 0.035 Test Statistic ( pˆ 1 pˆ 2 ) 0 .04 1.14 pˆ (1 pˆ ) pˆ (1 pˆ ) .035 n1 n2 Example 2: P-Value and Conclusion P-Value = P(Z>1.14) = 1-P(Z<1.14) = 1-0.8729 = 0.127 P-Value is >.05 so we can NOT reject the null hypothesis. Therefore, we do NOT have enough evidence to conclude that men are more likely to favor the candidate. MINITAB Output for Example 2 Test and CI for Two Proportions Sample Men Women X 236 220 N Sample p 400 0.590000 400 0.550000 Estimate for p(1) - p(2): 0.04 Test for p(1) - p(2) = 0 (vs > 0): Z = 1.14 P-Value = 0.126 CI’s and Two-sided Alternatives When testing the hypotheses H0: parameter = null value vs. Ha: parameter null value If the null value is covered by a (1 - ) CI, the null hypothesis is not rejected and the test is not statistically significant at level . If the null value is not covered by a (1 - ) CI, the null hypothesis is rejected and the test is statistically significant at level . For instance, for a 95% Confidence Interval, (1- ) = 0.95 = 95%. So for 95% confidence, the significance level is = 0.05, which is the significance level and confidence interval used most frequently. Problem 13.38 in the Text Each of the following presents a 95% CI and the alternative hypothesis of a corresponding hypothesis test. In each case, state a conclusion for the test, including the level of significance you are using. A) CI for µ is (101 to 105), Ha: µ ≠1 00 B) CI for p is (.12 to .28), Ha: p<.10 C) CI for µ1-µ2 is (3 to 15), Ha: µ1-µ2 > 0 D) CI for p1-p2 is (-.15 to .07), Ha: p1-p2 ≠ 0 Possible Errors, Power and Sample Size Considerations When we are testing a hypothesis two out four possible decisions lead to an error. Type I error - We reject H0 when it is true Type II error- We fail to reject H0 when Ha is true. There is an inverse relationship between the probabilities of the two types of errors. Increase in the probability of type I error leads to decrease in the probability of type II error and vice versa. When the alternative hypothesis is true, the probability of making the correct decision is called power of the test. The power increases when the sample size increased. Can you see why? The power increases when the difference between the true population parameter value and the null value increases. Can you see why? The hypothesis test may have very low power because of small data set. We should also be careful in cases our conclusions are based on extremely large samples, since in cases like that even a weak relationship (or a small difference) can be statistically significant.