Download Chapter 7 Hypothesis Testing

Overview Definition Hypothesis in statistics, is a claim or statement about a property of a population 1 Null Hypothesis: H0  Statement about value of population parameter  Must contain condition of equality  =, , or   Test the Null Hypothesis directly  Reject H0 or fail to reject H0 2 Alternative Hypothesis: Ha  Must be true if H0 is false  , <, >  ‘opposite’ of Null 3 Note about Forming Your Own Claims (Hypotheses) If you are conducting a study and want to use a hypothesis test to support your claim, the claim must be worded so that it becomes the alternative hypothesis. 4 Legal Trial Hypothesis Test HO The defendant is not guilty Claim about a population parameter HA The defendant is guilty Opposing claim about a population parameter The evidence convinces the jury to reject the assumption of innocence. The verdict is guilty The statistic indicates a rejection of HO, and the alternate hypothesis is accepted. Result 5 Test Statistic a value computed from the sample data that is used in making the decision about the rejection of the null hypothesis 6 Test Statistic a value computed from the sample data that is used in making the decision about the rejection of the null hypothesis For large samples, testing claims about population means z= x - µx  n 7 Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis 8 Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Region 9 Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Region 10 Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Regions 11 Significance Level  denoted by   the probability that the test statistic will fall in the critical region when the null hypothesis is actually true.  common choices are 0.05, 0.01, and 0.10 12 Critical Value Value or values that separate the critical region (where we reject the null hypothesis) from the values of the test statistics that do not lead to a rejection of the null hypothesis 13 Critical Value Value or values that separate the critical region (where we reject the null hypothesis) from the values of the test statistics that do not lead to a rejection of the null hypothesis Critical Value ( z score ) 14 Critical Value Value or values that separate the critical region (where we reject the null hypothesis) from the values of the test statistics that do not lead to a rejection of the null hypothesis Reject H0 Fail to reject H0 Critical Value ( z score ) 15 Two-tailed,Right-tailed, Left-tailed Tests The tails in a distribution are the extreme regions bounded by critical values. 16 Two-tailed Test H0: µ = 100 Ha: µ  100 17 Two-tailed Test H0: µ = 100  is divided equally between Ha: µ  100 the two tails of the critical region 18 Two-tailed Test H0: µ = 100  is divided equally between Ha: µ  100 the two tails of the critical region Means less than or greater than 19 Two-tailed Test H0: µ = 100  is divided equally between Ha: µ  100 the two tails of the critical region Means less than or greater than Reject H0 Fail to reject H0 Reject H0 100 Values that differ significantly from 100 20 Right-tailed Test H0: µ  100 Ha: µ > 100 21 Right-tailed Test H0: µ  100 Ha: µ > 100 Points Right 22 Right-tailed Test H0: µ  100 Ha: µ > 100 Points Right Fail to reject H0 100 Reject H0 Values that differ significantly from 100 23 Left-tailed Test H0: µ  100 Ha: µ < 100 24 Left-tailed Test H0: µ  100 Ha: µ < 100 Points Left 25 Left-tailed Test H0: µ  100 Ha: µ < 100 Points Left Reject H0 Values that differ significantly from 100 Fail to reject H0 100 26 Conclusions in Hypothesis Testing always test the null hypothesis 1. Reject the H0 2. Fail to reject the H0 need to formulate correct wording of final conclusion 27 Wording of Final Conclusion Start Does the original claim contain the condition of equality Yes (Original claim contains equality and becomes H0) No Do you reject H0?. “There is sufficient evidence to warrant (Reject H0) rejection of the claim that. . . (original claim).” Yes No (Fail to reject H0) (Original claim does not contain equality and becomes Ha) Do you reject H0? Yes (Reject H0) “There is not sufficient evidence to warrant rejection of the claim that. . . (original claim).” “The sample data supports the claim that . . . (original claim).” No (Fail to reject H0) (This is the only case in which the original claim is rejected). (This is the only case in which the original claim is supported). “There is not sufficient evidence to support the claim that. . . (original claim).” 28 Accept versus Fail to Reject some texts use “accept the null hypothesis we are not proving the null hypothesis sample evidence is not strong enough to warrant rejection (such as not enough evidence to convict a suspect) 29 Definition Power of a Hypothesis Test is the probability (1 - ) of rejecting a false null hypothesis, which is computed by using a particular significance level  and a particular value of the mean that is an alternative to the value assumed true in the null hypothesis. 30 Assumptions for testing claims about population means 1) The sample is a simple random sample. 2) The sample is large (n > 30). a) Central limit theorem applies b) Can use normal distribution 3) If  is unknown, we can use sample standard deviation s as estimate for . 31 Traditional (or Classical) Method of Testing Hypotheses Goal Identify a sample result that is significantly different from the claimed value 32 The traditional (or classical) method of hypothesis testing converts the relevant sample statistic into a test statistic which we compare to the critical value. 33 Hypotheses Testing 5 Step Process 1. State the hypotheses 2. Decide on a model. 3. Determine the endpoints of the rejection region and state the decision rule. 4. Compute the test statistic 5. State the conclusion 34 Test Statistic for Claims about µ when n > 30 z= x - µx  n Test Statistic for Claims about µ when n < 30 t= x - µx  n 35 Decision Criterion Reject the null hypothesis if the test statistic is in the critical region Fail to reject the null hypothesis if the test statistic is not in the critical region 36 Wording of Final Conclusion FIGURE 7-4 Start Does the original claim contain the condition of equality Yes (Original claim contains equality and becomes H0) No Do you reject H0?. “There is sufficient evidence to warrant (Reject H0) rejection of the claim that. . . (original claim).” Yes No (Fail to reject H0) (Original claim does not contain equality and becomes Ha) Do you reject H0? Yes (Reject H0) “There is not sufficient evidence to warrant rejection of the claim that. . . (original claim).” “The sample data supports the claim that . . . (original claim).” No (Fail to reject H0) (This is the only case in which the original claim is rejected). (This is the only case in which the original claim is supported). “There is not sufficient evidence to support the claim that. . . (original claim).” 37 Example: Given a data set of 106 healthy body temperatures, where the mean was 98.2o and s = 0.62o , at the 0.05 significance level, test the claim that the mean body temperature of all healthy adults is equal to 98.6o. 38 Example: Given a data set of 106 healthy body temperatures, where the mean was 98.2o and s = 0.62o , at the 0.05 significance level, test the claim that the mean body temperature of all healthy adults is equal to 98.6o. Steps: 1) 2) State the hypotheses H0 :  = 98.6o Ha :   98.6o Determine the model Two tail Z test, n > 30 39 3) Determine the Rejection Region  = 0.05 /2 = 0.025 (two tailed test) 0.4750 0.025 z = - 1.96 0.4750 0.025 1.96 40 4) Compute the test statistic z = x-µ =  n 98.2 - 98.6 0.62 106 = - 6.64 41 5) State the Conclusion Sample data: x = 98.2o or z = - 6.64 Reject H0: µ = 98.6 Fail to Reject H0: µ = 98.6 z = - 1.96 µ = 98.6 Reject H0: µ = 98.6 z = 1.96 or z = 0 z = - 6.64 REJECT H0 There is sufficient evidence to warrant rejection of claim that the mean body temperatures of healthy adults is equal to 98.6o. 42 Assumptions for testing claims about population means 1) The sample is a simple random sample. 2) The sample is small (n  30). 3) The value of the population standard deviation  is unknown. 4) The sample values come from a population with a distribution that is approximately normal. 43 Test Statistic for a Student t-distribution x -µx t= s n Critical Values Found in Table A-3 Degrees of freedom (df) = n -1 Critical t values to the left of the mean are negative 44 Important Properties of the Student t Distribution 1. The Student t distribution is different for different sample sizes (see Figure 6-5 in Section 6-3). 2. The Student t distribution has the same general bell shape as the normal distribution; its wider shape reflects the greater variability that is expected with small samples. 3. The Student t distribution has a mean of t = 0 (just as the standard normal distribution has a mean of z = 0). 4. The standard deviation of the Student t distribution varies with the sample size and is greater than 1 (unlike the standard normal distribution, which has a  = 1). 5. As the sample size n gets larger, the Student t distribution get closer to the normal distribution. For values of n > 30, the differences are so small that we can use the critical z values instead of developing a much larger table of critical t values. (The values in the bottom row of Table A-3 are equal to the corresponding critical z values from the normal distributions.) 45 Figure 7-11 Choosing between the Normal and Student t-Distributions when Testing a Claim about a Population Mean µ Start Use normal distribution with Is n > 30 ? Yes Z (If  is unknown use s instead.) No Is the distribution of the population essentially normal ? (Use a histogram.) x - µx / n No Yes Is  known ? No Use nonparametric methods, which don’t require a normal distribution. Use normal distribution with Z x - µx / n (This case is rare.) Use the Student t distribution with x - µx t  s/ n 46 The larger Student t critical value shows that with a small sample, the sample evidence must be more extreme before we consider the difference is significant. 47 A company manufacturing rockets claims to use an average of 5500 lbs of rocket fuel for the first 15 seconds of operation. A sample of 6 engines are fired and the mean fuel consumption is 5690 lbs with a sample standard deviation of 250 lbs. Is the claim justified at the 5% level of significance? 1. HO: µ = 5500 HA: µ  5500 2. Two tail t test, n < 30, unknown population standard deviation -2.571 1.862 3. t critical for 5% for a two tail test with 5 d.f. is 2.571 4. t  2.571 x -μ 5690  5500   1.862 s n 250 6 5. Fail to reject HO, there is no evidence at the .05 level that the average fuel consumption is different from µ = 5500 lbs 48 P-Value Method Table A-3 includes only selected values of  Specific P-values usually cannot be found Use Table to identify limits that contain the P-value Some calculators and computer programs will find exact P-values 49 P-Value Method of Testing Hypotheses very similar to traditional method key difference is the way in which we decide to reject the null hypothesis approach finds the probability (P-value) of getting a result and rejects the null hypothesis if that probability is very low 50 P-Value Method of Testing Hypotheses Definition P-Value (or probability value) the probability that the test statistic is as far or farther from  if the null hypothesis is true The attained significance level of a hypothesis test is the P value of its test statistic 51 P-Value Method of Testing Hypotheses Guidelines for rejecting HO based on the P value: If P < , then reject HO and accept HA If P > , then reserve judgement about HO 52 P-value Interpretation Small P-values (such as 0.05 or lower) Unusual sample results. Significant difference from the null hypothesis Large P-values (such as above 0.05) Sample results are not unusual. Not a significant difference from the null hypothesis 53 Figure 7-8 Finding P-Values Start What type of test ? Left-tailed Right-tailed Two-tailed Left P-value = area to the left of the test statistic P-value µ Test statistic Is the test statistic to the right or left of center ? P-value = twice the area to the left of the test statistic P-value is twice this area µ Test statistic Right P-value = twice the area to the right of the test statistic P-value = area to the right of the test statistic P-value is twice this area P-value µ µ Test statistic Test statistic 54 Testing Claims with Confidence Intervals A confidence interval estimate of a population parameter contains the likely values of that parameter. We should therefore reject a claim that the population parameter has a value that is not included in the confidence interval. 55 Testing Claims with Confidence Intervals Claim: mean body temperature = 98.6º, where n = 106, x = 98.2º and s = 0.62º  95% confidence interval of 106 body temperature data (that is, 95% of samples would contain true value µ )  98.08º < µ < 98.32º  98.6º is not in this interval  Therefore it is very unlikely that µ = 98.6º  Thus we reject claim µ = 98.6º 56 Underlying Rationale of Hypotheses Testing  If, under a given observed assumption, the probability of getting the sample is exceptionally small, we conclude that the assumption is probably not correct.  When testing a claim, we make an assumption (null hypothesis) that contains equality. We then compare the assumption and the sample results and we form one of the following conclusions: 57 Underlying Rationale of Hypotheses Testing If the sample results can easily occur when the assumption (null hypothesis) is true, we attribute the relatively small discrepancy between the assumption and the sample results to chance. If the sample results cannot easily occur when that assumption (null hypothesis) is true, we explain the relatively large discrepancy between the assumption and the sample by concluding that the assumption is not true. 58 Type I Error The mistake of rejecting the null hypothesis when it is true.  (alpha) is used to represent the probability of a type I error Example: Rejecting a claim that the mean body temperature is 98.6 degrees when the mean really does equal 98.6 59 Type II Error the mistake of failing to reject the null hypothesis when it is false. ß (beta) is used to represent the probability of a type II error Example: Failing to reject the claim that the mean body temperature is 98.6 degrees when the mean is really different from 98.6 60 Type I and Type II Errors True State of Nature We decide to reject the null hypothesis The null hypothesis is true The null hypothesis is false Type I error (rejecting a true null hypothesis)  Correct decision Correct decision Type II error (rejecting a false null hypothesis)  Decision We fail to reject the null hypothesis 61 Controlling Type I and Type II Errors For any fixed , an increase in the sample size n will cause a decrease in  For any fixed sample size n , a decrease in  will cause an increase in . Conversely, an increase in  will cause a decrease in  . To decrease both  and , increase the sample size. 62

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Download Chapter 7 Hypothesis Testing