Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chapter 10 Section 2 Hypothesis Tests for a Population Mean Assuming the Population Standard Deviation is Known Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 1 of 55 Chapter 10 – Section 2 ● Learning objectives 1 2 3 4 5 Understand the logic of hypothesis testing Test hypotheses about a population mean with σ known using the classical approach Test hypotheses about a population mean with σ known using P-values Test hypotheses about a population mean with σ known using confidence intervals Understand the difference between statistical significance and practical significance Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 2 of 55 Chapter 10 – Section 2 ● Learning objectives 1 2 3 4 5 Understand the logic of hypothesis testing Test hypotheses about a population mean with σ known using the classical approach Test hypotheses about a population mean with σ known using P-values Test hypotheses about a population mean with σ known using confidence intervals Understand the difference between statistical significance and practical significance Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 3 of 55 Chapter 10 – Section 2 ● We have the outline of a hypothesis test, just not the detailed implementation ● How do we quantify “unlikely”? ● How do we calculate Type I and Type II errors? ● What is the exact procedure to get to a do not reject / reject conclusion? Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 4 of 55 Chapter 10 – Section 2 ● There are three equivalent ways to perform a hypothesis test ● They will reach the same conclusion ● The methods The classical approach The P-value approach The confidence interval approach Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 5 of 55 Chapter 10 – Section 2 ● The classical approach If the sample value is too many standard deviations away, then it must be too unlikely ● The P-value approach If the probability of the sample value being that far away is small, then it must be too unlikely ● The confidence interval approach If we are not sufficiently confident that the parameter is likely enough, then it must be too unlikely ● Don’t worry … we’ll be explaining more Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 6 of 55 Chapter 10 – Section 2 ● The three methods all begin the same way We have a null hypothesis, that the actual mean is equal to a value μ0 We have an alternative hypothesis ● The three methods all set up a criterion A criterion that quantifies “unlikely” That the actual mean is unlikely to be equal to μ0 A criterion that determines what would be a do not reject and what would be a reject Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 7 of 55 Chapter 10 – Section 2 ● The three methods all need information We run an experiment We collect the data We calculate the sample mean ● The three methods all make the same assumptions to be able to make the statistical calculations That the sample is a simple random sample That the sample mean has a normal distribution Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 8 of 55 Chapter 10 – Section 2 ● In this section we assume that the population mean σ is known (as in section 9.1) ● We can apply our techniques if either The population has a normal distribution Our sample size n is large (n ≥ 30) ● In those cases, the distribution of the sample mean x is normal with mean μ and standard deviation σ / √ n Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 9 of 55 Chapter 10 – Section 2 ● The three methods all compare the observed results with the previous criterion Classical – how many standard deviations P-value – the size of the probability Confidence interval – inside or outside the interval ● If the results are unlikely based on these criterion, then we say that the result is statistically significant Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 10 of 55 Chapter 10 – Section 2 ● The three methods all conclude similarly We do not reject the null hypothesis, or We reject the null hypothesis ● We reject the null hypothesis when the result is statistically significant Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 11 of 55 Chapter 10 – Section 2 ● We now will cover how each of the Classical P-value, and Confidence interval approaches will show us how to conclude whether the result is statistically significant or not Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 12 of 55 Chapter 10 – Section 2 ● Learning objectives 1 2 3 4 5 Understand the logic of hypothesis testing Test hypotheses about a population mean with σ known using the classical approach Test hypotheses about a population mean with σ known using P-values Test hypotheses about a population mean with σ known using confidence intervals Understand the difference between statistical significance and practical significance Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 13 of 55 Chapter 10 – Section 2 ● The classical approach ● We compare the sample mean x to the hypothesized population mean μ0 Measure the difference in units of standard deviations A lot of standard deviations is far … few standard deviations is not far Just like using a general normal distribution Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 14 of 55 Chapter 10 – Section 2 ● How far is too far? ● For example, we can use α = 0.05 as the level of significance ● “Unlikely” means that this difference occurs with probability α = 0.05 of the time, or less ● This concept applies to two-tailed tests, lefttailed tests, and right-tailed tests Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 15 of 55 Chapter 10 – Section 2 ● For two-tailed tests The least likely 5% is the lowest 2.5% and highest 2.5% (below –1.96 and above +1.96 standard deviations) … –1.96 and +1.96 are the critical values The region outside this is the rejection region Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 16 of 55 Chapter 10 – Section 2 ● For left-tailed tests The least likely 5% is the lowest 5% (below –1.645 standard deviations) … –1.645 is the critical value The region less than this is the rejection region Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 17 of 55 Chapter 10 – Section 2 ● For right-tailed tests The least likely 5% is the highest 5% (above 1.645 standard deviations) … +1.645 is the critical value The region greater than this is the rejection region Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 18 of 55 Chapter 10 – Section 2 ● An example of a two-tailed test ● A bolt manufacturer claims that the diameter of the bolts average 10.0 mm H0: Diameter = 10.0 H1: Diameter ≠ 10.0 ● We take a sample of size 40 (Somehow) We know that the standard deviation of the population is 0.3 mm The sample mean is 10.12 mm We’ll use a level of significance α = 0.05 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 19 of 55 Chapter 10 – Section 2 ● Do we reject the null hypothesis? 10.12 is 0.12 higher than 10.0 The standard error is (0.3 / √ 40) = 0.047 The test statistic is 2.53 The critical normal value, for α/2 = 0.025, is 1.96 2.53 is more than 1.96 ● Our conclusion We reject the null hypothesis We have sufficient evidence that the population mean diameter is not 10.0 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 20 of 55 Chapter 10 – Section 2 ● An example of a left-tailed test ● A car manufacturer claims that the mpg of a certain model car is at least 29.0 H0: MPG = 29.0 H1: MPG < 29.0 ● We take a sample of size 40 (Somehow) We know that the standard deviation of the population is 0.5 The sample mean mpg is 28.89 We’ll use a level of significance α = 0.05 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 21 of 55 Chapter 10 – Section 2 ● Do we reject the null hypothesis? 28.89 is 0.11 lower than 29.0 The standard error is (0.5 / √ 40) = 0.079 The test statistic is -1.39 -1.39 is greater than -1.645, the left-tailed critical value for α = 0.05 ● Our conclusion We do not reject the null hypothesis We have insufficient evidence that the population mean mpg is less than 29.0 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 22 of 55 Chapter 10 – Section 2 ● An example of a right-tailed test ● A bolt manufacturer claims that the defective rate of their product is at most 1.70 per 1,000 H0: Defect Rate = 1.70 H1: Defect Rate > 1.70 ● We take a sample of size 40 (Somehow) We know that the standard deviation of the population is .06 The sample defect rate is 1.78 We’ll use a level of significance α = 0.05 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 23 of 55 Chapter 10 – Section 2 ● Do we reject the null hypothesis? 1.78 is 0.08 higher than 1.70 The standard error is (0.06 / √ 40) = 0.009 The test statistic is 8.43 8.43 is more than 1.645, the right-tailed critical value for α = 0.05 ● Our conclusion We reject the null hypothesis We have sufficient evidence that the population mean rate is more than 1.70 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 24 of 55 Chapter 10 – Section 2 ● Two-tailed test The critical values are zα/2 and –zα/2 The rejection region is {less than –zα/2} and {greater than z1-α/2} ● Left-tailed test The critical value is –zα The rejection region is {less than –zα} ● Right-tailed test The critical value is zα The rejection region is {greater than zα} Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 25 of 55 Chapter 10 – Section 2 ● The difference is x 0 ● In units of standard deviations, this is z0 x 0 / n ● This is called the test statistic ● If the test statistic is in the rejection region – we reject Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 26 of 55 Chapter 10 – Section 2 ● The general picture for a level of significance α Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 27 of 55 Chapter 10 – Section 2 ● Learning objectives 1 2 3 4 5 Understand the logic of hypothesis testing Test hypotheses about a population mean with σ known using the classical approach Test hypotheses about a population mean with σ known using P-values Test hypotheses about a population mean with σ known using confidence intervals Understand the difference between statistical significance and practical significance Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 28 of 55 Chapter 10 – Section 2 ● The P-value is the probability of observing a sample mean that is as or more extreme than the observed ● The probability is calculated assuming that the null hypothesis is true ● We use the P-value to quantify how unlikely the sample mean is Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 29 of 55 Chapter 10 – Section 2 ● Just like in the classical approach, we calculate the test statistic x 0 z0 / n ● We then calculate the P-value, the probability that the sample mean would be this, or more extreme, if the null hypothesis was true ● The two-tailed, left-tailed, and right-tailed calculations are slightly different Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 30 of 55 Chapter 10 – Section 2 ● For the two-tailed test, the “unlikely” region are values that are too high and too low ● Small P-values corresponds to situations where it is unlikely to be this far away Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 31 of 55 Chapter 10 – Section 2 ● For the left-tailed test, the “unlikely” region are values that are too low ● Small P-values corresponds to situations where it is unlikely to be this low Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 32 of 55 Chapter 10 – Section 2 ● For the right-tailed test, the “unlikely” region are values that are too high ● Small P-values corresponds to situations where it is unlikely to be this high Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 33 of 55 Chapter 10 – Section 2 ● For all three models (two-tailed, left-tailed, righttailed) The larger P-values mean that the difference is not relatively large … that it’s not an unlikely event The smaller P-values mean that the difference is relatively large … that it’s an unlikely event Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 34 of 55 Chapter 10 – Section 2 ● Larger P-values A P-value of 0.30, for example, means that this value, or more extreme, could happen 30% of the time 30% of the time is not unusual ● Smaller P-values A P-value of 0.01, for example, means that this value, or more extreme, could happen only 1% of the time 1% of the time is unusual Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 35 of 55 Chapter 10 – Section 2 ● The decision rule is ● For a significance level α Do not reject the null hypothesis if the P-value is greater than α Reject the null hypothesis if the P-value is less than α ● For example, if α = 0.05 A P-value of 0.30 is likely enough, compared to a criterion of 0.05 A P-value of 0.01 is unlikely, compared to a criterion of 0.05 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 36 of 55 Chapter 10 – Section 2 ● An example of a two-tailed test ● A bolt manufacturer claims that the diameter of the bolts average 10.0 mm H0: Diameter = 10.0 H1: Diameter ≠ 10.0 ● We take a sample of size 40 (Somehow) We know that the standard deviation of the population is 0.3 mm The sample mean is 10.12 mm We’ll use a level of significance α = 0.05 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 37 of 55 Chapter 10 – Section 2 ● Do we reject the null hypothesis? 10.12 is 0.12 higher than 10.0 The standard error is (0.3 / √ 40) = 0.047 The test statistic is 2.53 The 2-sided P-value of 2.53 is 0.01 < 0.05 = α ● Our conclusion We reject the null hypothesis We have sufficient evidence that the population mean diameter is not 10.0 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 38 of 55 Chapter 10 – Section 2 ● An example of a left-tailed test ● A car manufacturer claims that the mpg of a certain model car is at least 29.0 H0: MPG = 29.0 H1: MPG < 29.0 ● We take a sample of size 40 (Somehow) We know that the standard deviation of the population is 0.5 The sample mean mpg is 28.89 We’ll use a level of significance α = 0.05 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 39 of 55 Chapter 10 – Section 2 ● Do we reject the null hypothesis? 28.89 is 0.11 lower than 29.0 The standard error is (0.5 / √ 40) = 0.079 The test statistic is -1.39 The 1-sided P-value of -1.39 is 0.08 > 0.05 = α ● Our conclusion We do not reject the null hypothesis We have insufficient evidence that the population mean mpg is less than 29.0 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 40 of 55 Chapter 10 – Section 2 ● An example of a right-tailed test ● A bolt manufacturer claims that the defective rate of their product is at most 1.70 per 1,000 H0: Defect Rate = 1.70 H1: Defect Rate > 1.70 ● We take a sample of size 40 (Somehow) We know that the standard deviation of the population is .06 The sample defect rate is 1.78 We’ll use a level of significance α = 0.05 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 41 of 55 Chapter 10 – Section 2 ● Do we reject the null hypothesis? 1.78 is 0.08 higher than 1.70 The standard error is (0.06 / √ 40) = 0.009 The test statistic is 8.43 The 1-sided P-value of 8.43 is extremely small ● Our conclusion We reject the null hypothesis We have sufficient evidence that the population mean rate is more than 1.70 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 42 of 55 Chapter 10 – Section 2 ● Compare the rejection regions for the classical approach and the P-value approach ● They are the same Classical P-Value Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 43 of 55 Chapter 10 – Section 2 ● Learning objectives 1 2 3 4 5 Understand the logic of hypothesis testing Test hypotheses about a population mean with σ known using the classical approach Test hypotheses about a population mean with σ known using P-values Test hypotheses about a population mean with σ known using confidence intervals Understand the difference between statistical significance and practical significance Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 44 of 55 Chapter 10 – Section 2 ● The confidence interval approach yields the same result as the classical approach and as the P-value approach ● We compare A hypothesis test with a level of significance α to A confidence interval with confidence (1 – α) •100% ● These are the same α’s Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 45 of 55 Chapter 10 – Section 2 ● The relationship is Not rejecting the hypothesis μ0 is inside the Confidence interval Rejecting the hypothesis μ0 is outside the Confidence interval ● The hypothesis test calculation and the confidence interval calculation are very similar Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 46 of 55 Chapter 10 – Section 2 ● An example of a two-tailed test ● A bolt manufacturer claims that the diameter of the bolts average 10.0 mm H0: Diameter = 10.0 H1: Diameter ≠ 10.0 ● We take a sample of size 40 (Somehow) We know that the standard deviation of this measurement is 0.3 mm The sample mean is 10.12 mm We’ll use a level of significance α = 0.05 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 47 of 55 Chapter 10 – Section 2 ● Do we reject the null hypothesis? 10.12 is 0.12 higher than 10.0 The standard error is (0.3 / √ 40) = 0.047 The confidence interval is 10.12 ± 1.96 • 0.047, or 10.03 to 10.21 10.0 is outside (10.03, 10.21) ● Our conclusion We reject the null hypothesis We have sufficient evidence that the population mean diameter is not 10.0 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 48 of 55 Chapter 10 – Section 2 ● An example of a left-tailed test ● A car manufacturer claims that the mpg of a certain model car is at least 29.0 H0: MPG = 29.0 H1: MPG < 29.0 ● We take a sample of size 40 (Somehow) We know that the standard deviation of the population is 0.5 The sample mean mpg is 28.89 We’ll use a level of significance α = 0.05 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 49 of 55 Chapter 10 – Section 2 ● Do we reject the null hypothesis? 28.89 is 0.11 lower than 29.0 The standard error is (0.5 / √ 40) = 0.079 The confidence interval limit is 28.89 + 1.645 • 0.079, or 29.02 29.0 is inside (–∞, 29.02) ● Our conclusion We do not reject the null hypothesis We have insufficient evidence that the population mean mpg is less than 29.0 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 50 of 55 Chapter 10 – Section 2 ● An example of a right-tailed test ● A bolt manufacturer claims that the defective rate of their product is at most 1.70 per 1,000 H0: Defect Rate = 1.70 H1: Defect Rate > 1.70 ● We take a sample of size 40 (Somehow) We know that the standard deviation of the population is .06 The sample defect rate is 1.78 We’ll use a level of significance α = 0.05 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 51 of 55 Chapter 10 – Section 2 ● Do we reject the null hypothesis? 1.78 is 0.08 higher than 1.70 The standard error is (0.06 / √ 40) = 0.009 The confidence interval limit is 1.78 – 1.645 • 0.009, or 1.76 1.70 is outside (1.76, ∞) ● Our conclusion We reject the null hypothesis We have sufficient evidence that the population mean rate is more than 1.70 Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 52 of 55 Chapter 10 – Section 2 ● Learning objectives 1 2 3 4 5 Understand the logic of hypothesis testing Test hypotheses about a population mean with σ known using the classical approach Test hypotheses about a population mean with σ known using P-values Test hypotheses about a population mean with σ known using confidence intervals Understand the difference between statistical significance and practical significance Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 53 of 55 Chapter 10 – Section 2 ● A significant statistical difference is one where the hypothesis test, for equality, is rejected ● A statistical significance does not necessarily mean that it is practically significant ● If we have a large sample size, we will be able to pinpoint the rejectable values of the population mean ● Our analysis may be unnecessarily precise Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 54 of 55 Summary: Chapter 10 – Section 2 ● A hypothesis test of means compares whether the true mean is either Equal to, or not equal to, μ0 Equal to, or less than, μ0 Equal to, or more than, μ0 ● There are three equivalent methods of performing the hypothesis test The classical approach The P-value approach The confidence interval approach Sullivan – Fundamentals of Statistics – 2nd Edition – Chapter 10 Section 2 – Slide 55 of 55