Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chapter 9 Inferring Population Means Copyright © 2017, 2014 Pearson Education, Inc. Slide 1 Chapter 9 Topics • Apply the Central Limit Theorem for sample means • Make inferences about population means using confidence intervals and hypothesis tests Copyright © 2017, 2014 Pearson Education, Inc. Slide 2 Section 9.1 Studio 8. Pearson Education Ltd SAMPLE MEANS OF RANDOM SAMPLES • Discuss the Accuracy and Precision of the Sample Mean as an Estimate of the Population Mean • Find the Standard Error of the Sampling Distribution of the Sample Mean Copyright © 2017, 2014 Pearson Education, Inc. Slide 3 Three Characteristics of the Sample Mean 1. Accuracy 2. Precision 3. Probability distribution Notation Review: μ = population mean x = sample mean σ = population standard deviation s = sample standard deviation Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 44 Accuracy and Precision of the Sample Mean The sample mean is unbiased when estimating the population mean – on average, the sample mean is the same as the population mean. The precision of the sample mean depends on the variability in the population. The more observations we collect, the more precise the sample mean becomes. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 55 Sampling Distribution of the Sample Mean The sampling distribution can be thought of as the distribution of all possible sample means that would result from drawing repeated samples of a certain size from the population. Reminder: The standard deviation of the sampling distribution is called the standard error. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 66 Effect of Sample Size on the Sampling Distribution As the sample size increases the standard error decreases but the mean remains the same. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 77 Accuracy and Bias of the Sample Mean For all populations, the sample mean, if based on a random sample, is unbiased when estimating the population mean. The standard error of the sample mean is s . n As the sample size increases, the sample mean becomes more precise. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 88 Example: Playlists A student has a large digital music library. The mean length of the songs is 258 seconds with a standard deviation of 87 seconds. The student creates a playlist that consists of 30 randomly selected songs. a. Is the mean value of 258 seconds a parameter or a statistic? Explain. b. What should the student expect the average song length to be for his playlist? Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 99 Example: Playlists c. What is the standard error for the mean song length of 30 randomly selected songs? d. Would a playlist of 50 songs have a standard error that is greater than or less than your answer to part c? Explain. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 10 10 Example: Playlists a. The mean value of 258 seconds is a parameter because it is the mean of the population (all songs in the digital music library) b. The student should expect that the average song length of the playlist is 258 seconds (sample mean is typically the same as the population mean) Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 11 11 Example: Playlists c. Standard error = s 87 = = 15.88 seconds n 30 d. If the playlist had 50 songs, the standard error would be smaller because the standard error decreases as the sample size increases. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 12 12 Section 9.2 sonya etchison. Shutterstock CENTRAL LIMIT THEOREM FOR SAMPLE MEANS • Apply the Central Limit Theorem for Sample Means to Calculate Probabilities • Describe the Features of the t-distribution Copyright © 2017, 2014 Pearson Education, Inc. Slide 13 Central Limit Theorem for Sample Means If certain conditions are met, the Central Limit Theorem assures us that the distribution of sample means follows an approximately Normal distribution no matter what the shape of the population distribution. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 14 14 Conditions to Check When determining whether you can apply the Central Limit Theorem to analyze data, consider these three conditions: 1. Random Sample and Independence. Each observation is collected randomly from the population and observations are independent of each other. 2. Large sample. Either the population distribution is Normal or the sample size is large (usually 25 is large enough). 3. Big population. If the sample is collected without replacement then the population must be at least 10 times larger than the sample size. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 15 15 Central Limit Theorem If the three conditions are met, then the sampling distribution of x is approximately æ s ö N ç m, ÷ è nø The larger the sample size the better the approximation. If the population is normal to begin with then the sampling distribution is exactly a Normal distribution, regardless of the sample size. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 16 16 Visualizing Distributions of the Sample Mean The figure below shows the distribution of in-state tuition and fees for all two-year colleges in the United States for the 2012-2013 academic year. Note that it is skewed and multimodal. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 17 17 Visualizing Distributions of the Sample Mean The figures below show the distribution of the sample mean for samples of size 30 and size 90 drawn from the skewed multimodal distribution on the previous slide. Note the shape of each is approximately Normal and the standard error decreases as the sample size increases. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 18 18 Example: Weights of 10-Year-Old Boys According to data from the National Health and Nutrition Exam Survey, the mean weight of 10-year-old boys is 88.3 pounds with a standard deviation of 2.06 pounds. Assume the distribution of weights is Normal. a. Suppose we take a random sample of 30 boys from this population. Can we find the approximate probability that the average weight of this sample will be above 89 pounds? If so, find it. If not, explain why. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 19 19 Example: Weights of 10-Year-Old Boys b. Suppose we take a random sample of 10 boys from this population. Can we find the approximate probability that the average weight of this sample will be above 89 pounds? If so, find it. If not, explain why not. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 20 20 Example: Weights of 10-Year-Old Boys a. Suppose we take a random sample of 30 boys from this population. Can we find the approximate probability that the average weight of this sample will be above 89 pounds? If so, find it. If not, explain why. We have a random sample, the population is at least 10 times larger than the sample, and the population is Normal so we can apply the Central Limit Theorem. The distribution of sample means is æ 2.06 ö N ç88.3, ÷ è 30 ø We can use technology to find the probability that the sample mean is greater than 89 pounds. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 21 21 The standard error = s 2.06 = = 0.3761 n 30 The probability that the mean of the sample is greater than 89 pounds is 0.03. Copyright © 2017, 2014 Pearson Education, Inc. Slide 22 Example: Weights of 10-Year-Old Boys b. Suppose we take a random sample of 10 boys from this population. Can we find the approximate probability that the average weight of this sample will be above 89 pounds? If so, find it. If not, explain why not. We have a random sample, the population is at least 10 times larger than the sample, and the population is Normal. So, we can apply the Central Limit Theorem even though the sample size is less than 25. The distribution of sample means is 2.06 N 88.3, . 10 Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 23 23 The standard error is 2.06 n 0.6514. 10 The probability the sample mean is greater than 89 pounds is 0.1413. Note that this probability is different than our answer to part (a) due to the larger standard error in the distribution of samples of size 10. Copyright © 2017, 2014 Pearson Education, Inc. Slide 24 Example: Home Prices Home prices in a certain community have a distribution that is skewed right. The mean of the home prices is $498,000 with a standard deviation of $25,200. a. Suppose we take a random sample of 30 homes in this community. What is the probability that the mean of this sample is between $500,000 and $510,000? b. Suppose we take a random sample of 10 homes in this community. Can we find the approximate probability that the mean of the sample is more than $510,000? If so, find it. If not, explain why not. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 25 25 Example: Home Prices a. Suppose we take a random sample of 30 homes in this community. What is the probability that the mean of this sample is between $500,000 and $510,000? We have a random sample. The sample is large (greater than 25) and the population is large (we assume there are more than 10x30 = 300 homes in the community) so according to the Central Limit Theorem the distribution of sample means is 25200 N 498, 000, . 30 Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 26 26 The standard error is 25200 4600.87. n 30 The probability the sample mean is between $500,000 and $510,000 is 0.3273. Copyright © 2017, 2014 Pearson Education, Inc. Slide 27 Example: Home Prices b. Suppose we take a random sample of 10 homes in this community. Can we find the approximate probability that the mean of the sample is more than $510,000? If so, find it. If not, explain why not. Because the population of home prices is skewed we would need a sample size of at least 25 to apply the Central Limit Theorem. Since the sample size is 10, we cannot find this probability. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 28 28 Identify the Distribution The next slide shows three distributions. One of these distributions is a population. The other two distributions are approximate sample distributions of the sample means randomly sampled from the population, one of sample size 10 and the other of sample size 25. Match the graph with the correct distribution. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 29 29 Identify the Distribution Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 30 30 Identify the Distribution The Central Limit Theorem tells us that the sampling distribution for sample means are approximately Normal. Since this graph is not approximately Normal it is the graph of the population. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 31 31 Identify the Distribution As the sample size increases, the standard error (spread) of the distribution decreases. Since Figure (a) has more spread than Figure (c), Figure (a) is the distribution for samples of size 25 and Figure (c) is the distribution for samples of size 10. Figure (a) Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Figure (c) Slide Slide 32 32 The t-Statistic Hypothesis tests and confidence intervals for estimating and testing the mean are based on a statistic called the t-statistic: x -m t= s n Since the population standard deviation is almost always unknown, we divide by an estimate of the standard error, using the sample standard deviation s instead of s . Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 33 33 The t-Distribution The t-statistic does not follow the Normal distribution, because the denominator changes with every sample size. The t-statistic is more variable than the z-statistic, whose denominator is always the same. If the three conditions for using the Central Limit Theorem hold, the t-statistic follows a distribution called the t-distribution. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 34 34 The t-Distribution • • • • Symmetric and “bell-shaped” Has thicker tails than the Normal distribution Shape depends on the degrees of freedom (df) If df is small, the tails are thick; as df increases, tails get thinner. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 35 35 Section 9.3 Imagemore Co., Ltd ANSWERING QUESTIONS ABOUT THE MEAN OF A POPULATION • Construct and Interpret a Confidence Interval for a Population Mean Copyright © 2017, 2014 Pearson Education, Inc. Slide 36 Answering Questions about the Mean of a Population There are two approaches for answering questions about a population mean: 1. Confidence intervals – Used for estimating parameter values 2. Hypothesis tests – Used for deciding whether a parameter’s value is one thing or another These are the same methods as introduced in previous chapters for population proportions but modified to work with population means. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 37 37 Confidence Interval • Provides a range of plausible values for the population mean along with a measure of the uncertainty in our estimate • Is a measure of the uncertainty in our estimate – the higher the level of confidence, the better our estimate is Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 38 38 When to Use Confidence Intervals Use confidence intervals whenever you are estimating the value of a population parameter on the basis of a random sample. Do NOT use a confidence interval if there is no uncertainty in your estimate. If you have data for the entire population you need to find a confidence interval since the population parameter is known – there is no need to estimate it. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 39 39 Confidence Interval for a Population Mean: Conditions Before constructing a confidence interval for a population mean, check these three conditions: 1. Random, independent sample 2. Large sample – Either the population is Normally distributed or the sample size is at least 25. 3. Big population – If the sample is collected without replacement the population must be at least 10 times larger than the sample size. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 40 40 Example: Car Prices A used car website wanted to estimate the mean price of a 2012 Nissan Altima. The site gathered data on a random sample of 30 such cars and found a sample mean of $16,610 and a sample standard deviation of $2736. The 95% confidence interval for the mean cost of this model car based on this data is (15588, 17632) a. Describe the population. Is the number $16,610 an example of a parameter or a statistic? b. Verify that the conditions for a valid confidence interval are met. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 41 41 Example: Car Prices a. Describe the population. Is the number $16610 an example of a parameter or a statistic? The population is all 2012 Nissan Altimas that are for sale. The number $16,610 is a statistic because it is a measure of a sample, not a population. b. Verify that the conditions for a valid confidence interval are met. The sample is random and independent. The sample size (30) is large (at least 25) and the population is big (we can assume there are at least 10 x 40 = 400 2012 Altimas for sale). Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 42 42 Interpretation of the Confidence Level The confidence level is a measure of how well the method used to produce the confidence interval performs. For example, a 95% confidence interval means that if we were to take many random samples of the same size from the same population, we expect 95 of them would “work” (contain the population parameter) and five of them would be “wrong” (not contain the population parameter). Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 43 43 Calculating the Confidence Interval General structure: Estimator ± margin of error where margin of error = (multiplier) x SE and SE = s n Because we usually do not know the population standard deviation, we replace SE with its estimate that uses the sample standard deviation. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 44 44 One-Sample t-Interval The one-sample t-interval is a confidence interval for a population mean. where and x ±m m = t *SEEST SEEST = s n The multiplier t* is found using the t-distribution with n–1 degrees of freedom. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 45 45 One-Sample t-Interval To construct a one-sample t-interval you need four pieces of information: 1. The sample mean, x , that you calculate from the data. 2. The sample standard deviation, s, which you calculate from the data. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 46 46 One-Sample t-Interval 3. The sample size, n. 4. The multiplier, t*, which you can look up in a table or use technology. t* is determined by the confidence level and the sample size. The value of t* tells us how wise the margin of error is, in terms of standard errors. For example, if t*= 2 then our margin of error is 2 standard errors wide. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 47 47 Finding t* t* can be found using a t-distribution table, but because most tables stop at 35 or 40 degrees of freedom, it it best to use technology to construct confidence intervals. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 48 48 Example: Highway Speeds Data on the speed (in mph) for random sample of 30 cars travelling on a highway was collected. The mean speed was 63.3 mph with a standard deviation of 5.23 mph. a. Find the 95% confidence interval for the mean speed of all cars travelling on the highway. Verify that the necessary conditions hold. b. Interpret the interval. c. Is it plausible that the mean speed of cars on the highway is 67 mph? Why or why not? Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 49 49 Example: Highway Speeds a. Find the 95% confidence interval for the mean speed of all cars travelling on the highway. Verify that the necessary conditions hold. The sample is random and independent. We can assume that the population is large (at least 10 x 30 = 300 cars travel on the highway). The sample is large (at least 25) so the necessary conditions hold. Use technology to create the interval. StatCrunch: Stats > T stats > One sample > with summary Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 50 50 Confidence Interval: (61.35, 65.25) Copyright © 2017, 2014 Pearson Education, Inc. Slide 51 Example: Highway Speeds b. Interpret the interval. c. Is it plausible that the mean speed of cars on the highway is 67 mph? Why or why not? We are 95% confidence that the mean speed of cars on the highway is between 61.35 and 65.25 mph. Since 67 is not in our interval, it is not a plausible value for the mean speed of cars on the highway. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 52 52 Using the TI-84 Calculator To construct a confidence interval for a population mean on the TI-84 calculator: 1. Push STAT > TESTS then select T-INTERVAL. 2. If you have data in a list, select DATA. Enter the name of the list where the data is stored, leave Freq: 1, select your C-level and press Calculate. If you do not have data in a list, select STATS. Enter the sample mean, standard deviation, sample size. Select your Clevel and press Calculate. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 53 53 Example: Movies A random sample of 35 college students was asked how many movies they had seen in the previous month. The sample mean was 4.14 movies with a standard deviation of 10.02. a. Construct a 90% confidence interval for the mean number of movies college students see per month. b. If we were to use the data to construct the 95% confidence interval, would the interval be wider or narrower than the 90% confidence interval? c. What would be the effect of taking a larger sample on the width of the interval? Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 54 54 The 90% confidence interval is (1.28, 7.00). Copyright © 2017, 2014 Pearson Education, Inc. Slide 55 Example: Movies a. We are 90% confident that the mean number of movies seen by college students in a month is between 1.28 and 7.00. b. A 95% confidence interval would be wider than a 90% confidence interval because a 95% confidence interval would have a larger t* multiplier than a 90% confidence interval. c. If we take a larger sample, the standard error would be smaller. This means the margin of error would be smaller and so the interval would be narrower. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 56 56 Section 9.4 Monkey Business Images. Shutterstock HYPOTHESIS TESTING FOR MEANS • Conduct a Hypothesis Test for a Population Mean Copyright © 2017, 2014 Pearson Education, Inc. Slide 57 Four Steps for Hypothesis Testing 1. Hypothesize State your hypotheses about the population parameter. 2. Prepare Choose a significance level and test statistic, check conditions and assumptions. 3. Compute to Compare Compute a test statistic and p-value. 4. Interpret Do you reject the null hypothesis or not? What does this mean? Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 58 58 Hypothesis Test: Claim about a Population Mean Test Statistic for a One-Sample t-Test: x - m0 t= SEEST where SEEST s = n If conditions hold, the test statistic follows a t-distribution with df = n – 1. Two Conditions: 1. Random, independent sample 2. Large sample: The population must be Normal or the sample size must be at least 25. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 59 59 Example: Nursing In 2010, the mean years of experience among a nursing staff was 14.3 years. A nurse manager took a survey of a random sample of 35 nurses at the hospital and found a sample mean of 18.37 years with a standard deviation of 11.12 years. Do we have evidence that the mean years of experience among the nursing staff at the hospital has increased? Use a significance level of 0.05. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 60 60 Example: Nursing 1. Hypotheses H0: μ = 14.3 Ha: μ > 14.3 2. Prepare Random, independent sample; sample size is large (35 > 25) 3. Compute to Compare SEEST 11.12 = = 1.88 35 t= 18.37 -14.3 = 2.16 1.88 Use technology to find the p-value that corresponds with t = 2.16 df = 34. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 61 61 4. Interpret With a p-value of 0.019 and a significance level of 0.05, we would reject H0. Evidence suggests that mean experience among the nursing staff has increased. Copyright © 2017, 2014 Pearson Education, Inc. Slide 62 One- and Two-sided Alternative Hypotheses Just as with hypothesis tests for proportions, hypothesis tests can be one-sided or two-sided depending on the research question. The choice of the alternative hypothesis determines how the p-value is calculated. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 63 63 Using Technology Both the test-statistic and p-value can be computed using technology. StatCrunch: Stats > t Stats > One Sample > with summary Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 64 64 Using the TI-84 Calculator To conduct a hypothesis test for a mean with one population on the TI-84 calculator: 1. Push STAT > TESTS then select the T-test option. 2. If your data is in a list, select DATA, enter the number in H0, the list the data is in. Leave Freq at 1, and highlight the sign in your Ha. If you have summary statistics, select STATS and enter the required statistics. 3. Press Calculate or Draw. The test statistic and p-value will be displayed. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 65 65 Example: Hockey Attendance In the 2010 season, average home attendance for NHL hockey games was 17,072. Suppose a sports statistician took a random sample of 30 home hockey games during the 2014 season and found a sample mean of 18,104 with a standard deviation of 1203.5. Can we conclude that mean attendance at NHL games has changed since the 2010 season? Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 66 66 Example: Hockey Attendance 1. Hypothesize H0: μ = 17072 Ha: μ ≠ 17072 2. Prepare Random independent sample, sample size is large, assume population is large. 3. Use technology to find the test statistic and pvalue. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 67 67 3. Compute to Compare Test statistic: t = 4.70 p-value: p < 0.0001 Copyright © 2017, 2014 Pearson Education, Inc. Slide 68 Example: Hockey Attendance Since the p-value is less than our significance level, we reject H0. Yes, mean attendance at hockey games has changed since 2010. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 69 69 Section 9.5 Robert Kneschke. Shutterstock COMPARING TWO POPULATION MEANS • Conduct Hypothesis Tests and Construct Confidence Intervals About a Population Mean for Independent and Dependent Samples Copyright © 2017, 2014 Pearson Education, Inc. Slide 70 Independent vs. Dependent Samples When comparing two populations, it is important to note whether the data are two independent samples or are paired (dependent) samples. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 71 71 Paired/Dependent Samples Each observation in one group is coupled or paired with one particular observation in the other group. Examples: • “Before and after” comparisons • Related objects/people (twins, siblings, spouses) Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 72 72 Example: Dependent or Independent? People chosen in a random sample are asked how many minutes they spend reading and how many minutes they spent exercising during a certain day. Researchers wanted to know how different the mean amounts of time were for each activity. Is this a dependent or an independent sample? Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 73 73 Example: Dependent or Independent? A sample of men and women each had their hearing tested. Researchers wanted to know whether, typically, mean and women differed in their hearing ability. Is this a dependent or an independent sample? Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 74 74 Example: Dependent or Independent? A random sample of married couples are asked how many minutes per day they spent exercising. Means were compared to see if the mean exercise times for husbands and wives differed. Is this a dependent or an independent sample? Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 75 75 Example: Dependent or Independent? 1. In the reading and exercise example, the samples are dependent. The study was based on one sample of people who were measured twice (once for reading and once for exercise). 2. In the hearing example, the samples are independent because two different populations were used: one of men and the other of women. The people were not related. 3. In the study involving exercise of married couples, the sample is dependent because each husband is paired with his wife. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 76 76 Confidence Intervals: Independent Samples To construct the confidence interval for the difference in population means given independent samples, check three conditions: 1. Random samples and independence – Both samples are randomly taken from their populations and each observation is independent of any other. 2. Independent samples – The samples are not paired. 3. Large samples – The populations are approximately Normal OR the sample sizes are each 25 or more. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 77 77 Confidence Intervals: Independent Samples General structure: Estimate ± margin of error Specifically: (estimate of difference) ± t* (SEestimate of difference) The estimate of difference = Meanfirst sample – Meansecond sample SEestimate of difference = s12 s22 + n1 n2 Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 78 78 Confidence Intervals: Independent Samples Two-Sample t-Interval ( x1 - x2 ) ± t * s12 s22 + n1 n2 t* is based on an approximate t-distribution with df as the smaller of n1 – 1 and n2 – 1. For more accuracy, use technology. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 79 79 Interpreting Confidence Intervals: Independent Samples μ1 – μ2 Interpreting confidence intervals for the difference of population means given independent samples is the same as interpreting confidence intervals for the difference of population proportions. 1. If 0 is in the interval, there is no significant difference between μ1 and μ2. 2. If both values in the confidence interval are positive, then μ1>μ2. 3. If both values in the confidence interval are negative, then μ1<μ2 Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 80 80 Example: Units A college randomly surveyed day students and evening students to determine the number of units students were enrolled in. The data is shown in the table below. Use the data to find a 95% confidence interval for the difference in the mean difference in number of units for the two groups. Based on your confidence interval, is there a difference in mean number of units taken by day and evening students at this college? Day Evening n 72 68 mean 8.7 5.9 Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. s 3.1 3.7 Slide Slide 81 81 Checking Conditions: Using Technology Checking conditions: 1. Random sample and independence 2. Samples are independent 3. Large sample size (at least 25) StatCrunch: STAT > T-stats > Two sample > with summary Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 82 82 95% Confidence Interval: (1.66, 3.94) Copyright © 2017, 2014 Pearson Education, Inc. Slide 83 Example: Units 95% Confidence Interval: (1.66, 3.94) This is a confidence interval for μday – μevening Since both values are positive, we know μday > μevening Day students enroll, on the average, in more units than evening students at this college. The difference in the mean number of units enrolled is estimated between 1.66 and 3.94. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 84 84 Using the TI-84 Calculator To construct a confidence interval for the difference of population means (independent samples) on the TI-84 calculator: 1. Push STAT > TESTS then select option 2-SAMPTInt. 2. Select DATA (if you have data in a list) or STATS (if you have summary stats). 3. Enter the sample means, sample standard deviations, and sample sizes as prompted by the calculator, enter the confidence level and calculate. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 85 85 Example: Home Prices in Albuquerque Data on the selling price (in hundreds of $) was obtained for a sample of homes in Albuquerque, New Mexico. Homes were classified as located in the northeast (NE) section of the city or in another location in the city. The data are shown in the table below. Construct a 95% confidence interval for the difference in mean home price between the NE section and other sections in the city. Is there a difference in home prices? NE other n 39 78 mean 972.8 1107.4 Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. s 320.4 401.5 Slide Slide 86 86 Example: Home Prices in Albuquerque Check conditions: 1. Random sample and independence 2. Independent samples 3. Sample size is at least 25 Use appropriate technology to construct the interval. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 87 87 Confidence Interval (-270.73, 1.53) Copyright © 2017, 2014 Pearson Education, Inc. Slide 88 Example: Home Prices in Albuquerque Confidence Interval (–270.73, 1.53) The interval is for μNE - μother Since the interval contains 0, there is no significant difference in the average home prices between the two areas. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 89 89 Hypothesis Testing: Two Means 1. Hypothesize H0: μ1 = μ2 Ha: μ1 ≠ μ2 or μ1 < μ2 or μ1 ≠=> μ2 2. Prepare 1. Random samples, independent observations 2. Independent samples 3. Both populations are approximately Normal OR both sample sizes are at least 25 or more. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 90 90 Hypothesis Testing: Two Means 3. Compute to Compare t = difference in sample means – what H0 says the difference is SEEST H0 usually says the difference is 0, and so SEEST s12 s22 = + n1 n2 x1 - x2 t= SEEST Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 91 91 Hypothesis Testing: Two Means 4. Interpret Compare the p-value to the significance level. If the p-value is less than or equal to α, we reject the null hypothesis. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 92 92 Note about Pooling When using technology for a two-sample t-test, use the unpooled version. It is more accurate in most situations than the pooled version. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 93 93 Example: Units – Hypothesis Test Use the units data to test the claim that the mean number of units taken by day and evening students at the college are different. Day Evening n 72 68 mean 8.7 5.9 s 3.1 3.7 1. Hypothesize H0: μday = μevening Ha: μday ≠ μevening Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 94 94 Example: Units – Hypothesis Test 2. Prepare 1. Random samples, independent observations 2. Independent samples 3. Both sample sizes are at least 25 or more 3. Compute to Compare Use technology to compute the test statistic and p-value. StatCrunch: STAT > T Stats > Two Sample > with summary TI-84: 2-SampTTest Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 95 95 Test statistic: t = 4.84 p-value: < 0.0001 Copyright © 2017, 2014 Pearson Education, Inc. Slide 96 Example: Units – Hypothesis Test 4. Interpret Since our p-value is close to 0, we reject H0. There is a difference in mean units between day and night students at this college. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 97 97 Using the TI-84 Calculator To run a hypothesis test for the difference of population means (independent samples) on the TI-84 calculator: 1. Push STAT > TESTS then select option 2-SAMPTTest. 2. Select DATA (if you have data in a list) or STATS (if you have summary stats) 3. Enter the sample means, sample standard deviations, and sample sizes as prompted by the calculator, enter the sign in Ha and calculate. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 98 98 Example: Home Prices in Albuquerque – Hypothesis Test Use the data on home prices in Albuquerque to test the claim that the mean price of homes in the NE differs from other areas of the city. Use a significance level of 0.05. n mean s NE 39 972.8 320.4 other 78 1107.4 401.5 1. Hypothesize H0: μNE = μother Ha: μNE ≠ μother Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 99 99 Example: Home Prices in Albuquerque – Hypothesis Test 2. Prepare 1. Random samples, independent observations 2. Independent samples 3. Both sample sizes are at least 25 or more 3. Compute to Compare Use technology to compute the test statistic and p-value. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 100 100 Test statistic: t = -1.96 p-value: 0.0526 Copyright © 2017, 2014 Pearson Education, Inc. Slide 101 Example: Home Prices in Albuquerque – Hypothesis Test 4. Interpret Because the p-value (0.0526) is not less than the significance level, we cannot reject H0. We cannot conclude that prices in the NE neighborhood differ from the rest of the city. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 102 102 Dependent Samples • Transform the original data from two variables into a single variable that contains the difference between the scores in Group 1 and Group 2. • After the differences have been computed, we can apply either a confidence interval approach or a hypothesis test approach to the differences. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 103 103 Example: Reading Intervention Suppose 4th grade children are exposed to a reading intervention program designed to improve scores on a reading assessment. Children were given a pre-test and post-test before and after the reading intervention and the scores are shown in the table below. Assume that all conditions needed to construct the confidence interval are met. Construct a 95% confidence interval for the mean difference in reading score after participating in the program. Based on your confidence interval, do you believe the reading program is effective? Pre 10 14 21 18 15 16 18 19 20 25 16 Post 12 14 23 19 21 20 19 21 20 24 16 Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 104 104 Example: Reading Intervention Pre 10 Post 12 14 14 21 23 18 19 15 21 16 20 18 19 19 21 20 20 25 24 16 16 Diff 0 -2 -1 -6 -4 -1 -2 0 1 0 -2 The differences are pre – post. Use technology to create a 95% confidence interval using the difference data. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 105 105 Copyright © 2017, 2014 Pearson Education, Inc. Slide 106 Example: Reading Intervention The 95% confidence interval for the difference between the pre-test and post-test scores is (–2.90, –0.19). Since the confidence interval does not include 0, there is a significant difference in the mean pre- and post-test scores; however, the difference is not large in size (only between 0.19 and 2.9 points). Since the differences were for pre-test minus post-test, having negative values means the the post-test score was greater than the pre-test score, so there was improvement after the reading intervention. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 107 107 Example: Reading Intervention – Hypothesis Test Use the reading pre/post data to conduct a hypothesis test to see if there was a difference in mean pre/post test scores after the reading intervention. Assume the conditions for conducting a hypothesis test are met. Use a 0.05 significance level. 1. Hypothesize H0: μdifference = 0 (there is no difference in scores) Ha: μdifference ≠ 0 2. Prepare We were told to assume the conditions for conducting a hypothesis test are met. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 108 108 Copyright © 2017, 2014 Pearson Education, Inc. Slide 109 Example: Reading Intervention – Hypothesis Test 3. Compute to Compare Test statistic: t = -2.54 p-value: 0.03 Since the p-value is less than 0.05, reject H0. 4. Interpret Yes, there is a difference in mean pre- and posttest scores. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 110 110 Section 9.6 Rawpixel. Shutterstock OVERVIEW OF ANALYZING MEANS • Compare the Methods of the Chapter Used to Analyze Means Copyright © 2017, 2014 Pearson Education, Inc. Slide 111 A Lot of Repetition… • The hypothesis test for two means is very similar to the test for one mean. • The hypothesis test for paired data is really a special case of the one-sample t-test. • Hypothesis tests use almost the same calculations as confidence intervals and they impose the same conditions. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 112 112 Don’t Accept H0 If the p-value is larger than the significance level, we do not reject the null hypothesis. This is different from “accepting” the null hypothesis. Just because we do not reject the null hypothesis does not mean that we now believe the null hypothesis is true. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 113 113 Confidence Intervals and Hypothesis Tests If the alternative hypothesis is two-sided, a confidence interval can be used instead of a hypothesis test. Both approaches will always reach the same conclusion. Copyright Copyright©©2017, 2017,2014 2014Pearson PearsonEducation, Education,Inc. Inc. Slide Slide 114 114