Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
AP Statistics Review Part V & Part VI & Part VII: From the Data at Hand to the World at Large Learning About the World Inference When Variables Are Related Part V: Chapters 18 - 22 FROM THE DATA AT HAND TO THE WORLD AT LARGE SAMPLING DISTRIBUTION MODELS Sampling Distributions • A sampling distribution is the distribution of a statistic, such as x-bar or p-hat, from all possible samples of a given size (n) from a population of size N. • A certain proportion (p) of the population may display a particular characteristic. In each randomly selected sample of sze n, that same characteristic will be seen in a proportion (phat) of the sample. • We use p-hat (our sample proportion) as an estimate of p (the population proportion). • The same is true for the sample mean (x-bar) as an estimate of the population mean (μ). Modeling the Distribution of Sample Proportions The sampling distribution of sample proportions has these characteristics. • Shape – The distribution of sample proportions is approximately normally distributed provided that • np≥10 (there are at least 10 successes) and • n(1-p)≥10 (there are at least 10 failures) • Center – The mean of the sampling distribution of sample proportions, (p-hat) is centered at the mean of the population, p, regardless of sample size. (We say that p-hat is an unbiased estimator of p). • Spread – The standard deviation of sampling distribution of sample proportions is given by the formula pˆ p (1 p) , provided that n • 10n≥N (since we are sampling without replacement, the sample size should be less than 10% of the population size in order to use this formula for standard error). Modeling the Distribution of Sample Means • For categorical variables, we are interested in a proportion (p). For quantitative variables, a parameter of interest is the average or mean (μ). • The sampling distribution of sample means has these characteristics. • Shape – As sample size increases, the distribution of sample means tends toward an approximately normal distribution (Central Limit Theorem). • Center – The mean of the sampling distribution of sample means, x , is centered at the mean of the population, μ, regardless of sample size (We say that x-bar is an unbiased estimator of μ). • Spread – The standard deviation of the sampling distribution of sample means is given by the formula x . n Central Limit Theorem • The Central Limit Theorem (CLT) says that means of repeated samples will tend to follow a normal model if the sample size is “large enough”; this true no matter what shape the population distribution has! • If the population is not Normal, the sample size should be at least 30 for the CLT to apply. • Avoid confusing the distribution of a sample with the sampling distribution model for sample means. The distribution of a (one) sample will, as n increases, look more like the population from which it was taken. Standard Error • For both sampling distribution models (proportions and means), we have assumed that we know the population parameters ρ or μ and σ. • In most cases, it is not possible to know these values. Consequently, we must use what we do know and estimate the standard deviations of sampling distributions. • When this happens, we call the estimated standard deviation the standard error. Standard Error (cont.) • For a proportion, the standard error of p-hat is: SE ( pˆ ) s pˆ pˆ 1 pˆ n • For a sample mean, the standard error of x-bar is: SE ( x ) sx s n • We estimate a population parameter, p, by using a sample statistic, p̂. For this reason, p̂ is called a point estimate for p. • In general, if the expected value of the estimator equals the corresponding parameter, the statistic is called an unbiased estimator. • Since p̂ p, p̂ is an unbiased estimator for p. (Similarly, x is a point estimate for μ, and since x , x is an unbiased estimator for μ.) Standard Error (cont.) • We have seen that sample statistics vary. So rather than simply giving a single value for our estimate of the population proportion, we use our sample estimate, p̂ , to build a range of plausible values. • To do this, we want to establish an interval that we believe (at least with some degree of certainty) contains the true value of our population parameter. • This interval forms an interval estimate for the parameter, p, and is called a confidence interval. • Because of sampling variability, we can never say we are 100% certain; therefore, the confidence level we choose indicates the degree of certainty we have that our interval captures the true value of the parameter. CONFIDENCE INTERVALS FOR PROPORTIONS Creating and Interpreting a Confidence Interval • What We Say • If we choose 95% as our confidence level, we build an interval that stretches about 2 SE’s from p̂ in either direction. • Then we say: We are 95% confident that the true proportion is between (calculated lower limit) and (calculated upper limit). • This is a correct interpretation of the confidence interval. Creating and Interpreting a Confidence Interval • It is also possible to interpret the confidence level. • A correct interpretation of the confidence level would be: 95% confidence means that, in the long run, 95 out of 100 intervals calculated using the same procedure would (using random samples of the same size from the same population) capture the true population proportion. • Commonly used values for confidence levels are 90%, 95%, and 99%, but other levels may be chosen. Creating and Interpreting a Confidence Interval Know the difference between confidence interval and confidence level and be sure to give the correct interpretation. It’s easy to get interval and level confused. Creating and Interpreting a Confidence Interval • The general form of all confidence intervals is: Estimate ± (Critical Value)(Standard Error) • The desired confidence level determines the critical value (the number of standard deviations on either side of the estimate). The critical value (CV) is denoted by z*. • The most commonly used values are: Level z* 90% 1.645 95% 1.960 99% 2.576 Conditions for a One-Proportion z-Interval • Randomization Condition – proper randomization techniques used to collect the data. • 10% Condition – sample size is less than 10% of the population. • Success/Failure Condition – have at least 10 “successes” npˆ 10 and 10 “failures” n 1 pˆ 10 . Margin of Error • The distance of the endpoints of the interval from p̂ is called the margin of error (ME), in this case z s p̂ . • The margin of error is tied to the level of confidence. (Surveys in the newspaper or on TV usually report a margin of error based on a 95% confidence level.) • The more confident you want to be, the larger the margin of error will be. More confidence implies a wider interval. • Since the standard error of p̂ is tied to the sample size, as the sample size increases, the value of the standard error deceases and results in a narrower interval. Things to Remember About a Parameter • A parameter does not vary – there is only One true value of a parameter. • You DO NOT KNOW the ture value of the parameter. If you did, you would not need a confidence interval. • NEVER use p̂ when referring to the value of the parameter, p. Things to Remember About Interpreting a Confidence Interval • A confidence interval makes a statement about the true population proportion, and your interpretation of the interval should be about the population proportion. • Keep your interpretation to what you know – do not claim to know too much. A sentence or two should be sufficient. • Sample statistics vary with each sample. Not every interval you compute will capture the true population proportion. Your interpretation of the confidence interval should reflect how confident you are that your interval has managed to capture the true value; your confidence is in the method used to produce the interval. TESTING HYPOTHESES ABOUT PROPORTIONS Confidence Intervals and Hypothesis Tests • Confidence intervals, just one form of statistical inference, allow us to use a sample statistic to make a statement about how confident we are that a population parameter lies within certain limits. • Another form of statistical inference is a hypothesis test or test of significance. We use this form of statistical inference when a particular conjecture or claim (hypothesis) has been made about the true value of a population parameter. • Again, we use sample statistics to help us decide whether or not to believe the hypothesis. We want to know if there is enough evidence to support (NOT prove) the hypothesis or to reject it. The Reasoning of Hypothesis Testing • Hypothesis • The null hypothesis: The term null comes from the idea that there is “no difference” between the hypothesized value and the true population parameter. • H0: p=p0 (hypothesized value) • The null hypothesis MUST have a sign of equality and MUST be written using population parameters and NEVER sample statistics. • You MUST use standard notation (don’t make up your own symbols). • You should write your hypothesis in both symbols and words. • Your final decision MUST always be stated in terms of rejecting or failing to reject the null hypothesis. The Reasoning of Hypothesis Testing (cont.) • The alternative hypothesis: the alternative hypothesis is what would be accepted as true if the null hypothesis were to be rejected. It can take one of three different forms; • For proportions, a one-sided alternative would be either Ha: p<p0 or Ha: p>p0. • A two-sided alternative would be Ha: p≠p0. • The form used will depend on the study being done and should be clear from the context of the problem. The Reasoning of Hypothesis Testing (cont.) • Plan • Collect data that can be used to calculate a sample statistic to test against the null hypothesis. • Decision Rule: Decide what will be “convincing evidence” to reject the null hypothesis. • Mechanics • Check the conditions for the sampling distribution model. • Calculate a test statistic. A test statistic is the standardized value of the sample statistic with respect to the hypothesized value. The Reasoning of Hypothesis Testing (cont.) • In general, it has the form Test statistic = statistic - parameter standard deviation of statistic • For a proportion, the test statistic is given by pˆ p0 , where SD pˆ p0 1 p0 z SD pˆ n • Note: When computing SD for the test statistic, we use p0, because our model is based on the conjecture or claim (null hypothesis). When computing the SE for a confidence interval, we use p-hat because we do not have a hypothesized parameter. The Reasoning of Hypothesis Testing (cont.) • Calculate the p-value. • The p-value is the probability that a value at least as extreme as the observed value could occur if, in fact, the hypothesized value were true. • Use the normal table to convert your test statistic to a p-value. • The smaller the p-value, the less likely it is that our sample came from a population with the parameter value stated in our null hypothesis. In other words, the smaller the p-value, the stronger the evidence is against the null hypothesis. The Reasoning of Hypothesis Testing (cont.) • Conclusion • Make a decision about whether to reject or fail to reject the null hypothesis based upon the pvalue. • Reject H0 if the calculated p-value is less than the chosen α–level. We say that the result is statistically significant. • Your statement MUST include • Decision (reject H0 in favor of Ha or fail to reject H0). • Criteria for decision (p-value compared with α–level or test statistic compared with critical value). • Context (use the words of the problem to restate your decision). MORE ABOUT TESTS Possible Errors When Hypothesis Testing • Because of the variability of sample data, we may draw a wrong conclusion despite having followed correct procedures. • Whenever we make decisions based on sample data, there is a risk of error. • These errors are of two types, Type I and Type II. Type I Error • A Type I Error is the consequence of rejecting a null hypothesis that is in fact true. • The probability of making a Type I Error is equal to the value of α, the level of significance. Type II Error • A Type II Error is the consequence of failing to reject a null hypothesis that is in fact false. • The probability of this type of error is called β. • The value of β changes depending on the value of the parameter that is chosen as the alternative. • Note: Knowing how to calculate β is NOT a requirement of the AP curriculum. You ARE expected to understand the difference between α and β and how they affect each other. Power • The probability that a test will correctly reject a null hypothesis that is in fact false is called the power of the test. • POWER = 1 - β. • Power is the complement of a Type II Error. • High power is desired because it indicates the sensitivity of the test to specific values of the alternative hypothesis. Ways to Reduce Error • Type I Error (α) – decrease the α-level. • Type II Error (β) – increase the α-level. • Type I and Type II Errors – increase the sample size, n. • Power (1 - β) – increase the sample size, n, or decrease β, or select a different, more extreme alternative. COMPARING TWO PROPORTIONS The Sampling Distribution for pˆ1 pˆ 2 Conditions for the Difference of Two Proportions • Independence Condition – the two sample groups are independent of one another. • Randomization • 10% Condition • Success/Failure Condition Two-Proportion z-Interval Two Proportion z-Test • The procedure for this test is essentially the same as for a one-proportion z-test with one slight difference: the null hypothesis is that there is no difference between the two proportions. H0: p1=p2 (H0: p1-p2=0) • The alternative hypothesis is determined by the kind of difference you expect to get in the particular situation (<, ≠, >). Two Proportion z-Test • The standard error in this case is pooled (combine the two proportions). It is okay to pool the proportions because the null hypothesis assumes that p1=p2 and thus the two populations are identical for the attribute being studied. SE pooled 1 1 x x pˆ pooled qˆ pooled , where pˆ pooled 1 2 n1 n2 n1 n2 Two Proportion z-Test It is highly unlikely that you will have to do this calculation by hand. The calculator two-proportion z-test calculates the z-score automatically and uses the correct standard error. What You Need to Know • Identify parameters and statistics in a sample or experiment. • Recognize the fact of sampling variability: a statistic will take different values when you repeat a sample or experiment. • Interpret a sampling distribution as describing the values taken by a statistic in all possible repetitions of a sample or experiment under the same conditions. • Describe the bias and variability of a statistic in terms of the mean and spread of its sampling distribution. • Understand that the variability of a statistic is controlled by the size of the sample. Statistics from larger samples are less variable. What You Need to Know • Recognize when a problem involves a sample proportion 𝑝. • Find the mean and standard deviation of the sampling distribution of a sample proportion 𝑝 for an SRS of size n from a population having population proportion 𝑝. • Know that the standard deviation (spread) of the sampling distribution of 𝑝 gets smaller at the rate 𝑛 as the sample size n gets larger. • Recognize when you can use the Normal approximation to the sampling distribution of 𝑝. Use the Normal approximation to calculate probabilities that concern 𝑝. What You Need to Know • Recognize when a problem involves the mean 𝑥 of a sample. • Find the mean and standard deviation of the sampling distribution of a sample mean 𝑥 from an SRS of size n when the mean µ, and standard deviation σ of the population are known. • Know that the standard deviation (spread) of the sampling distribution of 𝑥 • gets smaller at the rate 𝑛 as the sample size n gets larger. What You Need to Know • Understand that x has approximately a Normal distribution when the sample is large (central limit theorem). Use this Normal distribution to calculate probabilities involving 𝑥. • Use the z procedure to give a confidence interval for a population proportion 𝑝. • Check that you can safely use the z procedures in a particular setting. • Determine the sample size required to obtain a level C confidence interval with a specified margin of error. • Use the z statistic to carry out a test of significance for the hypothesis 𝐻0 : 𝑝 = 𝑝0 about a population proportion p against either a one-sided or a two-sided alternative . What You Need to Know • Check that you can safely use the oneproportion z test in a particular setting. • Use the two-sample z procedure to give a confidence interval for the difference 𝑝1 − 𝑝2 between proportions in two populations based on independent SRSs from the populations. • Use a two-proportion z test to test the hypothesis 𝐻0 : 𝑝1 = 𝑝2 that proportions in two distinct populations are equal. • Check that you can safely use these z procedures in a particular setting. PRACTICE PROBLEMS #1 Suppose that 35% of all business executives are willing to switch companies if offered a higher salary. If a headhunter randomly contacts an SRS of 100 executives, what is the probability that over 40% will be willing to switch companies if offered a higher salary? a) b) c) d) e) .1469 .1977 .4207 .8023 .8531 #2 The average number of daily emergency room admissions at a hospital is 85 with a standard deviation fo 37. In an SRS of 30 days, what is the probability that the mean number of daily emergency admissions is between 75 and 95? a) b) c) d) e) .1388 .2128 .4090 .5910 .9474 #3 A confidence interval estimate is determined from a SRS of n students. Which of the following will result in a smaller margin of error? I. A smaller confidence level II. A smaller sample size a) I only b) II only c) both I and II #4 A survey was conducted to determine the percentage of high school students who planned to go to college. The results were stated as 82% with a margin of error of 5%. What is meant by +/- 5%? a) Five percent of the population were not surveyed. b) In the sample, the percentage of students who plan to go to college was between 77% and 87% c) The percentage of the entire population of students who plan to go to college is between 77% and 87% d) It is unlikely that the given sample proportion result would be obtained unless the true percentage was between 77% and 87% e) Between 77% and 87% of the population were surveyed. #5 A USA Today “Lifeline” column reported that in a survey of 500 people, 39% said they watch their bread while it’s being toasted. Establish a 90% confidence interval estimate for the percentage of people who watch their bread being toasted. a) 39% +/- .078% b) 39% +/- 2.2% c) 39% +/- 2.8% d) 39% +/- 3.6% e) 39% +/- 4.3% #6 A politician wants to know what percentage of the voters support her position on a hot issue. What size voter sample should be obtained to determine with 90% confidence the support level to within 4%? a) b) c) d) e) 21 25 423 600 1691 #7 Which of the following are true statements? I. Hypothesis tests are designed to measure the strength of the evidence against the null hypothesis. II. A well-planned test should result in a statement either that the null hypothesis is true or that it is false. III. The alternate hypothesis is one-sided if there is interest in deviations from the null hypothesis in only one direction. a) b) c) d) e) I and II I and III II and III I, II, and III None of the above #8 A building inspector believes that the percentage of new construction with serious code violations may be even greater than the previously claimed 7%. She conducted a hypothesis test on 200 new homes and finds 23 with serious code violations. Is this strong evidence against the 7% claim? a) b) c) d) e) Yes, because the P-value is .0062 Yes, because the P-value is 2.5 No, because the P-value is only .0062 No, because the P-value is over 2 No, because the P-value is .045 #9 In a survey of 9700 T.V. viewers, 40% said they watch network news programs. Find the margin of error for this survey if we want 95% confidence in our estimate of the percent of T.V. viewers who watch network news programs. a) 1.12% b) 1.28% c) 0.731% d) 0.975% #10 Which is true about a 99% confidence interval based on a given sample? I. The interval contains 99% of the population. II. Results from approximately 99% of all samples will capture the true parameter in their respective intervals. III. The interval is wider than a 95% confidence interval would be. a) I only b) II only c) III only d) II and III only e) None #11 To maintain customer satisfaction, an online catalog company wants to have on-time delivery for 90% or better of the orders they ship. The company has been shipping their orders via UPS and FedEx but will switch to a new, cheaper delivery service called ShipFast unless there is evidence that this service cannot meet the delivery goal of 90% or better. As a test the company sends a random sample of orders via ShipFast, and then makes follow-up phone calls to see if these orders arrived on time. Which hypotheses should they test? a) H0: p<0.90 Ha: p=0.90 b) H0: p>0.90 Ha: p=0.90 c) H0: p=0.90 Ha: p<0.90 d) H0: p=0.90 Ha: p≠0.90 e) H0: p=0.90 Ha: p>0.90 #12 A researcher investigating whether runners are less likely to get colds than non-runners found a P-value of 3%. This means that: a) 3% of runners get colds. b) 3% fewer runners get colds. c) There’s a 3% chance that runners get fewer colds. d) There's a 3% chance our assumption of no difference in number of colds whether a runner or not is incorrect. e) There’s a 3% chance that the sample statistic or more extreme will occur assuming there is no difference between number of colds whether a runner or not. Part Vi: Chapters 23 - 25 LEARNING ABOUT THE WORLD INFERENCE ABOUT MEANS The t-Distributions: A Sampling Distribution for Means • By the Central Limit Theorem (CLT), the means of repeated samples tend to follow a normal model as long as the population distribution is normal or the sample size is large enough. • According to the CLT, the formula for the standard deviation of the sampling distribution for means is x SD( x ) n • When we have to estimate with , the standard error of the sampling distribution for means is s sx SE ( x ) n • Because σ is unknown, the standard error is used in place of the standard deviation, and the shape of the sampling distribution changes. • The new model is a family of distributions called the t-distribution. The t-Distributions • The shape of the t-distributions (unimodal, symmetric, and bell-shaped) is connected to the sample size by a parameter called the degrees of freedom. • The increased variation due to small sample size increases the probability in the tails. • The degrees of freedom (df) determine the particular t-distribution. • Degrees of freedom (df) equals (n – 1). Sampling Distribution for Means Conditions for Using t-Distributions • Randomization Condition • 10% Condition • Nearly Normal Condition – the data come from a unimodal, symmetric, bell-shaped distribution. This can be verified by constructing a histogram or a normal probability plot of the sample data. • If the population is approximately normal, the tstatistic is appropriate regardless of sample size. • For n≤40, the t-statistic is appropriate provided there are no outliers. • For 15≤n<40, the t-statistic is appropriate provided there are no outliers or strong skewness. • For n<15, the t-statistic is appropriate provided the distribution of the sample data is approximately normal. One-Sample Confidence Interval for the Mean (σ unknown) • Because we are using s to estimate , we use a t-critical value for confidence intervals and a t-test statistic for hypothesis tests. One-Sample t-Confidence Interval Hypothesis Test for Means The format for all hypothesis tests is essentially the same as it was for proportions. Determining Sample Size • Once you have decided on a confidence level and margin of error, you calculate the sample size from the appropriate confidence interval formula. • You should always round the sample size (n) UP to the nearest integer. Sample size should be a whole number INFERENCE FOR TWO INDEPENDENT SAMPLES Comparing Two Means • As is the case of proportions, the parameter of interest is the difference between two population means, μ1 - μ2. The mechanics are the same. The Sampling Distribution for Two Independent Means x1 x2 • The mean of the sampling distribution of x1 x2 is μ1 - μ2. • The standard error, because we substitute s for σ, of the sampling distribution is SE x1 x2 12 n1 22 n2 • The assumptions and the conditions for inference for two-sample means are the same as those for one-sample means, with the added condition that the samples are independent. A Two-Sample t-Interval for Means The formula for calculating the actual degrees of freedom is rather complicated, and generally is not a whole number (the TI calculator gives this number). A Two-Sample t-Interval for Means • When using the TI calculator, you are asked to choose “Pooled: No Yes.” Choose NO. • The word pooled here does not have the same meaning as it did for two proportions. Test for the Difference Between Two Means When you record a p-value, be certain that it is a number from 0 to 1. Do not carelessly copy the calculator output! Probabilities cannot exceed 1! A SPECIAL CASE: TWO SAMPLES OR ONE? Matched Pairs • What should we do when the two samples are not independent. • What if the data are for the same people at two different times. • It would not make sense to get averages before and after, and do a two-sample ttest for means. Besides it would violate the condition of independent samples. • We really want to know the amount of change for each individual. So we subtract first and create a single set of data. Now we use a one-sample procedure. Matched Pairs • There are many situations that call for data to be treated in this way, but we must have a good reason for pairing the data. • Just because two samples have the same number of data values is not a reason for pairing them. • It is up to you to recognize when a matched pairs procedure is called for. • For matched pairs, we use the same procedures we used for the one-sample tinterval or the one-sample t-test for the mean. What You Need to Know • Why the t-distribution is used to model the sampling distribution of sample means? • Properties of the t-distribution, the sampling distribution for means. • What degrees of freedom is and how to calculate. • Conditions for using t-distribution. • Calculate one-sample t-confidence interval for means. • Conduct one-sample t-hypothesis test for means. What You Need to Know • Determine sample size for means. • Sampling distribution for two means and conditions required for inference. • Calculate two-sample t-confidence interval for means. • Conduct two-sample t-hypothesis test for the difference of means. • Recognize a matched pairs situation. • Calculate matched pairs t-confidence interval and t-hypothesis test. PRACTICE PROBLEMS #1 In preparing to use a t procedure, suppose we were not sure if the population was normal. In which of the following circumstances would we not be safe using a t procedure? a) A stemplot of the data is roughly bell shaped. b) A histogram of the data shows moderate skewness. c) A stemplot of the data has a large outlier. d) The sample standard deviation is large. #2 The weight of 9 men have mean of 175 lbs and a standard deviation of 15 lbs. What is the standard error of the mean? a) b) c) d) e) 58.3 19.4 5 1.7 None of the above. #3 What is the critical value t* which satisfies the condition that the t distribution with 8 degrees of freedom has probability 0.10 to the right of t*? a) b) c) d) e) 1.397 1.282 2.89 0.90 None of the above. The answer is _____. #4 Suppose we have two SRSs from two distinct populations and the sample are independent. We measure the same variable for both samples. Suppose both populations of the values of these variables are normally distributed but the means and the standard deviations are unknown. For purposes of comparing the two means, we use a) b) c) d) e) Two-sample t procedures Matched pairs t procedures z procedures The least-squares regression line None of the above. #5 • You want to compute a 90% confidence interval for the mean of a population with unknown population standard deviation. The sample size is 30. The value of t* you would use for this interval is a) b) c) d) e) f) 1.96 1.645 1.699 .90 1.311 None of the above. #6 A 95% confidence interval for the mean reading achievement score for a population of third-graders is (44.2, 54.2). The margin of error of this interval is a) b) c) d) e) 95% 5 2.5 54.2 The answer cannot be determined from the information given. #7 Using the same set of data, you compute a 95% confidence interval and a 99% confidence interval. Which of the following statements is correct? a) b) c) d) The intervals have the same width. The 99% interval is wider. The 95% interval is wider. You cannot determine which interval is wider until you know n and s. #8 To use the two-sample t procedure to perform a significance test on the difference between two means, we assume a) The populations’ standard deviations are known. b) The samples from each population are independent. c) The distributions are exactly normal in each population. d) The sample sizes are large. e) All of the above. #9 In a test for acid rain, 49 water samples showed a mean pH level of 4.4 with a standard deviation of 0.35. Find a 90% confidence interval estimate for the mean pH level. a) b) c) d) e) 4.4±0.01175 4.4±0.0839 4.4±0.315 4.4±0.35 None of the above. The answer is _____. Part VII: Chapters 26 & 27 Inference When Variables Are Related INFERENCE WHEN VARIABLES ARE RELATED • We have already reviewed inference for proportions (z-intervals and tests) and inference for means (t-intervals and tests). • In this part of our review, we look at tests for one and two categorical variables. • The data for these tests are usually presented in some form of table divided into categories and must be in the form of counts for each category. COMPARING COUNTS The Chi-Square Statistic • There are three inference tests involving categorical variables and counts. • These tests introduce a new statistic called the chi-squared statistic, which has the same form for all three tests: 2 all cells O E 2 E • Where O stands for the observed value and E stands for the expected value. The Chi-Square Distribution • The χ2 distribution is a family of distributions. • Each individual distribution is specified by its degrees of freedom. • Unlike the t-distribution, the value of n in this case is not the sample size but rather the number of categories. • The χ2 distribution is skewed to the right and has only positive values. • Because there are only positive values, all hypothesis tests are right tailed tests and the p-value is computed by calculating the area under the curve to the right of the computed χ2 test statistic. THE CHI-SQUARE TESTS Goodness-of-Fit Test • One categorical variable – one sample. • Used to determine how well a set of observed values matches a set of expected values. • In other words, how well does a sample distribution match the hypothesized population distribution? • The degree of freedom is the number of categories minus 1 (#cat. – 1). • The expected values (E=np) are calculated in one of two ways: • Values for each category are expected to all have a uniform distribution. • Values are expected to follow a distribution specified according to some stated condition. Assumptions/Conditions for Chi-Square Test for Goodness-of-Fit • Counted Data Condition: Make sure the data are listed in the form of counts for each category. Precents or proportions WON’T DO – convert them to counts (rounded to whole numbers) if necessary. • Randomization Condition: The individual cases should be a random sample from the population of interest. • Expected Cell-Frequency Condition: There are at least 5 cases in each expected cell. Chi-Square Test for Goodness-of-Fit • Null and alternative hypotheses are usually written in words. • The null hypothesis is a statement that the distributions are consistent or the same. • The alternative hypothesis is a statemnt that the distributions are not consistent or the same. Chi-Square Test for Goodness-of-Fit Chi-Square Test for Goodness-of-Fit • If the expected distribution is uniform, the percentage would be 100%/# of cells. For example, we expect digits in a randomnumber table to appear with equal frequencies. The percentage for each digit would be 10%. We would use these percentages to calculate the expected counts. • If you are asked to identify the cell that contributes the most to the χ2 test statistic, just look at the individual components that contribute to your sum. Test of Independence • The “Chi-Square 2 Test of Independence” is used to determine if, in a single population, there is an association between two categorical variables. • The data are presented in a two-way table. • The null and alternative hypotheses are written in words. • The null hypothesis is a statement that the two categories are independent or not related. • The alternative hypothesis is that the two categories are depentent or related. Chi-Square Test of Independence • The Expected Values are calculated as; row total column total E table total • The degrees of freedom are calculated as; df # rows 1 # columns 1 • Although, we reject the null hypothesis of independence, we cannot interpret our small p-value as proof of causation. A failure of independence does not prove a cause-and-effect relationship. There may be lurking variables present. What You Need to Know • Be able to write hypotheses for all of the various χ2 tests. • Be able to do the various tests with and without your calculator, decide to reject/retain, and write a conclusion. • Know how to find expected values for chisquare tests. PRACTICE PROBLEMS #1 Answer: B #2 A test of independence for data organized in a two-way table relating number of siblings and number of family relocations is conducted using the chi-square distribution. The p-value of the test is .045. If alpha is .05, then which of the following is a valid conclusion of the test? a) The mean is significant. b) We reject the hypothesis that the variables are dependent. c) We accept the hypothesis that the variables are independent. d) We have sufficient evidence to reject the hypothesis that the variables are independent. e) The variables are independent. #3 A chi-square goodness of fit test is used to test whether a 0-9 spinner is “fair” (ie., the outcomes are all equally likely). The spinner is spun 100 times, and the results are recorded. Which member of the chi-square family of curves is used? a) b) c) d) e) χ2(8) χ2(9) χ2(10) χ2(99) None of the above #4 A two-way table of counts is analyzed to examine the hypothesis that the row and column classifications are independent. There are 3 rows and 4 columns. The degrees of freedom for the chi-square statistic are a) b) c) d) e) 12 11 6 The minimum of n1-1 and n2-1 None of the above #5 In the paper “Color Association of Male and Female 4th grade School Children”, children were asked to indicate what emotion they associated with the color red. The expected frequency for the cell corresponding to Anger and Males is: a) b) c) d) e) 15.9 55.7 30.4 31.9 29.1 #6 In the paper “Color Association of Male and Female 4th grade School Children”, children were asked to indicate what emotion they associated with the color red. The null hypothesis is: a) Emotional association with red is independent of gender. b) Gender is dependent upon the emotional association with red. c) The probability of selecting an emotion with red is related to gender. d) The number of children in each cell does not depend upon gender or upon emotion. e) The color red is independent of the emotion associated with it and with gender. #7 In the paper “Color Association of Male and Female 4th grade School Children”, children were asked to indicate what emotion they associated with the color red. The null hypothesis will be rejected at α=0.05 if the test statistic exceeds: a) b) c) d) e) 3.84 5.99 7.81 9.49 14.07 #8 In the paper “Color Association of Male and Female 4th grade School Children”, children were asked to indicate what emotion they associated with the color red. The approximate P-value is: a) Between 0.100 and 0.900 b) Between 0.050 and 0.100 c) Between 0.025 and 0.050 d) Between 0.010 and 0.025 e) Between 0.005 and 0.010