Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chapter 7 Estimation Procedures Basic Logic In estimation procedures, statistics calculated from random samples are used to estimate the value of population parameters. Example: If we know 42% of a random sample drawn from a city are Republicans, we can estimate the percentage of all city residents who are Republicans. Basic Logic Information from samples is used to estimate information about the population. Statistics are used to estimate parameters. POPULATION SAMPLE PARAMETER STATISTIC Basic Logic Sampling Distribution is the link between sample and population. The value of the parameters are unknown but characteristics of the S.D. are defined by theorems. POPULATION SAMPLING DISTRIBUTION SAMPLE Estimation Procedures A point estimate is a sample statistic used to estimate a population value. Both sample means and sample proportions are unbiased estimates of the population mean or proportion. (explain bias) For both means and proportions we can use characteristics of their respective sampling distributions to establish confidence intervals around the statistic. Shape of sampling distributions For both means and proportions, as N becomes large, the sampling distribution will be normal (for our purposes, N=100 is large enough) The standard deviation of the sampling distribution is called the standard error. Standard error and mean of sampling distribution 1. If we take a large sample, we can use the mean of the sample as an unbiased estimate of the mean of the sampling distribution. 2. The standard error of the mean = (the standard deviation of the sample)/(the square root of N-1) Efficiency of confidence intervals Efficiency is determined by the dispersion in the sampling distribution. The smaller the standard deviation of the sampling distribution, the greater the efficiency of our estimate Efficiency is therefore maximized as N gets larger Constructing Confidence Intervals For Means Set the alpha level(probability that the interval will be wrong). Setting alpha equal to 0.05, a 95% confidence level, means the researcher is willing to be wrong 5% of the time. Find the Z score associated with alpha. Z-scores are expressed in standard deviation units. Statistics texts always have a table that correlates z scores and areas under a normal curve (show example on overhead projector) If alpha is equal to 0.05, we would place half (0.025) of this probability in the lower tail and half in the upper tail of the distribution. Confidence Intervals For Means: Problem 7.5c For a random sample of 178 households, average TV viewing was 6 hours/day with s = 3. Alpha = .05. c.i. c.i. c.i. c.i. = = = = 6.0 6.0 6.0 6.0 ±1.96(3/√177) ±1.96(3/13.30) ±1.96(.23) ± .44 Confidence Intervals For Means We can estimate that households in this community average 6.0±.44 hours of TV watching each day. Another way to state the interval: 5.56≤μ≤6.44 We estimate that the population mean is greater than or equal to 5.56 and less than or equal to 6.44. This interval has a .05 chance of being wrong. Confidence Intervals For Means Even if the statistic is as much as ±1.96 standard deviations from the mean of the sampling distribution the confidence interval will still include the value of μ. Only rarely (5 times out of 100) will the interval not include μ. Sampling error for proportions As N becomes large (100 or more), the proportion in the sample is an unbiased estimate of the proportion in the sampling distribution (and the population) The standard error of proportions = the square root of (.25/N) Constructing Confidence Intervals For Proportions Procedures: Set alpha. Find the associated Z score. For an alpha of .05, we put .025 in each tail of the normal distribution, and using our table of normal curve areas, Z = 1.96. Confidence Intervals For Proportions If 42% of a random sample of 764 from a Midwestern city are Republicans, what % of the entire city are Republicans? Don’t forget to change the % to a proportion. c.i. c.i. c.i. c.i. = = = = .42 .42 .42 .42 ±1.96 (√.25/764) ±1.96 (√.00033) ±1.96 (.018) ±.04 Confidence Intervals For Proportions Changing back to %s, we can estimate that 42% ± 4% of city residents are Republicans. Another way to state the interval: 38%≤Pu≤ 46% We estimate the population value is greater than or equal to 38% and less than or equal to 46%. This interval has a .05 chance of being wrong. SUMMARY In this situation, identify the following: Population Sample Statistic Parameter SUMMARY Population = All residents of the city. Sample = the 764 people selected for the sample and interviewed. Statistic = Ps = .42 (or 42%) Parameter = unknown. The % of all residents of the city who are Republican.