* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Example
History of statistics wikipedia , lookup
Bootstrapping (statistics) wikipedia , lookup
Taylor's law wikipedia , lookup
Statistical inference wikipedia , lookup
Misuse of statistics wikipedia , lookup
Sampling (statistics) wikipedia , lookup
Gibbs sampling wikipedia , lookup
Chapter 18 The Diversity of Samples from the Same Populations population – set of all possible measurements sample – a subset of the population parameter – a numerical measure of a population statistic – a numerical measure of a sample Since the population is often not available, we use statistics to estimate parameters In statistical application, we take a random sample from the population and compute a statistic, say x The value of the statistic x depends on which items are selected for the sample Therefore, x is a random variable … different samples yield different values of x The probability of a statistic over all possible samples is known as its sampling distribution Sampling Distribution of the Sample Mean The statistic x estimates the population mean  We hope x is very close to  We want the sampling distribution to be centered at the value of the parameter and to have little variation. Facts about sampling distribution of x x   The average value of x across all possible samples in  , the population mean x   n The standard deviation of the sampling distribution of x is the population standard deviation  divided by n Notice that as n increases the sample to sample variability in x decreases If our sample comes from a normal distribution with mean  and standard x deviation  then Z  has a  n standard normal distribution Central Limit Theorem If we sample from a population with mean  and standard deviation  then x is approximately standard Z  n normal for large n . If n  30 or larger, the central limit theorem will apply in almost all cases Example A population of soft drink cans has amounts of liquid following a normal distribution with   12 and   0.2 oz. What is the probability that a single can is between 11.9 and 12.1 oz. P(11.9  X  12.1)  P(0.5  Z  0.5)  .69  .31  .38 What is the probability that x is between 11.9 and 12.1 for n = 16 cans P (11.9  x  12.1)  P (2  Z  2)  .975  .025  .95 Example A population of trees have heights with a mean of 110 feet and a standard deviation of 20 feet. A sample of 100 trees is selected Find  x  x    110 Find  x  20 x   2 n 100 Find P( x  108 feet) P ( x  108)  P ( Z  1)  1  .16  .84 Notice this probability for x is approximately correct even though the population is not normally distributed (because of central limit theorem) What about P( X  108) ? This cannot be done for x since we do not know that the population is normally distributed. Sampling Distribution of the Sample Proportion Population Proportion # in population with characteristic p # in population Sample Proportion # in sample with characteristic pˆ  n p̂ is a point estimate of p  pˆ  p  pˆ  p1  p  n If we sample from a population with a proportion of p, then Z  pˆ  p is p1  p  n approximately standard normal for large n. Example Suppose the president’s approval rating is 56% and we look at samples of size 100. Find the following. Find  p̂  pˆ  p  .56 Find  p̂ p1  p  .561  .56  n 100 .2464   .002464  .0496 100  pˆ  Example A survey of 120 registered voters yields 54 who plan to vote for the republican candidate. p = proportion of all voters who plan to vote for the republican candidate pˆ  54  0.45  45% 120 Do you think there is much of a chance that the republican candidate will get at least 50% of the vote? Calculate the margin of error 1 120  .0913 Think about the variance of the sampling distribution pˆ 1  pˆ  .451  .45 .2475    .0454 n 120 120 The empirical rule says that 95% of data should be within 2 standard deviations 2.0454   .0908 Do you see where the margin of error comes from?