* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Statistics 101
Foundations of statistics wikipedia , lookup
Sufficient statistic wikipedia , lookup
History of statistics wikipedia , lookup
Bootstrapping (statistics) wikipedia , lookup
Statistical inference wikipedia , lookup
Taylor's law wikipedia , lookup
Sampling (statistics) wikipedia , lookup
Gibbs sampling wikipedia , lookup
Statistics 101 Chapter 9 : Sampling Distributions Section 1 Sampling Distributions Parameter – a number that describes the population. Fixed number, but in practice we do not know its value because we cannot examine the entire population. Statistic – a number that describes a sample. The value of a statistic is known when we have taken a sample, but it can change from sample to sample. Difference between a parameter and a statistic Parameter –come from population Statistics –come from samples Example 9.1: Making Money The mean income of the sample of households contacted by the Current Population Survey was $57,045. The number 57,045 is a statistic because it describes this one CPS sample. The population that the poll wants to draw conclusions about is all 106 million U.S. households. The parameter of interest is the mean income of all of these households. We do not know the value of this parameter. Sampling Variability What would happen if we took many samples? Take a large number of samples from the same population Calculate the sample mean (x) or sample proportion p for each sample Make a histogram of the values of X or p Examine the distribution displayed WHY WOULD WE DO THESE THINGS?? Example 9.3 Baggage Check! http://statweb.calpoly.edu/chance/applets/app lets.html Simulation is powerful for studying chance. It is faster to use table B then to actually draw repeated SRS’s The distribution of the sample proportion from SRSs of size 100 drawn from population with population proportion p = 0.7. The histogram shows the results of drawing 1000 SRSs. Sampling Distribution The distribution of values taken by the statistic in all possible samples of the same size from the same population. The population used to construct the random digits table (Table B) can be described by the probability distribution. Values of mean in all possible samples of two random digits. The sampling distribution of x for samples of size n=2. Bias of a statistic A statistic used to estimate a parameter is unbiased if the mean of its sampling distribution is equal to the true value of the parameter being estimated. Variability of a Statistic The spread is determined by the sampling design and the size of the sample. Larger samples give smaller spread The spread of the sampling distribution is approximately the same for any population size as long as the population is much larger than the sample (10 times or greater.) Summary Parameters describe the population Statistics describe the sample A statistic from a probability sample or randomized experiment has a sampling distribution Bias and variability come in when a statistic is used as an estimator of a parameter. Properly chosen statistics do not suffer from bias or variability. Bias and variability – a visual Exercises 2, 8, 9abcde, 10, 17