Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
CHAPTER SIX SAMPLING DISTRIBUTION FOR MEANS AND PROPORTIONS In general the population characteristics will be represented by letters from the Greek alphabet while sample characteristics will be represented by latin letters. In statistical inference, the mean and the variance calculated from sample data are used to estimate the population mean and variance, hence is called a point estimate for μ 2 s is called a point estimate for 2. The mean of all sample means is written as . The standard deviation of all sample mean is written as . is called the finite population correction factor and must be used when sampling from finite population. As a rule of thumb, when the sample size is less than 5% of the population size. It can be proved (mathematically) that the probability distribution of the sample means for sample size greater than 30, selected from any population (whose mean and variance are known) approaches a Normal distribution with mean μ and standard deviation This is called the Central Limit Theorem. The distribution of sample means for sample size n≥30. In addition, it can also be proved (mathematically) that the Central Limit Theorem applies for small samples selected from Normal populations when the population variance σ2 is known. for samples of any size from a Normal distribution, known variance σ2 A direct application of the Central Limit Theorem is the (i) (ii) calculation of probabilities regarding sample means; calculations of the limits that contain various percentages of the mean. Example An importer of Herbs and Spices claims that the average weight of packets of Saffron is 20 gms. However, packets are actually filled to an average weight μ= 19,5 gm., standard deviation σ = 1,8 gm. A random sample of 36 packets is selected, calculate: a) the probability that the average weight is 20 gms or more; b) the two limits within which 95% of all packets weight; c) the two limits within which 99% of all average weights fall. a) The question ask for probability P(x≥20). It is necessary to use the probability distribution of the sample mean, that is Normal with μ = 19,5 and σ= 1,8/(n)1/2 . For From Tables the area in the tail is 0,0475. b) The question asks for individual weights, not average. The Z values that correspond to a tail area of 0,025 are -1,96 and +1,96.. The upper limit is μ+Zσ= 19,5 + 1,96 (1,8) = 19,5 + 3,528 = 23,028. The lower limit is μ-Zσ = 19,5 – 1,96(1,8) = 19,5 – 3,528 = 15,672. c) This part asks for sample averages. The method is the same as in part b) except that the standard for average is . The upper limit is given by μ+Z = 19,5 + 1,96 (0,3) = 19,5 +0,588= 20,088. The lower limit is given by μ-Z = 19,5 -1,96 (0,3) = 19,5 -0,588= 18,912. Sampling distribution of proportion A proportion is the number of elements with a given characteristic divided by the total number of elements in the group. The sample proportion, p, is the point estimate of the population proportion, π. The standard error for proportion is given by the formula If N is large, the second term goes to zero and The sampling distribution of proportion The list of every possible sample proportion, with its probability, is called the sample distributions of proportions. For large samples (n≥30) Example In a certain neighbourhood, it is known that 12% of youths aged from 16 to 24 are unemployed. If a random sample of 150 youth are selected what is the probability that the sample contains At most 10% unemployed At most 15 unemployed If 12% are unemployed π=0,12 e σp = = 0,0265. P(p≤0,10). Z= = -0,75 The tail area is 0,2266. The probability that at most 10% of sample is unemployed is 0,2266. Mean and standard errors for the distribution of proportion when π is unknown The calculation of the mean and standard error of sample proportion depends on knowing the value of the population proportion, π. Since π is seldom known, the mean and standard error for proportions are approximated by substituting p for π. Hence Some desirable properties of estimators Two of the most important properties of point estimators are that they should be: 1. Unbiased 2. Minimum variance Unbiased estimators An estimator is said to be unbiased if the average value of all the point estimates is equal to the population parameter to be estimated. The mean is an unbiased estimator of μ and the sample proportion p is an unbiased estimator for π. Minimum variance The values of sample statistics (sample means and proportios) can vary greatly about the population parameter being estimated. It is obviously desiderable to keep this variation (as measured) by standard error as small as possible (minimum variance). Increasing sample size reduces the standard error. An estimator is described as precise when the values of the estimates are close. The standard deviation of all the estimates (called standard error) is a measure of the precision of an estimator. Ideally an estimator should be unbiased and have minimum variance (both accurate and precise). The sample mean and proportion are unbiased and minimum variance estimators of the population mean and proportion.