Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Psychometrics wikipedia , lookup
Degrees of freedom (statistics) wikipedia , lookup
Bootstrapping (statistics) wikipedia , lookup
Taylor's law wikipedia , lookup
History of statistics wikipedia , lookup
Eigenstate thermalization hypothesis wikipedia , lookup
Foundations of statistics wikipedia , lookup
Statistical hypothesis testing wikipedia , lookup
Resampling (statistics) wikipedia , lookup
Ngawang Tsering 1 CONFIDENCE INTERVAL ESTIMATES In general, the purpose and the meaning of a confidence interval are to point out the true proportion of a sample or the range of values to estimate the true value of a population parameter. It is an interval between where the true value of population parameter lies within. It basically means the distance from the left threshold to right threshold of a critical point which consists of the true value of the population parameter. Construct a 99% confidence interval estimate for the true proportion of yellow candies. (ππΌ = 0.01) ππΌ 2 = 0.005 (Critical value = 2.575) π‘ππ‘ππ π¦πππππ€ πππππππ (p= π‘ππ‘ππ ππ. πππππππππ , p= 297 , =0.198) 1500 Point estimate= 0.198 ππΌ Margin of error (E) = 2 β 2.575 = β 2 =.0265 (π)(π) π (.198)(.802) 1500 β Confidence interval =π ± E =.198 ± .0265 = (.1715 < p < .2245) Construct a 95% confidence interval estimate for the true mean number of candies per bag. (π‘πΌ = 0.05) β (πππππ = π‘ππ‘ππ ππ. ππ πππππππ ππ.ππ ππππ ππ πππππππ 1500 ,= 25 (β π₯βπ₯ 2 ) , =60) 90 Standard deviation = β = β =β3.75 =1.93649 πβ1 24 Degrees of freedom=n-1 =25-1 =24 π‘ π Margin of error (E) = πΌ 2 βπ 1.93649 =2.064 β25 =.799383 β Confidence interval =π ± E =60±.799383 = (59.2 < µ < 60.8) Ngawang Tsering 2 Construct a 98% confidence interval estimate for the standard deviation of the number of candies per bag. (π‘πΌ = 0.02) = π‘πΌ 0.02 = 2 2 =0.01 Degrees of freedom=n-1 =25-1 =24 (π‘πΌ = 2.492) Standard deviation = β =β (πβ1)π 2 <Ο<β (β π₯βπ₯ 2 ) πβ1 =β 90 24 =β3.75 =1.93649 (πβ1)π 2 2 ππ (25β1)1.936492 ππΏ2 (25β1)1.936492 =β <Ο<β 42.980 =1.44706< Ο < 2.87929 10.856 All the three interval estimates are shown above. The first one is the interval estimate of the proportion of yellow candies from the total number of candies from class with the confidence level of 99%. The true value of proportion (0.198) lies in between the intervals (.1715 and .0265). Second is the interval estimate of the true mean number of candies per bag with the confidence level of 95%. The true mean number of candies (60) is within the interval of (59.2 and 60.8). Third estimate is for the standard deviation of the number of candies per bag with the confidence level of 98%. The standard deviation of the sample (1.94) does lie within the interval of (1.45 and 2.88). HYPOTHESIS TESTS Generally in statistics, hypothesis test is a procedure for testing a claim about a property of a population. Whenever we do a hypothesis test we are making a claim or a statement for a value of population parameter. We do the test to decide whether to accept or reject the claim. We always do the test referring to null hypothesis and alternative hypothesis. For this, we need to calculate the value of the test statistics, choose a sampling distribution that is relevant, find either P value or critical value, and state conclusion about the claim after you get the result whether to accept or reject. Use a 0.05 significance level to test the claim that 20% of all Skittles candies are red. P=0.20, q=1-p =.80, n= 1500, x=305, Ξ±=0.05 π₯ 305 =^π = π =1500 =0.203333 (Null hypothesis)ππ : P= 0.20 (Alternative hypothesis)ππ : Pβ 0.20 Test statistics= = .^π βπ ππ β π 0.2033β0.20 β (0.20)(0.80) 1500 Z =.322749 Ngawang Tsering 3 Z value area= .6255 P value= 1-.6255 = .3745 =.3745×2 =.746886/.75 (0.75>0.05) βPβ value is greater than Ξ± 0.05. Therefore, we fail to reject the null hypothesis. In conclusion, we state that there is not sufficient evidence to reject the null hypothesis that 20% of all skittles candies are red. Use a 0.01 significance level to test the claim that the mean number of candies in a bag of Skittles is 55. Ξ±=0.01, µ =55, n=25 Standard deviation = β β (πππππ = (β π₯βπ₯ 2 ) πβ1 π‘ππ‘ππ ππ. ππ πππππππ ππ.ππ ππππ ππ πππππππ =β ,= 90 24 =β3.75 =1.93649 1500 25 , =60) Degrees of freedom=n-1 =25-1 =24 (π‘πΌ = 2.492) (Null hypothesis)ππ : µ = 55 (Alternative hypothesis)ππ : µ β 55 Test statistics= β π βµ π βπ 60β55 = 1.93649 β25 =12.91 T=12.9099 P value =0.00 (from T-table) P value is less than Ξ± 0.01. Therefore, we reject the null hypothesis. In conclusion, there is sufficient evidence in order to reject the null hypothesis that the mean number of candies in a bag of skittles is 55. So, it supports the alternative hypothesis with sufficient evidence that is mean of candies in a bag of skittles is not 55. The hypothesis tests done above are the claims for proportion of candies that 20% of all skittles candies are red and the mean number of candies in a bag of skittles is 55. We needed to do the test with null hypothesis claiming that the proportion and mean are equal to what it is said in the description. Null hypothesis donβt expect any change or difference from the original data. Then we need alternative hypothesis claiming that null hypothesis is not correct. It expects changes and differences in the data from the original. So, we have performed the test and we couldnβt find sufficient evidence to reject the first claim which is 20% of the candies is Red. But we have sufficient evidence to reject the second claim which is mean of a bag of candies is 55. Ngawang Tsering 4 Reflection The conditions for doing interval estimates and hypothesis tests are that you need to know the confidence interval or confidence level, critical point with z score that is a threshold for unusual, margin of error, point estimate, sample size, true percentage, true mean, standard deviation. The sample should be greater than 30 to be in normal distribution. During hypothesis, you need to do test statistics in order to find your critical value and then need to know how to use, z, t and chi square table. My sample met these conditions because the data were there with sample size. After that I calculated all the mean and standard deviation. The question includes the other parts such as confidence level, proportion of the sample that we need to find. Using this data, there might not be an error if you understand the question but if you are confused then there would be error in finding you point estimate, p value, βzβ score and mean of X. If you made error in the beginning of your findings then your whole work will be meaningless. Sampling method could be improved by increasing your sample size βnβ. From this statistical research I have learnt how you can get a percentage or an estimate of everything in your daily life. I realized how important role statistics can play in any aspects of your life, whether you are in business, healthcare, justice system, journalist or any other profession. You can tell the probability of anything thatβs important to you if you have the right data and the right knowledge of performing the test on it.