* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Estimation, Error, and Expectation
Survey
Document related concepts
Transcript
Muuuuuuuuuu Event, Estimation, Error, and Expectation Assume each one of you is a Sample of N=1: •Your height (Xi) is the mean height of your sample •Your height (mean height) is an Estimation of the Class (Population) Mean height (μ, 67.6 inches) •How much Error will your height (mean) have as an Estimator of The class Mean height (μ)? The Standard Deviation (4.1 inches) is a measure of the Expected Error of Estimation 68% of the time you height will be off by no more that 4.1” As an Estimator of μ (67.6 inches) The More Extreme Your Are, The Less Probable You Are .5 PROBABILITY .4 .3 .2 .1 0.0 55 60 70 65 HEIGHT 75 80 Your Deviation Score Is Your Error Score .5 PROBABILITY .4 .3 .2 .1 0.0 -10 0 DEVIATION SCORE 10 The More You Deviate From μ, The Less Likely Your Error Deviation Standard Deviation Average Deviation Expected Deviation Error Standard Error Average Error Expected Error The probability of your height is the: •Probability of your Deviation •Probability of your Error of Estimation 95% of the time your Error of Estimation will be 2*SD or less 4.1” * 2 = 8.2 Think Many Samples The farther off (Error) a Sample is (from μ): •The Less frequent the Sample •The Less frequent the Error •The Less Probable the Error Expect the Sample to occur less often Expect the Error to be less Likely (Probability) Sample Error of Estimation With Size Central Limit Theorem: •The sample means of an infinite number Of samples (of the same size) from the same population will Have a Grand Mean equal to μ •The sample means will be normally distributed about μ •The Standard Deviation of the Distribution of Sample Means Will Decrease as Sample Size Increases This means that the Errors of Estimation (X-bar- μ) Will Decrease as the size of the samples Increases Three Distributions •Population Distribution: Distribution of all raw scores in the population •Sample Distribution: Distribution of raw scores in a sample •Sampling Distribution: Distribution of infinite # of Sample Means (equal sample size) from same Population Per Grand Mean and CLT: Distribution of Sample Means is Distribution of Errors of Estimation of μ Standard Error Of The (Sample) Mean The Standard Deviation of the Sampling Distribution is the Standard Error of the Mean (I guarantee you that if you don’t know this I WILL fail you!!) Standard Error of the Mean: Average Error of the Mean Expected Error of the Mean Expected Error of the Sample Mean in Estimating μ (See parentheses above!) Sample Size and SEM If all samples have N=1: •The Sampling Distribution is Identical to the Sample Distribution •The Standard Error of the Mean equals the Standard Deviation of the Sample As the N of each sample increases for each (uniformly): •Errors of Estimation Decrease •Standard Error of the Mean Decreases The SEM Is The SD Of The Sampling Distribution •68% of the Sample Means will be within 1 SEM/SD of the Grand Mean/μ •68% of the Sample Means will have Errors of Estimation (of μ) of 1 SEM/SD or less. •95% of the Sample Means will have Errors of Estimation (of μ) of 2 SEM/SDs or less Computing The SEM/SD Of The Sampling Distribution With sample sizes N=1, the SEM equals the SD of the Population/Sample As sample size Increases, Errors of Prediction Decrease Per N in the denominator The SEM/SD Is Determined By Two Factors 1. Heterogeneity of the Parent Population 2. Size of the Samples Computing The SEM/SD Of The Sampling Distribution SD N 4.1” 4.1 4.1 4.1 4.1 4.1 1 2 (1.4) 3 (1.7) 4 (2) 9 (3) 16 (4) SEM SEMx2 (68%) (95%) 4.1” 8.2” 2.9” 5.8” 2.4” 4.8” 2.05” 4.1” 1.37 2.74” 1.025 2.5” Or Less! Sample Means have Probabilities Just Like Raw Scores The Error of your Sample The Expected Error of your Sample •The number of Standard Deviations X-bar is from μ •The probability of a Sample with a Mean with (X-bar – μ) amount Of Error of Estimation Error of Estimation Because of the Shape of the Bell Curve of Sampling Distributions: •Probability of Error decreases with Size of Error •Versus a rectangular distribution The 5 Es of Experimentation 2. Estimation 3. Error of Estimation 5. Evaluation 4. Expectation of Error Of Estimation Every Sample Has a Z A Sample’s Z tells you the probability of: •Getting this Sample from the Specified Population •Finding a Sample with a Mean this far away from the Specified μ •Making a mistake if you decide that this Sample didn’t come from The Specified Population Because the Sample is too damn different from the Population I, Anthropologist I’m a Watusi, what’s it to ya?? I, Anthropologist Watusi μ = 84 inches Watusi = 4 inches Watusi X-bar = 81.06 inches (H0) N=6 Z = (81.1 – 84) / [4/(6)] = (-2.94)/1.63 Z = -1.8 Critical Z (one-tail) for p < 0.05 = 1.65 -1.8 > -1.65 Sample probably not Watusi Reject Null Hypothesis!!! I’m a Watusi, what’s it to ya?? I, Anthropologist Pygmy μ = 42 inches Pygmy = 4.5 inches Pygmy X-bar = 45.12 inches (H0) N=6 Z = (45.12 – 42) / [4.5/(6)] = (3.12) / 1.84 Z = 1.7 Critical Z (one-tail) for p < 0.05 = 1.65 1.7 > 1.65 Sample probably not Pygmy Reject Null Hypothesis!!! I’m a good player, Pygmy!! I, Anthropologist Ubangi, u bet!! Have Mursi on me!!!! I, Anthropologist Ubangi μ = 70 inches Ubangi = 4.2 inches Ubangi X-bar = 73.6 inches (H0) N=6 Z = (73.6 – 70) / [4.2/(6)] = 3.6/1.71 Z = 2.1 Critical Z (two-tail) for p < 0.05 = 1.96 Ubangi, u bet!! Have Mursi on me!!!! 2.1 > 1.96 Sample probably not Ubangi Reject Null Hypothesis!!! z, t, F = z, t, F = What Do You Do If You Don’t Know σ? Beer And Western Civilization •Louis Pasteur pasteurization •Carlsberg Brew Master pH system •Guinness Statistician (Gosset) t-test What Do You Do If You Don’t Know σ? If you want to evaluate the probability of a Sample (Mean): If you know μ but you don’t know σ: 1. Estimate σ from the sample using N-1 2. Then Estimate the SEM using the Estimated Standard Deviation My Ugly Cousin Estimates An Estimated SEM will have some Error: The larger the sample size: •The less error in estimating the population SD •Hence, the less error in estimating the SEM An Estimated SEM Has A Corresponding Estimated Sampling Distribution For an estimated value of the SEM: The shape of the Sampling Distribution changes as N increases The SD/SEM is Wider than SN-Curve the Smaller the Sample N There Is A Separate Estimated Sampling Distribution For Every Sample N Look up the probability of t in the t-table The t-Table Is A Table Of Tables A separate table for every sample size: •Degrees of Freedom: df = N-1 •Use Row in t-table with Degrees of Freedom corresponding To your sample size (A Z-table does not use N or Degrees of Freedom) One-Tail The t-Table Two-Tail Going From Sample To Population If you have a Sample Mean, this is the best estimate of μ How Confident can you be about your Estimate? Step 1: Step 2: Step 3: Step 4: Step 5: Step 6: Estimate σ from the sample using N-1 Estimate SEM from Estimated σ Look up the 95% Confidence t-value two-tail) for N-1 df Multiple t-value (Step 2) by estimated SEM (Step 2) Add Value from Step 3 to X-bar (UL: Upper Limit) Subtract Value from Step 4 from X-bar (LL: Lower Limit) μ has a 95% chance of being between UL & LL Free One-Size Fits All Pants For The Men of UMD How tall is the Average (μ) UMD Male? N=25 X-Bar = 70” Estimated σ = 3” Estimated SEM = 3/25 = 3/5 = 0.6” Critical t-value for 95% Confidence Interval (df=25-1) = 2.064 Estimated SEM * Critical t-value = 0.6” * 2.064 = 1.2384” UL = X-bar + (Estimated SEM * Critical t) = 70” + 1.24” = 71.24” LL = X-bar - (Estimated SEM * Critical t) = 70” – 1.24” = 68.76”