Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
DESCRIBING A POPULATION The variation of a trait in a population - or lack therefore – can be described quantitatively using statistics. mathematical average A not quite ‘normal’, or bell curve data distribution half of the data points above, half below the most instances ANOTHER MEASURE : STANDARD DEVIATION Case in Point: Two classes took a recent quiz. There were 10 students in each class, and each class had a mean average score of 81.5 Since the averages are the same, can we assume that the students in both classes all did pretty much the same on the exam? Class A Mean Class B The mean does not tell us anything about the grade distribution or variation of grades in the population Need a way to measure the spread of grades Mean average Range (spread) Mean and range tell only part of the ‘story’ Here, most of the numbers hover around the mean, and are not evenly distributed throughout the range. mean 72 - 81.5 = -9.5 Standard Deviation is a measure of how spread out the values in the data set are from the mean SD = the difference of all of the values from the mean of the population, summed, divided by the population size SD IS ADDED/SUBTRACTED FROM THE MEAN Mean +/- the SD defines a range of values. In a normally distributed population, +/- 1 SD describes 68% of the population +/- 2SD describes 95% of the population +/- 3 SD describes 98% of the population Anything outside of 3SD is an outlier 68% 95% 98% If SD is small , the data is closer to the mean. More likely the IV is affecting the DV. If SD is LARGE, the numbers are spread out from the mean. Other factors are likely influencing the DV. [SIDEBAR: THE GRADE CURVE] F D C B A mean Best used with an exam that is difficult yielding a wide range of scores – why? RESULTS Team A Average on the Quiz Standard Deviation Using: Team B 81.5 81.5 4.88 15.91 Variance and Standard Deviation: Step by Step • Calculate the mean, x. • Subtract the mean from each observed value Hint: do each value once, account for multiple later • Square each of the differences. multiply by the number of each value now Sum this column. • Divide by n -1 where n is the number of items in the sample. This is the variance. • To get the standard deviation, take the square root of the variance. Example 1. Data (datum-mean)2 2.3 (2.3-3.4)2 = 1.21 3.7 (3.7-3.4)2 = 0.09 4.1 (4.1-3.4)2 = 0.49 Mean = 3.4 Summed* = 1.79 2. 1.79/(3-1) = 0.90 variance 3. Square root of 0.90 = 0.95 standard deviation 4. Mean +/- 1SD = 2.45 – 4.35 range that describes 68% of a normally distributed population *If you have multiple incidences of the same datum, multiply the (datum-mean)2 by the number of occurrences before summing Alternative approach: spreadsheet functions A B 1 2.3 =(A1-A4)^2 2 3.7 =(A2-A4)^2 3 4.1 =(A3-A4)^2 4 =(SUM(A1:A3)/3) =(SUM(B1:B3) variance =SUM(B4/(3-1) SD =SQRT(B4) Remember you can drag by the bottom left corner to SUM, or to fill subsequent cells (but you’ll have to correct the mean cell reference Alternative approach: graphing calculator s = square root of [(sum of X2 - ((sum of X) * (sum of X)/N)) / (N-1)] Step 1: Square each of the scores X 1 2 3 4 5 X2 1 4 9 16 25 Step 2: Use the x, x2 in formula = square root of [(55-((15)*(15)/5))/(5-1)] = square root of [(55-(225/5))/4] = square root of [(55-45)/4] = square root of [10/4] = square root of [2.5] s = 1.58113 Save in graphing calculator or spreadsheet!