Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Where we are Where we are going Statistical analyses by number of sample groups 1 - Single sample Z or t tests 2 – T tests (dependent, independent) 3 or more ? Analysis of variance Between subjects (independent) Equal N (Easy) Unequal N (Harder – structural) Within subjects (dependent – structural) Two-way (independent - structural) Analysis of Variance Equal N (Easy) Used with 3 or more groups Extends logic of independent groups t-test Some additional things to think about The alternative hypothesis in ANOVA is always - The population means are different (at least one mean is different from another) ASSUMPTIONS of ANOVA: equal variances (required for pooling) normality (required for test distribution) The null hypothesis in ANOVA is always: 1 2 3 j This implies that any combination of means are also equal (1 2 ) / 2 3 **** Null hypothesis is tested by comparing two estimates of the population variance (σ2 ): (MSB) between-group estimate of (σ2 ) AFFECTED by whether the null is true (MSW) within-group estimate of (σ2 ) UNAFFECTED by whether the null is true ANOVA: Null Hypothesis Is TRUE Score Distributions Sample 3 Scores Sample 2 Scores Sample 1 Scores Mean Distributions, N=9 Sample 3 Mean Sample 2 Mean Sample 1 Mean ANOVA: Null Hypothesis Is False Score Distributions Mean Distributions (N=9) -- when the null hypothesis is true (MSB) F ratio = ------- = About 1 (MSw) -- when the null hypothesis is not true (MSB) F ratio = ------- = Much greater than 1 (MSw) Let: K J NJ _ Xj NG _ XG I S2_ X = # of groups (treatments) = a particular group 1…K = # of people in group J = Sample mean for group J = Total (grand) number of people = The grand mean x = Individual = Variance of the means ij / nG Between Groups variance S2_ Is an estimate of X σ2_ X σ2_ X Variance of the means σ2 = --- Because the variance of the means is the n variance of the variable divided by sample size S2_ (n) estimates the population variance (σ2 ) X S2_ (n) = MSB X IF groups are same size (n1=n2=n3…) Between Groups variance If Null is true, then these are just random samples S2_ (n) estimates σ2 just as well as any random samples would X If Null is false, then these are not just random samples S2_ (n) = MSB will be higher than the populations X variance because the means are farther away from each other than would be expected by chance Within Groups variance If Null is true, then these are just random samples So we can poll the variances just as we did with the independent samples t-test 2 + S 2 + S 2 +… S 2 S 1 2 3 k 2 S pooled = ----------------------k MSW = S2pooled MSB F = --------MSW • F Ratio • Critical values of f depend on dfW and dfB • Look up in table Post-hoc tests ANOVA tells us there is some difference, but it does not tell us which groups are different from each other ANOVA is like a shotgun – firing many pellets at many different hypothesis like u1=u2 u2=u3 (u1+u2)/2=u3 Post-hoc tests - Tukey Tests all the pairwise comparisons – does not test complex hypotheses (such as (u1+u2)/2=u3) u1=u2 u1=u3 u1=u4 u2=u3 u2=u4 u3=u4 Apriori tests Also referred to as “planned comparisons” “planned contrast” A rifle instead of a shotgun. Used to test a specific hypothesis that is a subset of all hypotheses. For example, with 3 groups – if you wanted to test if group 3 was different from the other two groups, then you would test the following: (u1+u2)/2=u3 But WHAT IF the sample sizes are not the same? Structural Model of ANOVA An alternative way of understanding ANOVA - used whenever Nj is not equal across groups All the basic logic stays the same, computationally however XG = mean of all the scores For each score, deviation from grand mean is divided into two parts a) deviation of score from mean of its group b) deviation of group mean from grand mean 2 ( X X ) ij G SST = 2 ( X X ) j G = SSB DFB = . S MS B 2 B F = MSB/MSW (X j XG ) K 1 DFB = + 2 ( X X ) ij j + K 1 DFW = SSW Ng K 2 SW2 MSW K 1 2 ( X X ) ij j NG K DFW = N g K STANDARD WAY OF SETTING UP ANOVA EFFECT SS DF MS BETWEEN 2 ( X X ) j G K 1 WITHIN 2 ( X X ) ij j N g K SSW / DFW TOTAL 2 ( X X ) ij G N g 1 2 ( X X ) ij G = 2 ( X X ) j G F SS B / DFB MS B / MSW + 2 ( X X ) ij j • F Ratio • Critical values of f depend on dfW and dfB • Look up in table Anova Effect size _ _ Xmax - Xmin dM = -------------Spooled = How far apart the means are divided by the standard deviation Similar to the effect size for independent t-test (mean difference/stdev) Small Medium Large .2 .5 .8