Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Analysis of variance (ANOVA) (from Chapter 7) Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 1 Tests on multiple hypotheses • Consider the situation where the means for more than two groups are compared, e.g. mean alcohol expenditure for: (a) students; (b) unemployed; (c) employees • One could run a set of two mean comparison tests (students vs. unemployed, students vs. employed, employed vs. unemployed) • But.....too many results... Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 2 Analysis of Variance • It is an alternative approach to mean comparison for multiple groups • It is applicable to a sample of individuals that differ for one or more given factors • It allows tests where variability in a variable is attributable to one (or more) factors Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 3 Example EFS: Total Alcoholic Beverages, Tobacco Economic position of Household Reference Person Unoc Ret unoc SelfFulltime Pt under Unempl. over min employed employee employee min ni ni age age Mean 18.56 14.64 12.39 19.48 St. Dev. 19.0 18.5 15.0 19.7 7.34 14.6 TOTAL 11.99 12.67 19.1 17.8 Are there significant difference across the means of these groups? Or do the differences depend on the different levels of variability across the groups? Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 4 Analysis of Variance • Here: the target variable is alcohol, bev., tobacco expenditure, the factor is the economic position of the HRP Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 5 One-way ANOVA • Only one categorical variable (a single factor) • Several levels (categories) for that factor • The typical hypothesis tested through ANOVA is that the factor is irrelevant to explain differences in the target variable (i.e. the means are equal, as in bivariate mean comparisons/t-tests) • Apart from the tested factor(s), the groups should be safely considered homogeneous between each other Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 6 Null and alternative hypothesis for ANOVA • Null hypothesis (H0): all the means are equal • Alternative hypothesis (H1): at least two means are different Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 7 Measuring and decomposing the total variation VARIATION BETWEEN THE GROUPS + VARIATION WITHIN EACH GROUP= ________________________________ TOTAL VARIATION Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 8 The basic principle of the ANOVA: If the variation explained by the different factor between the groups is significantly more relevant than the variation within the groups, then the factor is assumed to be statistically relevant in explaining the differences The test statistic: • The test statistic is computed as: 2 B 2 W s Variance between groups F s Variance within groups Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 9 Distribution of the F-statistic (one-tailed test) if p<0,05 we refuse H0: i.e. the means are not equal Rejection area Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 10 ANOVA in SPSS Target variable Factor Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 11 SPSS output ANOVA Variance between EFS: Total Alc oholic Beverages, Tobac co Between Groups W ithin Groups Total Sum of Squares 6171.784 151535.3 157707.1 df 5 494 499 Mean Square 1234.357 306.752 F 4.024 Sig. .001 p-value < 0.05 Variance within Variation decomposition Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi Degrees of freedom The null is rejected 12 Post-hoc tests • They open the way to further explore the sources of variability when the null hypothesis of mean equality is rejected. • It is usually relevant to understand which particular means are different from each other. Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 13 Some post-hoc tests • • • • • • LSD (least significant difference) Duncan test Tukey’s test Scheffe test Bonferroni post-hoc method ....... Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 14 ANOVA assumptions Two key assumptions are needed for running analysis of variance without risks 1)that the sub-samples defined by the treatment are independent 2)that no big discrepancies exist in the variances of the different sub-samples Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 15 Multi-way (factorial) analysis of variance • This analysis measures the influence of two or more factors • Beside the influence of each individual factor, it provides testing of interactions between treatments belonging to different factors • ANOVA with more than two factors is rarely employed, as interpretation of results becomes quite complex Statistics for Marketing & Consumer Research Copyright © 2008 - Mario Mazzocchi 16