Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
CHAPTER 8 Data Analysis and Statistical Methods: Univariate and Bivariate Analyses © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Dr. Ravi Zacharias – Are all non-Christians going to hell? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. The Use of Statistical Methods in Marketing Research statistical analysis helps distinguish “signal” from “noise” and contrast their relative size data analysis procedures univariate, bivariate & multivariate procedures regression methods factor analysis cluster and latent class analyses multidimensional scaling conjoint analysis interdependence methods – to elucidate the structure of a set of variables dependence methods – to help explain a separate dependent variable © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 1. What are the steps in the data analysis process? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Overview of Data Analysis Process Coding Transcribing Data cleaning Variable specification and recoding Selecting a data-analsyis strategy © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 2. What questions help researchers identify appropriate analytical techniques after data collection? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Overview of Data Analysis Procedures to choose data analysis technique, first determine: the number of variables to be analyzed together • focus on a single variable: univariate analysis • considering two variables: bivariate analysis • considering many variables at once: multivariate analysis whether data is to be described or used to make inferences • descriptive statistics: summary measures of sample data • inferential statistics: probability theory used to make statements about the population what level of measurement is available in variable(s) of interest • nominal scale • ordinal scale • interval scale © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 3. What are two types of analytical statistics? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Analytical Statistics Univariate Descriptive • Measures of central tendency (Mean, Median Mode) • Measures of dispersion (Standard Deviation) Inferential • Z Test • T-Test © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Analytical Statistics Bivariate Descriptive • Correlation Coefficient • Regression Inferential • Chi-Square Test © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 4. What are the steps involved in the univariate data analysis? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Overview of Univariate Data Analysis Procedures FIGURE 8.1 Overview of univariate data analysis procedures © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 5. Define Descriptive Statistics. © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Descriptive Statistics It provides summary measures of the data contained in all the elements of ta sample, particularly measures of central tendency, which describes where the bulk of the data are located, and dispersion, which describes how ‘spread out’ the data values are around the central measure. © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Descriptive Statistics central tendency – where are the bulk of the data? mean – average value, used for interval data median – middle value of the data, used for ordinal data mode – the category of a variable that occurs most often, used for nominal data (can be bimodal) dispersion – how spread out is the data? standard deviation –used for interval data absolute frequencies – number of items in the sample in each category of the variable, used for nominal data relative frequencies – proportion of items in the sample in each category of the variable, used for nominal data © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Mean Back © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Standard Deviation Back © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Dr. Ravi Zacharias – What is the future of our culture? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 6. Define Null Hypothesis and Alternative Hypothesis. © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Hypothesis Testing null hypothesis (H0) – a specific statement, opposed to the alternative hypothesis, subjected to statistical testing (Ho); statement that a population parameter takes on a particular value with enough evidence, can be rejected in favor of the alternate hypothesis never accepted as valid, only unable to be rejected alternative hypothesis – a specific statement, opposed to the null hypothesis, subjected to statistical testing (H1); the sampling distribution reveals whether the sample value is different enough from the null hypothesis value to have occurred through sampling error alone “one-tailed” test when alternative hypothesis is directional © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Null Hypotheses?? A statement of “equality” “No relationship between the variables you are studying” “No differences between the two groups you are trying to compare” There will be no difference between the average scores of freshmen and seniors in my Financial Stewardship class There will be no relationship between absenteeism and grades achieved in my Production and Operations class © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Research Hypotheses?? A statement of “inequality” “There is a relationship between the variables you are studying” “There are differences between the two groups you are trying to compare” There will be a difference between the average scores of freshmen and seniors in my Financial Stewardship class There will be a relationship between absenteeism and grades achieved in my Production and Operations class © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Null vs. Research Hypotheses Null Hypotheses Equality Research Hypotheses Inequality Population based Sample based Indirectly tested Directly tested Greek symbols Roman symbols Implied Hypotheses Stated Hypotheses © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Hypotheses Null Hypotheses There is no difference in the frequency or the proportion of occurrences in each category. HO: P1 = P2 = P3 Research Hypotheses There is a difference in the frequency or proportion of occurrences in each category. H1: P1 ≠ P2 ≠ P3 © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 7. What are two possible errors in hypothetical testing? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Hypothesis Testing (continued) Type I error (α): H0 is true and is rejected when values are outliers on the distribution significance level – tolerable level of Type I error (α) confidence level of the test (1 - α) – likelihood of being correct (not rejecting H0 when it is true) Type II error (): H0 is false and is not rejected when different sampling distribution happens to be likely under H0 power of the test (1 - ) – probability of rejecting a false null hypothesis For a given sample size, increases as α decreases have to balance ‘tolerable’ degrees of the two types of error, α and © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Hypothesis Testing (continued) TABLE 9-1 S ummary of Hypothesis Testing Errors True condition S ample conclusion H0 is true H0 is false 1 Correct decision 1 Type II error Do not reject H0 Reject H0 2 Confidence level 3 Probability = 1 – a 2 Probability = β 1 Type I error 1 Correct decision 2 Significance level 3 Probability = a 2 Power of the test 3 Probability = 1 – β © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 8. What are two methods used in controlling hypothesis testing errors? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Bivariate Analytical Techniques 1. Significance Level (p<0.05) – toleration of error 2. Confidence level (95%) – likelihood of being correct © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 9. What are the steps in hypothesis testing? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Steps in Hypothesis Testing 1. Formulate null (H0) and alternative (H1) hypotheses 2. Select appropriate statistical test for the data type 3. Specify significance level, α 4. Determine value of the test statistic for the given α 5. Perform statistical test, yielding a value of the statistic 6. If the computed test statistic from step 5 is greater than the tabulated value from step 4, the null hypothesis is rejected. The information in the hypothesis test is summarized in a single quantity called the p-value. © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 10. What are the two inferential statistical tests available for analyzing interval data? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Inferential Statistics z-test is X likely to have come from a population with the hypothesized mean value μ = μ0? if is known, then z X / n if is unknown and n is large (generally >30), then: z X X sX s/ n If calculated z value is large enough to occur by sampling error alone less than α of the time, the null hypothesis is rejected in favor of the alternate hypothesis with 1- α confidence. For proportions, when np(1-p) 5, z p p(1 p)/n where p = sample and π = hypothesized proportions. © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Inferential Statistics (continued) t-test If n < 30 and is unknown, the t-test should be used • population must be known to be normally distributed • degrees-of-freedom must be known (for means, df = n-1) holds for any sample size population standard deviation () is estimated by the sample standard deviation, s critical values of the t statistic are in Table A-2, page 659 (Oi Ei )2 Ei i 1 k 2 © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Dr. Ravi Zacharias – Why the Bible? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 11. What is the inferential statistical test used for analyzing nominal data? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Inferential Statistics (continued) chi-square test for nominal data compares hypothesized population distribution (“E”) against observed distribution (“O”): for a univariate chi-square test, df = k - 1 (Oi Ei )2 Ei i 1 k 2 © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Bivariate Procedures Bivariate analysis determines whether the values of one variable offer useful information about the values of another. FIGURE 8.4 Bivariate data analysis procedures © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 12. What questions help researchers identify appropriate bivariate analytical techniques after data collection? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Bivariate Analytical Techniques 1. Relationship 2. Association 3. Prediction © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Q. 13. Define ANOVA. © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. ANOVA (Analysis of Variance) A statistical technique for examining the difference among means for two or more populations. © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Dr. Ravi Zacharias – How do I know God is working in my life? © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Descriptive Statistics for Bivariate Analysis Regression reveals how independent variables relate to a dependent variable and aids predictions based on this. linear correlation coefficient (rXY) measure of the linear relationship between X and Y: n rXY Cov( X , Y ) Cov( X , X )Cov(Y , Y ) n (X i 1 i X )(Yi Y ) (n 1) sx s y (X i 1 i X )(Yi Y ) n n 2 2 ( X X ) ( Y Y ) i i i 1 i 1 n ( X i X ) (Yi Y ) 1 (n 1) i 1 sx s y If r = 0, there is no linear relationship between the variables coefficient of determination (r2) – the exact percentage of variation shared by 2 variables © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Descriptive Statistics for Bivariate Analysis (continued) partitioning the sum of squares: fitting the regression line to predict Yˆi based on Xi , estimate a regression line Yˆ a bX that minimizes the sum of squared errors: i i n b SS XY SS XX (X i 1 i X )(Yi Y ) n (X i 1 i X )2 a Y bX F-test – ratio of regression variance to error variance © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Inferential Statistics for Bivariate Analysis regression coefficient: how many standard errors is the sample (b) value from the hypothesized value (β): t b sb population means: how many standard errors is difference between the means from zero: t X Y s pool where spool is the pooled standard error nominal association: check chi-square of cross-tabulation tables for significance © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.