Hmwk 1: DATA PREPARATION and UNDERSTANDING (50 pts)

Name: Maitrinh Nguyen

Please provide your answers in this document. Feel free to copy and paste from pertinent Enterprise Guide output. Save your document in a file called: Day 3 – Homework 1 – last name and first name.

1. Using the Codebook and viewing the data, for each of the following variables, please indicate the Variable Type (categorical or numeric). For each Categorical Variable, indicate whether the level of measure is Nominal or Ordinal. Also put a check by any variable that is considered BINARY. (12 pts)

VARIABLE NAME          VARIABLE TYPE    NOMINAL or ORDINAL
AGE                    Numeric
GENDER                 Categorical      Nominal
ETHNICITY              Categorical      Nominal
CLASSIFICATION         Categorical      Ordinal
COLLEGE OF BUSINESS    Categorical      Nominal
MAJOR                  Categorical      Nominal
ISDS                   Categorical      Nominal
GPA                    Numeric
R1                     Categorical      Ordinal
R2                     Categorical      Ordinal
R3                     Categorical      Ordinal
R4                     Categorical      Ordinal

BINARY: three variables are marked Binary.

2. For each of the NUMERIC VARIABLES, provide each of the following: (12 pts)

AGE
a. Histogram
b. Box-and-Whisker Plot
c. Are there any outliers (Yes or No)? Explain using Box-and-Whisker Plot.
Yes. Every square marker plotted above or below the whiskers (the vertical lines) is an outlier.
d. Normal Probability Plot
e. Are the data normal (Yes or No)? Explain using K-S test.
No. p < .01, therefore we reject H0: we have evidence, based upon our sample, that the data come from a population that is not normal.
f. Mean, Median, Mode, Skewness, Min, Max, Range, Variance, and Standard Deviation

2. For each of the NUMERIC VARIABLES, provide each of the following: (12 pts)

GPA
a. Histogram
b. Box-and-Whisker Plot
c. Are there any outliers (Yes or No)? Explain using Box-and-Whisker Plot.
Yes. Every square marker plotted above or below the whiskers (the vertical lines) is an outlier.
d. Normal Probability Plot
e. Are the data normal (Yes or No)? Explain using K-S test.
No. p < .01, therefore we reject H0: we have evidence, based upon our sample, that the data come from a population that is not normal.
f. Mean, Median, Mode, Skewness, Min, Max, Range, Variance, and Standard Deviation

3. For each of the CATEGORICAL VARIABLES, provide each of the following: (10 pts)
a. Bar Graph
b. Proportion in Each Group

4. Analyze GPA for Males and Females separately. Describe various ways in which the two groups differ on GPA. (4 pts)
The mean GPA for males was 2.98, which is .15 less than for females (3.13).
The skewness for males (-.022) was much smaller in magnitude than for females (-.319), so the male GPA distribution is close to symmetric while the female distribution is more left-skewed.

5. Analyze GPA for ISDS majors and NON-ISDS separately. Describe various ways in which the two groups differ on GPA. (4 pts)
The mean GPA of NON-ISDS majors is 3.06, which is .15 higher than that of ISDS majors (2.91), but the group sizes are very unequal: there are far more NON-ISDS students than ISDS students (2444:161).
The skewness for ISDS majors (.10) is higher than for NON-ISDS majors (-.16): slightly right-skewed versus slightly left-skewed.

6. Analyze CAS (Computer Anxiety Score) for ISDS majors and NON-ISDS majors separately. Describe various ways in which the two groups differ on CAS. (4 pts)
ISDS majors (5.88), on average, scored much higher than NON-ISDS majors (2.41).
The skewness of the two groups was very different: ISDS majors (-1.33) versus NON-ISDS majors (-.43).
NON-ISDS majors (3.44) had a higher standard deviation than ISDS majors (2.58).
The variances differed substantially as well (11.86 versus 6.65).

7. How is GENDER related to whether or not a student majors in ISDS or not? Use Data Visualization to answer this question. (4 pts)
Among NON-ISDS majors, males slightly outnumber females.
Among ISDS majors, males significantly outnumber females, approximately 3:1.
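The summary statistics requested in item f of question 2, and the outlier check in item c, can be sketched outside Enterprise Guide as follows. This is a minimal illustration, not the assignment's method: the `ages` list is made-up data, the function names `describe` and `boxplot_outliers` are hypothetical, the outlier rule is the common 1.5 x IQR fence convention, and the quartile and skewness formulas used here may differ slightly from the ones Enterprise Guide applies.

```python
import statistics


def describe(values):
    """Return the summary statistics listed in item f for a list of numbers."""
    n = len(values)
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation (divides by n - 1)
    # Adjusted Fisher-Pearson sample skewness; other definitions exist.
    skew = (n / ((n - 1) * (n - 2))) * sum(((x - mean) / sd) ** 3 for x in values)
    return {
        "mean": mean,
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "skewness": skew,
        "min": min(values),
        "max": max(values),
        "range": max(values) - min(values),
        "variance": statistics.variance(values),  # sample variance
        "std_dev": sd,
    }


def boxplot_outliers(values):
    """Return values beyond the 1.5 x IQR fences (assumed box-plot convention)."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [x for x in values if x < low or x > high]


# Hypothetical ages, NOT taken from the homework data set.
ages = [19, 20, 20, 21, 21, 21, 22, 23, 24, 45]
summary = describe(ages)
outliers = boxplot_outliers(ages)

for name, value in summary.items():
    print(f"{name}: {value}")
print("outliers:", outliers)
```

With these made-up ages the only value flagged is 45, and the positive skewness reflects that single large value pulling the right tail out.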
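The K-S reasoning in item e of question 2 rests on the distance between the sample's empirical distribution and a fitted normal distribution. The sketch below computes only that distance (the statistic D) with the mean and standard deviation estimated from the sample. It does not compute a p-value, since the p-value Enterprise Guide reports depends on its own procedure. The function name `ks_statistic_normal` and both sample lists are hypothetical.

```python
import math
from statistics import NormalDist, mean, stdev


def ks_statistic_normal(values):
    """Largest gap between the empirical CDF and a normal CDF fitted to the sample."""
    xs = sorted(values)
    n = len(xs)
    fitted = NormalDist(mean(xs), stdev(xs))
    d = 0.0
    for i, x in enumerate(xs, start=1):
        cdf = fitted.cdf(x)
        # The empirical CDF jumps from (i - 1) / n to i / n at x; check both sides.
        d = max(d, i / n - cdf, cdf - (i - 1) / n)
    return d


# Hypothetical samples: evenly spaced normal quantiles, and a right-skewed transform.
symmetric = [NormalDist().inv_cdf((i + 0.5) / 50) for i in range(50)]
skewed = [math.exp(z) for z in symmetric]

d_symmetric = ks_statistic_normal(symmetric)
d_skewed = ks_statistic_normal(skewed)
print(f"D for near-normal sample: {d_symmetric:.3f}")
print(f"D for skewed sample:      {d_skewed:.3f}")
```

A larger D means the sample departs further from the fitted normal curve, which is what a small p-value (such as p < .01) signals in the answers above.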
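The group comparisons in questions 4 through 6 involve a few pieces of arithmetic that are easy to check: the differences in mean GPA, the ratio of group sizes, and whether each reported variance agrees with the square of the reported standard deviation. The sketch below uses only figures quoted in those answers; the variable names are hypothetical, and small mismatches are expected because the reported values are rounded.

```python
# Figures copied from the answers to questions 4-6.
gpa_mean = {"male": 2.98, "female": 3.13, "isds": 2.91, "non_isds": 3.06}
counts = {"non_isds": 2444, "isds": 161}
cas_sd = {"isds": 2.58, "non_isds": 3.44}
cas_var = {"isds": 6.65, "non_isds": 11.86}

# Differences in mean GPA between the groups.
gender_gap = round(gpa_mean["female"] - gpa_mean["male"], 2)
major_gap = round(gpa_mean["non_isds"] - gpa_mean["isds"], 2)

# How unequal the group sizes are.
size_ratio = counts["non_isds"] / counts["isds"]
isds_share = counts["isds"] / sum(counts.values())

# Variance should be close to the squared standard deviation.
var_from_sd = {group: sd ** 2 for group, sd in cas_sd.items()}

print(f"female - male mean GPA: {gender_gap}")
print(f"NON-ISDS - ISDS mean GPA: {major_gap}")
print(f"NON-ISDS to ISDS ratio: {size_ratio:.1f} to 1")
print(f"ISDS share of all students: {isds_share:.1%}")
for group, value in var_from_sd.items():
    print(f"{group}: sd^2 = {value:.2f}, reported variance = {cas_var[group]}")
```

Because ISDS majors make up only a small share of the sample, their mean and skewness are estimated from far fewer students than the NON-ISDS figures.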