Download MATCHING: For the following problems, write the letter of the most

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Categorical variable wikipedia , lookup

Regression toward the mean wikipedia , lookup

Transcript
MATCHING: For the following problems, write the letter of the most appropriate
statistical analysis technique next to the story. Note: each answer choice may be used
once, more than once, or not at all.
A. Mean and/or
standard deviation
E. Matched pairs
t-test
I. Chi-squared test
B. Simple linear
regression
F. Comparison of
means t-test
J. One-way
ANOVA
C. Multiple
linear regression
G. 1-sample
proportion Z-test
K. Two-way
ANOVA
D. 1-sample mean
t-test
H. 2-sample
proportion Z-test
L. Five number
summary
_____ 1.
Is college major associated with the region of the country a student is from
in the population?
_____ 2.
A group of taste testers is asked to rate (in random order) both Coke and
Pepsi on a scale of 1-10. Is Coke better tasting than Pepsi on average in
the population?
_____ 3.
What is the average taste test rating for Coke on a scale of 1-10 from a
sample of 30 taste testers?
_____ 4.
Do major and region of the country a student is from affect the population
mean GPA?
_____ 5.
Which of the following are important for predicting a student’s graduation
GPA in the population: the number of semesters a student spends in
college, their high school GPA, and their SAT scores?
_____ 6.
Is the population percentage of people in Tippecanoe county who are
Purdue students significantly higher in 2007 than it was in 1994 if a
random sample of residents was taken both years?
_____ 7.
Are height and weight independent in the population?
_____ 8.
Are freshman taller than seniors on average in the population?
_____ 9.
Are population average GPAs the same for science and engineering
majors?
_____ 10.
Are population average GPAs the same for science, engineering, nursing,
and communications majors?
_____ 11.
Is the population average GPA 3.0?
_____ 12.
What is the middle GPA for this sample of students?
1
MATCHING: (3 points each) For problems 1-11, write the letter of the most
appropriate statistical analysis technique next to the story.
Note: each answer choice may be used once, more than once, or not at all.
_____ 1.
Does knowing a college student’s SAT score tell us anything
about his or her first year college GPA in the population?
a.
Mean and/or
standard deviation
_____ 2.
Do college grade point averages differ for male athletes in
football, swimming, and crew in the population?
b.
Five number
summary
_____ 3.
A college instructor wants to know whether the population
average high school GPA for freshman enrolled in college
algebra is below 3.0.
c.
Simple linear
regression
d.
Multiple linear
regression
e.
1-sample mean ttest
f.
Matched pairs t-test
g.
2-sample
(Comparison of
means) t-test
h.
1-sample proportion
Z-test
i.
2-sample proportion
Z-test
j.
Chi-squared test
k.
One-way ANOVA
l.
Two-way ANOVA
_____ 4.
_____ 5.
Does knowing high school seniors’ IQ scores, SAT scores,
and high school GPA tell us anything about their first year
college GPAs in the population?
Do New Mexican high school seniors have higher population
averages than Vermont high school seniors on the national
achievement exam?
_____ 6.
Is college major related to political party affiliation in the
population?
_____ 7.
Do identical twins differ from each other on average in
reading achievement scores in the population?
_____ 8.
_____ 9.
Is support for a school bond issue (Yes or No) higher in the
University Farms than in Chauncey Village neighborhoods?
Is there a difference in population average national
achievement exam scores for seniors from New Mexico,
Florida, and Vermont?
_____ 10.
What is the average national achievement exam score for
200 New Mexico high school seniors?
_____ 11.
Do college major and gender affect the mean GPA for the
population of Purdue students?
2
A. Mean and/or standard B. Simple linear regression
C. Multiple linear regression
deviation
E. Matched pairs t-test F. Comparison of
G. 1-sample
means t-test
proportion Z-test
I. Chi-squared test
J. One-way ANOVA K. Two-way ANOVA
_____ 1.
vacations?
D. 1-sample mean t-test
H. 2-sample proportion
Z-test
L. Five number summary
What is the average amount of money students in this class spend each year on
_____ 2.
Do more than 75% of college students in the population go someplace other than
to their parents’ house for spring break?
_____ 3.
Does the amount of debt a student has increase, on average in the population, if
you compare before and after spring break credit card statements?
_____ 4.
Are g.p.a. and amount of credit card debt independent in the population?
_____ 5.
Is there a difference in average amount of credit card debt if you compare
freshman, sophomores, juniors and seniors in the population of college students?
_____ 6.
What is the range of the middle 50% of my 50 students’ exam scores?
_____ 7.
Is there a relationship between the number of students enrolled in a class and the
average exam score for that class for all the classes at Purdue?
_____ 8.
Do Indiana-born students score higher on average in the population on this test
than students born outside the state?
_____ 9.
Is the population mean of all students taking a national exam different from the 80
that the authors aimed for?
_____ 10. Are political party preference (Democrat, Republican, and Independent) and ice
cream flavor preference related in the population?
_____ 11.
women?
Is the population percentage of unemployed men lower than for unemployed
_____ 12. Are the type of pet you own and your hair color related to your annual salary in
the population?
_____ 13. Is there a population mean difference in preschoolers’ spatial-temporal reasoning
scores after having piano lessons?
_____ 14.
Do higher I.Q. people also have higher college g.p.a.s in the population?
3
_____ 15. Do annual salary, I.Q., and number of children affect a person’s happiness rating
(on a scale of 1-10) in the population?
_____ 16. Do flavor of M&Ms (plain, peanut, and peanut butter) and color
distribution (red, orange, yellow, green, blue, and brown) make a difference to the
population mean taste ratings (scale of 1-10)?
_____ 17. Is there a difference in the population average number of chips in a regular
Chips Ahoy cookie and a reduced-fat cookie?
_____ 18. Is the type of music playing over the speakers in a store associated with
the type of wine a customer buys in the population?
_____ 19. What is the average number of calories in a sample of 20 “fun packs” of
Halloweeen candy?
_____ 20. Does the season (fall, winter, spring, summer) make a difference to
population average pollen counts?
_____ 21. Does the season (fall, winter, spring, summer) and type of trees growing
nearby make a difference to population average pollen counts?
_____ 22. Do the temperature (in degrees), number of trees growing nearby, and %
humidity affect pollen counts in the population?
_____ 23. Do pollen counts decrease on average between noon and midnight on
particular days if a sample of 30 days is used to estimate what is true for the population?
_____ 24.
allergies?
Do at least 30% of the population of Lafayette residents have pollen
_____ 25.
Are gender and type of allergy (pet, dust, pollen) related in the population?
_____ 26. Does increasing the number of ounces of water a person drinks decrease a
person’s calorie consumption on a daily basis in the population?
_____ 27. Does the type of class activity (lab, homework, lecture) used to teach a
concept make a difference to the population average quiz score on that activity?
_____ 28.
What is the minimum score on this exam?
_____ 29. Is the population percentage of students passing this exam higher for
students who did their practice problems than for those who didn’t?
_____ 30. Is the population average number of ounces of water a college student
drinks daily less than the 64 ounces recommended by health experts?
4
So how do you know which is which? Start by deciding whether your variable(s) is/are
categorical or quantitative. The first 2 are statistics, the rest are inference techniques.
Mean and/or standard deviation: just looking for a summary statistic, variable is
quantitative
Five-number summary: Min, Q1, M, Q3, Max, summary statistics for a quantitative
variable
Simple linear regression: using x to predict y, are x and y independent?, x and y are
correlated, as x increases what happens to y, x and y are both quantitative variables
Multiple linear regression: similar to simple linear regression except there will be more
than one x, all variables will be quantitative
1-sample mean t-test: comparing a single mean to a number, each unit is asked a
numerical question (“how many times a week do you ride the bus?”), don’t know the
population standard deviation, quantitative variable
Matched pairs t-test: before and after, left and right, all units are tested twice (a unit
could be a pair of subjects or pair of units) and measurements are compared to each other,
average difference, quantitative variable
Comparison of means t-test: 2 distinct populations with samples chosen independently,
each unit is measured only once, one quantitative variable (your measurement) and one
categorical variable (how you distinguish between your groups, like gender or eye color)
1-sample proportion Z-test: each unit is asked a yes/no question (“did you ride the bus
today?”), proportion comes from taking the # of successes / # of trials, comparing a
proportion or percentage to a number between 0 and 1
2-sample proportion Z-test: similar to 1-sample proportion Z-test except there are 2
distinct populations with samples selected independently so that the proportions can be
compared to each other
Chi-squared test: 2 categorical variables, testing whether there is an association
between the variables
One-way ANOVA: one categorical variable (how you distinguish between your groups)
and one quantitative variable (your measurement), similar to comparison of means t-test
except you can have more than 2 groups to compare
Two-way ANOVA: two categorical variables (how you distinguish between your
groups) and one quantitative variable (your measurement), similar to doing one-way
ANOVA twice except you can also look for the interaction between the two categorical
variables
5