Download Test 2A solution

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Seven Countries Study wikipedia , lookup

Transcript
M 140
Test 2
A
Name_____________________
SHOW YOUR WORK FOR FULL CREDIT!
Problem
1-10
11
12
13
14
15
16
17 Extra Credit
Total
Max. Points
10
4
4
15
23
8
6
4
70
Your Points
1
Multiple Choice Questions (1 point each)
1. A new headache remedy was given to a group of 25 subjects who had headaches. Four hours after taking
the new remedy, 20 of the subjects reported that their headaches had disappeared. From this information
you conclude:
a. that the remedy is effective for the treatment of headaches.
b. nothing, because the sample size is too small.
c. nothing, because there is no control group for comparison.
d. that the new treatment is better than aspirin.
e. that the remedy is not effective for the treatment of headaches.
2. There are ten departments at SMART University. To conduct a survey, a researcher selects ten faculty
members randomly from each department. The sample selected in this way is called a:
a. Simple random sample.
b. Systematic sample.
c. Stratified random sample.
d. Voluntary response sample.
e. Multistage sample.
3. Refer to Question 2. If the researcher selects every 10th person from the alphabetical list of faculty
members, then the sample selected this way would be:
a. Simple random sample.
b. Systematic sample.
c. Stratified random sample.
d. Voluntary response sample.
e. Multistage sample.
4. Glamour magazine asked readers to fill out a questionnaire printed in the magazine and mail it in.
The questionnaire asked about childhood sexual abuse. 42.6 % percent of the 176 respondents said they
had been sexually abused as children. This figure is probably biased because,
a. the 42.6% of 176 does not make sense here.
b. the sample was voluntary.
c. the sample size was too small.
d. None of the above, there is no reason to doubt the accuracy of the results.
5. There are two statistics classes. The first has 350 students and the second has 250 students. In the first
class the students are instructed to each toss a coin 20 times and record the value of p$ , the proportion of
heads. The instructor them makes a histogram of the 350 values of p$ obtained. The second class did the
same, except that each student tossed a coin 40 times. The histogram of p$ values for the first class should
be
a. more biased since it is based on a smaller number of tosses.
b. less biased since it is based on a larger number of students.
c. more variable since it is based on a smaller number of tosses.
d. less variable since it is based on a larger number of students.
2
6. A polling agency took a random sample of 1000 likely voters in Florida (population: about 18 million),
and another random sample of 1000 likely voters in New Mexico (population: about 2 million). Other
things being equal, the sample from Florida would tend to be ___________ as the sample from New
Mexico.
a. 9 times as accurate
b. 3 times as accurate
c. as accurate
d. one third as accurate
e. one ninth as accurate
7. Right before a presidential election, the Gallup Poll increases its sample size to 4000 instead of the usual
1200. The reason Gallup does this is,
a. to increase the margin of error.
b. to reduce the margin of error.
c. to get better coverage of special interest groups.
d. to use better trained interviewers.
The next three questions refer to the Harvard Nurses Health Study, which began in 1976. The study
included more than 120,000 women who in successive years were asked to answer questions about overall
health, smoking, diet, use of birth control pills, and estrogen supplements. Epidemiologists reviewed the
medical records of these women every year. Today after more than a decade the research team found that
women on an extremely high-fat diet (nearly 50% of energy intake as fat) are at no more risk of breast cancer
than those who adhere to an extremely lean diet (less than 29% fat). They did find a clear association
between high fat diets and colon cancer, however.
8. What type of study is this?
a. randomized experiment
b. observational study
c. opinion poll
9. What two response variables were mentioned in the description?
a. nurses and medical records
b. high fat diet and low fat diet
c. breast cancer and colon cancer
d. smoking, diet, use of birth control pills, and estrogen supplements
e. epidemiologists and Harvard University Nurses
10. Amount of fat in a person's diet is,
a. a lurking variable
b. a response variable
c. an explanatory variable
d. a placebo
3
11. (4 points) Decide if each of the following is an observational study or a designed experiment.
a. Thirty tuna were randomly assigned to two tanks of water, 15 tuna in each tank. One tank was polluted
with methyl mercury, while the other tank was not polluted. The survival times of the fish in the two
tanks were compared.
i. Observational Study
ii. Designed Experiment
b. A biologist measured the increasing amounts of phosphorus in Percy Priest Lake and recoreded a
decreasing number of lake trout over a 7-year period.
i. Observational Study
ii. Designed Experiment
c. A study compared the heights of adults who regularly smoked as adolescents to the height of adults who
never smoked as adolescents.
i. Observational Study
ii. Designed Experiment
d. Fifty children were randomly assigned to listen to heavy metal music (25 children) or a relaxation tape
(25 children). The IQs of the students were measured and compared afterward.
i. Observational Study
ii. Designed Experiment
12. (4 points) Identify each of the following as a parameter or a statistic.
a. the proportion of a random sample of CSUN students who prefer to hear “good news” before bad
statistic
b. the mean number of states visited by all students at CSUN
parameter
c. the mean number of cats in a random sample of 300 American households
statistic
d. the proportion of American adults who told a Gallup pollster that they prefer to abolish rather than retain the
penny
statistic
4
13. (15 points) A recent research project at Lenox Hill Hospital in New York City provided some
information about the laser procedure surgeons have been using in heart patients for more than a
decade. Using a laser beam, they drill a little hole in the patient’s heart to relieve chest pain. A
group of 298 heart patients volunteered for this experiment. They were sedated during the
procedure but awake. They could hear the doctors discuss the laser process, and each patient
thought he or she was receiving the treatment, although the surgeons actually drilled the hole only
in half of the patients’ hearts.
a. What are the explanatory and response variables?
Explanatory variable: whether or not the surgeons drilled a hole in the patient’s heart
Response variable:
chest pain
b. Explain what the placebo was in this study, and why they included it.
The placebo here was that they pretended that drill a hole into the patient’s heart in the control group.
c. Was this also a double-blind study? Explain your answer.
No, obviously, the surgeons knew when they actually drilled the hole.
298 heart
patients
Random assignment
d. Draw the outline of the study. Make sure you indicate the response variable, and group sizes.
149 heart patients –
treatment group: the
hole was drilled in
their hearts
Compare chest pain
149 heart patients –
control group: the
hole wasn’t drilled
in their hearts
5
e. Explain where at what point in the study the experimenters should use random assignment:
When they divided the 298 patients into the two groups. They should have randomly assign each
patient into one of the groups.
f. The experimenters suspected that gender could be one of the lurking variables. Explain briefly what
they could do to control for this variable.
They could use a block design. First they can separate the patients by gender (blocking), then do the
experiment with the females and the males separately.
14. (23 points) Suppose that in order to study/estimate the proportion of all pennies in the U.S. that are
older than 10 years, each student in a statistics class was asked to bring in 100 randomly collected
pennies and check whether each was older than 10 years or not.
a. Clearly identify in words the population, the sample, the parameter of interest, and the statistic in this
situation.
Population: all pennies in the U.S.
Sample: the 100 pennies the students randomly picked the brought to class
Parameter: the proportion of all pennies in the U.S. that are older than 10 years
Statistic: the proportion of the 100 pennies in each student’s sample that are older than 10 years
b. One of the students, Leslie, counted 39 pennies in her sample that were older than 10 years. Let’s
assume that in reality 53% of all pennies in the U.S. are older than 10 years.
Fill in the blanks:
p$ = _0.39__
p = __0.53__
c. Another student had 57 pennies in his sample that were older than 10 years. This fact that different
random samples from a population will give different results is called
_sampling variability_______.
6
d. If we could create the sampling distribution of the sample proportions for all possible samples of size
100, then according to the Central Limit Theorem what would be the shape, the mean and the
standard deviation of this sampling distribution?
Shape: bell shaped because np = 100(0.53) >10, and n(1-p) = 100(1 – 0.53) >10
Mean: p = 0.53
Standard deviation:
p(1 − p)
=
n
0.53(1 − 0.53)
= 0.0499 ≈ 0.05
100
e. Draw the sampling distribution of the sample proportions. Mark the mean, and three standard
deviations below and above the mean, and label them with values.
0.38
0.43
0.48 0.53 0.58 0.63 0.68
d. Based on the sampling distribution, was Leslie’s sample unusual? Explain.
Yes, it’s unusual. Her sample proportion, 0.39, is more than two standard deviations below the mean
(see read mark).
e. If we would create the sampling distribution of the sample proportions for all possible samples of size
of 1000 instead of 100, how would the mean and standard deviation of this sampling distribution
compare to the sampling distribution of n = 100 you described in part (d). Fill in the blanks with one
of the following: smaller than, greater than, the same as.
The mean of the sampling distribution for n = 1000 would be _the same as______ the mean of the
sampling distribution for n = 100.
The standard deviation of the sampling distribution for n = 1000 would be _smaller than__ the
standard deviation of the sampling distribution for n = 100.
7
15. (8 points) The weights of the contents of a cereal box are normally distributed with a mean weight
of 20 ounces and a standard deviation of 0.37 ounce. (Don’t forget to draw and shade in both
parts!)
a. What percent of cereal boxes weigh more than 21 ounces?
z=
21 − 20
= 2.70
0.37
Probability: 0.0035
21
About 0.35% of cereal boxes weigh more than 21 ounces.
b.
Boxes in the lower 5% do not meet the minimum weight requirements and must be repackaged.
What is the minimum weight requirement for a cereal box?
Using the table, or the calculator,
the z-score corresponding the
lower 5% is -1.64 (or -1.65)
z=
5%
21 − 20
= 2.70
0.37
− 164
. =
x − 20
⇒
0.37
x = 19.39
The minimum weight requirement for a cereal box is 19.39 ounces.
8
16. (6 points) Adam and his girlfriend, Betty, often teased each other about their heights: he called
Betty “Shortie” because she was short , and she called him “Beanstalk” because he was tall.
The heights of men are approximately normally distributed with mean 70.1 in and standard
deviation of 2.7in. The heights of women are approximately normally distributed too with mean of
64.8in and 2.5 in. If Adam is 75.9 in tall, and Betty’s height is 59.7 in, who has “more right” to call
the other one as “Shortie” or “Beanstalk” (i.e. whose height is more extreme)?
The standardized value for Adam’s height is z =
75.9 − 701
.
= 2.15 .
2.7
The standardized value for Betty’s height is z =
59.7 − 64.8
= −2.04 .
2.5
Since Adam’s height is farther from the mean than Betty’s height, she has more right to call him
Beanstalk. Adam’s height is more extreme.
9
17. EXTRA CREDIT: The histograms below show four sampling distributions of statistics
intended to estimate the same parameter. Label each distribution relative to the others as having
large or small bias and as having large or small variability. (Circle the correct choices.)
Bias:
large
Variability: large
Bias:
large
Variability: large
small
small
small
small
Bias:
large
Variability: large
small
small
Bias:
large
Variability: large
small
small
10