Spring 2016 Unit 3 Review Pink SOLUTIONS

Math 075
Exam 1 Review
Module 2
NAME:_____________________________________
Directions: Do not use Minitab. A calculator is allowed.
FORMULAS:
$SD = \sqrt{\dfrac{\sum (x - \bar{x})^2}{n - 1}}$
$IQR = Q_3 - Q_1$
$ADM = \dfrac{\sum |x - \bar{x}|}{n}$
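The formulas above can be checked with a short script. This is a minimal sketch, not part of the original review: the function names and the five sample values are made up for illustration.

```python
from math import sqrt

def mean(xs):
    """Arithmetic mean of a list of numbers."""
    return sum(xs) / len(xs)

def sample_sd(xs):
    """Standard deviation with the n - 1 divisor, as in the SD formula."""
    m = mean(xs)
    return sqrt(sum((x - m) ** 2 for x in xs) / (len(xs) - 1))

def adm(xs):
    """Average distance from the mean, as in the ADM formula (divisor n)."""
    m = mean(xs)
    return sum(abs(x - m) for x in xs) / len(xs)

def iqr(q1, q3):
    """Interquartile range from the two quartiles."""
    return q3 - q1

data = [2, 4, 4, 6, 9]  # made-up values; their mean is 5
print("SD  =", round(sample_sd(data), 2))
print("ADM =", adm(data))
print("IQR =", iqr(3, 7))  # quartiles 3 and 7 are made up as well
```

Note that SD divides by n - 1 while ADM divides by n, so the two measures are not interchangeable even on the same data.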
1) The body temperature of students is taken each time a student goes to the nurse's office. The
five-number summary for the temperatures (in degrees Fahrenheit) of students on a particular
day is:
Min = 96.6, Q1 = 97.85, Median = 98.25, Q3 = 98.6, Max = 101.6
a) Would you expect the mean temperature of all students who visited the nurse's office to be
higher or lower than the median? Explain.
Look at the distance from the median to each extreme (min and max). If the distance to one
extreme is much bigger than the distance to the other, then the distribution is skewed toward
the farther extreme. If the two distances are close in length, then the distribution is
roughly symmetric.
Median - Min = 98.25 - 96.6 = 1.65
Max - Median = 101.6 - 98.25 = 3.35
The distance from the Median to the Max is about twice as big as the distance from the Median
to the Min. So the distribution is skewed to the right. This means that the mean will be
higher than the median, since the mean is pulled toward the tail.
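The comparison of the two distances can be written as a small helper. This is only a sketch: the function name and the cutoff `ratio=1.5` for "much bigger" are assumptions, since the review does not give a numeric threshold.

```python
def skew_direction(minimum, median, maximum, ratio=1.5):
    """Guess the skew from a five-number summary by comparing tail lengths.

    `ratio` is an assumed cutoff for "much bigger"; it is not from the review.
    """
    lower = median - minimum   # distance from the median down to the min
    upper = maximum - median   # distance from the median up to the max
    if upper >= ratio * lower:
        return "right"
    if lower >= ratio * upper:
        return "left"
    return "roughly symmetric"

# Nurse's office temperatures: min 96.6, median 98.25, max 101.6
print(skew_direction(96.6, 98.25, 101.6))
```

A right skew suggests the mean sits above the median, which matches the answer to part a).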
b) After the data were picked up in the afternoon, three more students visited the nurse's
office with temperatures of 96.7°, 98.4°, and 99.2°. Were any of these students outliers?
Explain using the fences at $Q_1 - 1.5 \times IQR$ and $Q_3 + 1.5 \times IQR$.
Upper fence is at $Q_3 + 1.5 \times IQR$ = 98.6 + 1.5*(98.6 - 97.85) = 99.725
Lower fence is at $Q_1 - 1.5 \times IQR$ = 97.85 - 1.5*(98.6 - 97.85) = 96.725
The student with the 96.7° temperature is an outlier because 96.7 falls below the lower fence
of 96.725. The 98.4° and 99.2° temperatures lie between the fences, so they are not outliers.
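The fence check can be sketched in a few lines. The helper name `fences` is made up for illustration; the quartiles and temperatures are the ones used in the solution above.

```python
def fences(q1, q3):
    """Return the (lower, upper) outlier fences at 1.5 * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

low, high = fences(97.85, 98.6)
new_temps = [96.7, 98.4, 99.2]

# A value is flagged when it lies outside the interval [low, high].
outliers = [t for t in new_temps if t < low or t > high]

print("fences:", round(low, 3), round(high, 3))
print("outliers:", outliers)
```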
2) The boxplots show the ages of people involved in accidents according to their role in
the accident.
a) Which role involved the youngest person,
and what is the age?
Passenger, 0
b) Which role involved the person with the
lowest median age, and what is the age?
Passenger, 23
c) Which role involved the smallest range of
age, and what is it?
Cyclist, 52-10 = 42
d) Which role involved the largest IQR of age,
and what is it?
Pedestrian, 64-20 = 44
e) Which role has the most symmetric distribution? Explain.
Pedestrian, because the distances from the median to the two extremes are about the same.
f) Which role has the most skewed distribution? Explain.
Passenger, because the distance from the median to the max is so much bigger than the
distance from the median to the min.
3) All students in the physical education class completed a basketball free-throw shooting event
and the highest number of shots made was 32. The next day, the PE teacher realized that he
had made a mistake. The student had actually made 35 shots. Indicate whether changing the
student's score made each of these summary statistics increase, decrease, or stay about the
same:
a) Mean - Increase
b) Median - Remain the same
c) Range - Increase
d) IQR - Remain the same
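A quick experiment shows the same pattern. The seven scores below are made up (the review only says the top score changed from 32 to 35), and the quartiles are taken as the medians of the lower and upper halves, which is an assumed convention.

```python
from statistics import mean, median

def summary(scores):
    """Mean, median, range, and IQR (quartiles = medians of the two halves)."""
    s = sorted(scores)
    half = len(s) // 2
    q1, q3 = median(s[:half]), median(s[-half:])
    return {"mean": mean(s), "median": median(s),
            "range": s[-1] - s[0], "iqr": q3 - q1}

recorded = [12, 15, 18, 20, 22, 25, 32]   # hypothetical class scores
corrected = [12, 15, 18, 20, 22, 25, 35]  # same scores with the top one fixed

before = summary(recorded)
after = summary(corrected)
for name in before:
    print(name, before[name], "->", after[name])
```

Only the mean and the range depend on the exact value of the maximum; the median and IQR depend on the middle of the data.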
4) The mean number of hours worked for the 30 males was 6, and for the 20 females was 9. What
is the overall mean number of hours worked?
$\bar{x} = \dfrac{30 \cdot 6 + 20 \cdot 9}{50} = \dfrac{360}{50} = 7.2$
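The overall mean weights each group mean by its group size. A minimal sketch, with a made-up function name:

```python
def combined_mean(groups):
    """Overall mean from (group size, group mean) pairs."""
    total = sum(n * m for n, m in groups)   # total hours across all people
    count = sum(n for n, _ in groups)       # total number of people
    return total / count

# 30 males averaging 6 hours, 20 females averaging 9 hours
overall = combined_mean([(30, 6), (20, 9)])
print(overall)
```

Simply averaging 6 and 9 would give 7.5, which ignores that there are more males than females.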
5) Create a boxplot for this set of data using the scale below. Remember the fences are at
$Q_1 - 1.5 \times IQR$ and $Q_3 + 1.5 \times IQR$.
6) We collect these data from 50 male students. Which variable is categorical and which is
quantitative?
A) eye color - C
B) head circumference - Q
C) marital status - C
D) number of cigarettes smoked daily - Q
E) number of TV sets at home - Q
7) Why might you choose to display data with a dotplot rather than a boxplot?
A dotplot shows approximately what the individual values are, whereas a boxplot only gives you the
five-number summary.
8) The 1999 Consumer Reports New Car Buying Guide reported the number of seconds required for
a variety of cars to accelerate from 0 to 30 mph. The cars were also classified into six
categories by type. The following boxplots display the distributions of acceleration times for
each type of car. (Note: the asterisks on the boxplot for the small cars denote outliers.)
a) If we compare a typical car in each category, which type accelerates the fastest? What
part(s) of the boxplots did you compare to make your choice?
The sports car accelerates the fastest. I used the median to make my choice.
b) If we compare the range of acceleration times for each car type, which type performs
the most consistently? What part of the boxplots did you compare to make your choice?
The large car has the most consistent acceleration time. I looked at the IQR (distance
from Q1 to Q3 or the length of the entire box). I also looked at the range (distance
from min to max or the length of the entire graph.) I noticed that the small car has the
smallest IQR, but the large car has the smallest range. I decided to pick the large car
as the most consistent, because it had the smallest range and no unpredictable outliers.
c) Now, let's focus only on the Small cars. If the outliers were removed from the dataset
of Small cars, which of the following measures of spread would be least affected?
Overall range, interquartile range (the distance between the 1st and 3rd quartile marks),
or standard deviation.
The IQR would be the least affected because the middle 50% of the values would remain about
the same after the outliers are removed. The SD and range would definitely decrease.
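The effect of dropping outliers can be tried on made-up numbers. The values below are hypothetical (they are not the Consumer Reports data), with 50 and 55 playing the role of the outliers, and the quartiles are taken as the medians of the lower and upper halves.

```python
from statistics import median, stdev

def spread(values):
    """Range, IQR (quartiles = medians of the two halves), and sample SD."""
    s = sorted(values)
    half = len(s) // 2
    q1, q3 = median(s[:half]), median(s[-half:])
    return {"range": s[-1] - s[0], "iqr": q3 - q1, "sd": stdev(s)}

with_outliers = [30, 31, 32, 33, 34, 35, 36, 37, 50, 55]  # hypothetical data
without_outliers = [30, 31, 32, 33, 34, 35, 36, 37]       # outliers removed

full = spread(with_outliers)
trimmed = spread(without_outliers)

# How much each measure drops once the outliers are gone.
change = {name: full[name] - trimmed[name] for name in full}
print(change)
```

On this sample the IQR moves by 1 while the range and SD drop much more, which is the pattern the answer describes.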
9) Which is true of the data whose distribution is shown?
I. The distribution is skewed to the right.
II. The mean is smaller than the median.
III. We should summarize with mean and standard deviation.
I only
III only
I and II
D) II and III
E) I, II, and III
10) The IQR of the data displayed in this dotplot is most likely to be ...
[Dotplot not shown; horizontal axis marked at 20, 40, 60, 80, 100]
A) 5
B) 12
C) 1
D) 65
11) A boxplot is a graphical summary of the data set. I cannot tell by looking at the boxplot how
many data points are in the data set or how the data are distributed within each quartile. To
illustrate this important idea, make up two different sets of data to match this boxplot. Put
10 numbers in one of the data sets and 13 numbers in the other data set.
Set 1: 0, 1 or 2, 2, 2 or 3, 2 or 3, 3 or 4, 4 or 5, 5, 5 or 6, 6
The two middle numbers (the 5th and 6th) must have an average of 3.
Set 2: 0, 0 or 1 or 2, 0 or 1 or 2, 2, 2 or 3, 2 or 3, 3, 3 or 4 or 5, 3 or 4 or 5, 5, 5 or 6, 5 or 6, 6
Note: I may ask for a data set with a large SD or ADM. Keep in mind that if you want a
large spread, try to keep the numbers as far away from the mean as possible, and if you want
a small spread, try to keep the numbers as close to the mean as possible. In this type of
problem I would have to mention what the mean is.
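One way to check a made-up data set is to compute its five-number summary. The sketch below assumes the boxplot (not reproduced in this text) has min 0, Q1 2, median 3, Q3 5, and max 6, which is inferred from the answer key. It also assumes quartiles are the medians of the lower and upper halves. The two concrete sets are one choice among the allowed options.

```python
from statistics import median

def five_number_summary(values):
    """Min, Q1, median, Q3, max, with quartiles as medians of the two halves."""
    s = sorted(values)
    half = len(s) // 2  # the middle value is left out of both halves when n is odd
    return (s[0], median(s[:half]), median(s), median(s[-half:]), s[-1])

target = (0, 2, 3, 5, 6)  # assumed summary, inferred from the answer key

set_1 = [0, 1, 2, 2, 3, 3, 4, 5, 5, 6]            # 10 numbers
set_2 = [0, 1, 2, 2, 2, 3, 3, 4, 4, 5, 5, 6, 6]   # 13 numbers

for name, data in (("Set 1", set_1), ("Set 2", set_2)):
    print(name, len(data), five_number_summary(data) == target)
```

Two sets of different sizes give the same summary, so they would draw the same boxplot.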
12) A class of fourth graders takes a diagnostic reading test, and scores are reported by reading
grade level. The five-number summaries for the boys and girls are shown below.
        Min  Q1   Median  Q3   Max
Boys:   2.8  4.1  4.8     5.5  5.6
Girls:  2.1  4.5  4.9     5.6  5.8
e) Which group has the highest score? Circle one: Boys / Girls
Girls with a max of 5.8
f) Which group has the greatest range? Circle one: Boys / Girls
Girls with a range of 5.8 - 2.1 = 3.7
g) Which group has the highest IQR? Circle one: Boys / Girls
Boys with an IQR = 5.5 - 4.1 = 1.4
h) Which group's scores appear to be more skewed? Explain.
Girls:
Median - Min = 4.9 - 2.1 = 2.8
Max - Median = 5.8 - 4.9 = .9
Boys:
Median - Min = 4.8 - 2.8 = 2
Max - Median = 5.6 - 4.8 = .8
The girls' scores are more skewed because the distance from the Median to the Min (2.8) is
about three times the distance from the Max to the Median (.9). The boys' distances
(2 and .8) are closer to each other than the girls' distances (2.8 and .9).
i) Which group generally did better on the test? Explain.
Girls did better because they have a higher Q1, median, Q3, and max. They had a low min, but the
distribution was skewed to the left, meaning that there weren't very many low scores.
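The comparisons in this problem all come straight from the two five-number summaries. A small sketch (the function and key names are made up) that recomputes them:

```python
def describe(five):
    """Range, IQR, and the two median-to-extreme distances from a five-number summary."""
    minimum, q1, med, q3, maximum = five
    return {
        "range": round(maximum - minimum, 2),
        "iqr": round(q3 - q1, 2),
        "median_to_min": round(med - minimum, 2),
        "max_to_median": round(maximum - med, 2),
    }

boys = describe((2.8, 4.1, 4.8, 5.5, 5.6))
girls = describe((2.1, 4.5, 4.9, 5.6, 5.8))

print("Boys: ", boys)
print("Girls:", girls)
```

The rounding only hides floating-point noise; the inputs have one decimal place.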
13) Consider the following data values. Find the following for each of the data sets.
Set A: 1, 2, 4, 5, 5, 6, 6, 6, 7, 8, 9
Set B: 0, 1, 1, 2, 2, 2, 3, 4, 5, 6, 7, 8, 8, 9
Set A:
e) Min: 1
f) Q1: 4
g) Median: 6
h) Q3: 7
i) Max: 9
j) IQR: 3
k) Range: 8
l) Mean: 5.36
m) ADM: 1.79
n) Standard Deviation: 2.38
Set B:
a) Min: 0
b) Q1: 2
c) Median: 3.5
d) Q3: 7
e) Max: 9
f) IQR: 5
g) Range: 9
h) Mean: 4.14
i) ADM: 2.59
j) Standard Deviation: 3.01
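The answers can be recomputed with a short script. Two assumptions are labeled here and in the code: the data values are a reconstruction chosen so that every statistic matches the answer key (11 values in Set A, 14 in Set B), and quartiles are taken as the medians of the lower and upper halves.

```python
from statistics import mean, median, stdev

def describe(values):
    """Summary statistics; quartiles are the medians of the lower and upper halves."""
    s = sorted(values)
    half = len(s) // 2  # the middle value is excluded from both halves when n is odd
    q1, q3 = median(s[:half]), median(s[-half:])
    m = mean(s)
    return {
        "min": s[0], "q1": q1, "median": median(s), "q3": q3, "max": s[-1],
        "iqr": q3 - q1, "range": s[-1] - s[0],
        "mean": round(m, 2),
        "adm": round(sum(abs(x - m) for x in s) / len(s), 2),  # divisor n
        "sd": round(stdev(s), 2),                               # divisor n - 1
    }

# Assumed reconstruction of the two data sets (see the note above).
set_a = [1, 2, 4, 5, 5, 6, 6, 6, 7, 8, 9]
set_b = [0, 1, 1, 2, 2, 2, 3, 4, 5, 6, 7, 8, 8, 9]

stats_a = describe(set_a)
stats_b = describe(set_b)
print("Set A:", stats_a)
print("Set B:", stats_b)
```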