Download Introduction to Statistics Sample Exam Chapters 1

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Time series wikipedia , lookup

Student's t-test wikipedia , lookup

Misuse of statistics wikipedia , lookup

Transcript
Introduction to Statistics
Sample Exam Chapters 1 - 4
1. To estimate the percentage of Regis College students who use illegal drugs, a uniformed police officer
approached students outside the student center, asked their names, and then asked whether they had
used illegal drugs in the last month. She continued asking students until she had 40 responses.
(a) What is the population for this survey?
(b) What is the sample for this survey?
(c) What factors make this survey subject to bias?
(d) Describe another way of collecting the data, and mention at least one advantage and at least one
disadvantage of your method.
2. Fairfield College is considering a new course evaluation form for all the classes taught on campus. One
concern is the amount of time necessary to fill out the form. A random sample of 30 students were
asked to fill out the form for the class they had at 10:00 a.m. The times in minutes to fill out the form
were as follows:
18 19 8 12 18 20 15 17 19 23
22 30 21 16 44 17 15 12 18 26
51 23 35 31 42 5
2
3
9 12
Compute the five number summary and make a box and whisker plot for this data. Be sure to indicate
the scale.
Describe the shape of the display; is it skewed one way or the other?
3. The Richter scale is used to measure the magnitude of earthquakes. A sample of 15 earthquakes
occurring in the United States had Richter scale ratings of
6.0
2.0
2.1
3.1
1.3
4.6
5.1
1.2
3.4
1.8
2.1
2.7
1.4
3.5
1.5
Compute the five number summary and the interquartile range. Make a box and whisker plot. Is this
data skewed right or left?
4. We wish to make a histogram representing the following data.
63
20
34
21
31
24
13
46
54
51
12
10
34
18
15
21
27
22
14
35
18
15
4
42
If we wish the histogram to have 6 bars, what should the class width be?
5. The frequency table for the data in problem 4 above is shown below. Fill in the blank entries.
Class Limits
Class Boundaries
Midpoint
8.5
Freq.
4
Rel. Freq.
16.7%
9
37.5%
3
Cumulative Freq
4
16
4
16.7%
20
6. The histogram below shows the frequencies of the number of houses sold in a month by a particular
realty company.
Use the information from the histogram to compute the sample mean and sample standard deviation.
Organize your work in a table showing all your calculations.
7. The Richter scale is used to measure the magnitude of earthquakes. A sample of 6 earthquakes occurring
in the United States had Richter scale ratings of
6.0
2.1
1.3
5.1
3.4
2.1
Compute the sample mean and standard deviation for this data.
8. A population of golfish in a large fish tank was studied and the mean length was found to be 6
centimeters with a standard deviation of 1.1 cm. A separate study looked at the lengths of sharks
found in fishermen’s nets. This second study found that the mean length of the sharks measured was
8 feet with a standard deviation of 2.1 feet.
Compute the coefficient of variation for the length of goldfish and for the length of sharks.
Which data set is more spread out?
A sample of a trout population from a lake was measured and the mean length was found to be 8
inches with a standard deviation of 1.5 inches. Is the distribution of lengths of trout more like the
distribution of lengths of goldfish or the distribution of lengths of sharks? Explain.
9. (a) Find a Chebyshev interval which will contain at least 75% of the lengths of trout in the previous
problem.
(b) Find a Chebyshev interval which will contain at least 88.9% of the lengths of sharks in the previous
problem.
(c) Find a Chebyshev interval which will contain at least 93.8% of the lengths of goldfish in the
previous problem.
10. Is the data represented by the following box and whisker plot skewed left or right?
10
15
20
25
30
35
40
45
(True or False) More than half of the data is larger than the median.
(True or False) Exactly half of the data is inside the box.
(True or False) The left whisker represents the first quartile.
(True or False) There is more data in the right whisker than in the left whisker.
11. We’ve got 20 cookies in a jar. There are 14 chocolate chip and 6 oatmeal raisin. If you draw one
cookies at random from the jar, what is the probability that it is oatmeal raisin?
12. In another cookie jar we have 6 chocolate chip, 4 oatmeal raisin, and 3 sugar cookies. If you take one
cookie at random out of this jar, what is the probability that it is either oatmeal raisin or a sugar
cookie? Are these mutually exclusive events?
13. In yet another cookie jar we have
10 chocolate chip cookies, 3 of which are burnt,
12 oatmeal raisin, 5 or which are burnt, and
9 sugar cookies, 4 of which are burnt.
If you draw one cookie at random from this jar, what is the probability that it is either a sugar cookie
or a burnt cookie (of any kind). Are these events mutually exclusive?
14. If we draw one cookie from the jar in problem 11, and one cookie from the jar in problem 12, what is
the probability that they are both chocolate chip?
15. The results of an extensive medical study to determine the efficacy of a new medical test for a common
medical condition are summarized in the following table.
Test Result +
Test Result Column Total
Condition
Present
63
252
315
Condition
Absent
96
564
660
Row Total
159
816
975
Find
(a) P (A),
(b) P (A, given +),
(c) P (+),
(d) P (P and +),
(e) P (+ given P )
(f) P (+ given A)
Mean and Standard Deviations:
X
Sample Data
x=
x
n
sX
s=
sX
X
Grouped Data
P
n= f
x=
xf
n
s=
(x − x)2
n−1
(x − x)2 f
n−1
Chebyshev Intervals:
at least 75%
µ − 2σ to µ + 2σ
at least 88.9%
µ − 3σ to µ + 3σ
at least 93.8%
µ − 4σ to µ + 4σ