Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Introduction to Statistics Sample Exam Chapters 1 - 4 1. To estimate the percentage of Regis College students who use illegal drugs, a uniformed police officer approached students outside the student center, asked their names, and then asked whether they had used illegal drugs in the last month. She continued asking students until she had 40 responses. (a) What is the population for this survey? (b) What is the sample for this survey? (c) What factors make this survey subject to bias? (d) Describe another way of collecting the data, and mention at least one advantage and at least one disadvantage of your method. 2. Fairfield College is considering a new course evaluation form for all the classes taught on campus. One concern is the amount of time necessary to fill out the form. A random sample of 30 students were asked to fill out the form for the class they had at 10:00 a.m. The times in minutes to fill out the form were as follows: 18 19 8 12 18 20 15 17 19 23 22 30 21 16 44 17 15 12 18 26 51 23 35 31 42 5 2 3 9 12 Compute the five number summary and make a box and whisker plot for this data. Be sure to indicate the scale. Describe the shape of the display; is it skewed one way or the other? 3. The Richter scale is used to measure the magnitude of earthquakes. A sample of 15 earthquakes occurring in the United States had Richter scale ratings of 6.0 2.0 2.1 3.1 1.3 4.6 5.1 1.2 3.4 1.8 2.1 2.7 1.4 3.5 1.5 Compute the five number summary and the interquartile range. Make a box and whisker plot. Is this data skewed right or left? 4. We wish to make a histogram representing the following data. 63 20 34 21 31 24 13 46 54 51 12 10 34 18 15 21 27 22 14 35 18 15 4 42 If we wish the histogram to have 6 bars, what should the class width be? 5. The frequency table for the data in problem 4 above is shown below. Fill in the blank entries. Class Limits Class Boundaries Midpoint 8.5 Freq. 4 Rel. Freq. 16.7% 9 37.5% 3 Cumulative Freq 4 16 4 16.7% 20 6. The histogram below shows the frequencies of the number of houses sold in a month by a particular realty company. Use the information from the histogram to compute the sample mean and sample standard deviation. Organize your work in a table showing all your calculations. 7. The Richter scale is used to measure the magnitude of earthquakes. A sample of 6 earthquakes occurring in the United States had Richter scale ratings of 6.0 2.1 1.3 5.1 3.4 2.1 Compute the sample mean and standard deviation for this data. 8. A population of golfish in a large fish tank was studied and the mean length was found to be 6 centimeters with a standard deviation of 1.1 cm. A separate study looked at the lengths of sharks found in fishermen’s nets. This second study found that the mean length of the sharks measured was 8 feet with a standard deviation of 2.1 feet. Compute the coefficient of variation for the length of goldfish and for the length of sharks. Which data set is more spread out? A sample of a trout population from a lake was measured and the mean length was found to be 8 inches with a standard deviation of 1.5 inches. Is the distribution of lengths of trout more like the distribution of lengths of goldfish or the distribution of lengths of sharks? Explain. 9. (a) Find a Chebyshev interval which will contain at least 75% of the lengths of trout in the previous problem. (b) Find a Chebyshev interval which will contain at least 88.9% of the lengths of sharks in the previous problem. (c) Find a Chebyshev interval which will contain at least 93.8% of the lengths of goldfish in the previous problem. 10. Is the data represented by the following box and whisker plot skewed left or right? 10 15 20 25 30 35 40 45 (True or False) More than half of the data is larger than the median. (True or False) Exactly half of the data is inside the box. (True or False) The left whisker represents the first quartile. (True or False) There is more data in the right whisker than in the left whisker. 11. We’ve got 20 cookies in a jar. There are 14 chocolate chip and 6 oatmeal raisin. If you draw one cookies at random from the jar, what is the probability that it is oatmeal raisin? 12. In another cookie jar we have 6 chocolate chip, 4 oatmeal raisin, and 3 sugar cookies. If you take one cookie at random out of this jar, what is the probability that it is either oatmeal raisin or a sugar cookie? Are these mutually exclusive events? 13. In yet another cookie jar we have 10 chocolate chip cookies, 3 of which are burnt, 12 oatmeal raisin, 5 or which are burnt, and 9 sugar cookies, 4 of which are burnt. If you draw one cookie at random from this jar, what is the probability that it is either a sugar cookie or a burnt cookie (of any kind). Are these events mutually exclusive? 14. If we draw one cookie from the jar in problem 11, and one cookie from the jar in problem 12, what is the probability that they are both chocolate chip? 15. The results of an extensive medical study to determine the efficacy of a new medical test for a common medical condition are summarized in the following table. Test Result + Test Result Column Total Condition Present 63 252 315 Condition Absent 96 564 660 Row Total 159 816 975 Find (a) P (A), (b) P (A, given +), (c) P (+), (d) P (P and +), (e) P (+ given P ) (f) P (+ given A) Mean and Standard Deviations: X Sample Data x= x n sX s= sX X Grouped Data P n= f x= xf n s= (x − x)2 n−1 (x − x)2 f n−1 Chebyshev Intervals: at least 75% µ − 2σ to µ + 2σ at least 88.9% µ − 3σ to µ + 3σ at least 93.8% µ − 4σ to µ + 4σ