Download Solutions

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Foundations of statistics wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

History of statistics wikipedia , lookup

Law of large numbers wikipedia , lookup

Categorical variable wikipedia , lookup

Transcript
SOLUTIONS
Stat 250.3 First Midterm Exam
Wednesday, October 15, 2003
Part I: (100 points) Written Problems: Show ALL work, calculations and formulas used. Partial
credit will be awarded for using the correct procedures.
1. Joe is interested in conducting a study to determine if exposure to the sun (# hours per week in the
sun) is related to skin cancer. [18]
(a) What are the two variables of interest? What type is each of these variables, quantitative,
categorical or ordinal? [8]


Numbers of hours per week in the sun – Quantitative
Having or not a skin cancer - Categorical
(b) Which one is the explanatory variable and which one is the response variable? [6]


Numbers of hours per week in the sun – Explanatory
Having or not a skin cancer - Response
(c) Ethically, what type of study would I have to do to determine if there is a relationship
between the two variables? [4]
Observational study
2. Suppose that the probability of having a girl is .45. A couple has decided to have 5 children. Let
X = the number of girls out of the five children. Below are a partial PDF and CDF for X. [22]
(a) The notation P(X ≤ x) refers to which of the distribution functions? [4]
CDF
(b) Complete the table below. [12]
K
0
1
2
3
4
5
P(X=k)
0.050
0.206
0.337
0.276
0.113
0.018
P(X≤k)
0.050
0.256
0.593
0.869
0.982
1.000
(c) What is the probability that the couple will have an odd number of girls? [6]
0.206+0.276+0.018 = 0.5
3. A basketball player has 60% probability of success on a free throw. Suppose that in a game he
attempted 12 free throws, and let X be the number of free throws he made. We can assume that
the result in each of the free throws is independent from the others. [30]
(a) What is the distribution of X? (Include the parameter values necessary to fully describe its
distribution!) [10]
X is binomial random variable n = 12 and p = 0.6.
(c) Find the probability that he made at least five free throws. [10]
P( X  5)  1  P( X  4)  1  0.0573  0.9427
(c) Find the mean and the variance of X. [10]
μ = E(X) = np = 12( .6) = 7.2
σ2 = Var(X) = np(1-p) = 12(.6)(.4) = 2.88
4. Suppose the number of calories PSU students consume in a day is normally distributed with mean
2000 and standard deviation 300. [30]
a) What is the probability that a randomly selected individual consumed between 1800 and
2100 calories yesterday? [10]
P(1800< X < 2100) = P( -200/300 < Z < 100/300) = P(-0.67 < Z < 0.33)
= P (Z < 0.33) – P(Z<-0.67) = 0.6293 – 0.2514 = 0.3779
b) About 95% of PSU students have a daily caloric intake between what two values? [10]
μ ±2σ = 2000 ± 600, i.e from 1400 to 2600 calories.
c) What is the 90th percentile for the amount of calories consumed in a day by a PSU
student? (i.e. what is the amount of calories, that 90% of the PSU students consume less
calories than that in a day?) [10]
P (Z < z)=.90 ==> z = 12.8, thus x = 300 (1.28) + 2000 = 2384
-2-
Midterm Exam Part II: (100 points) Multiple Choice Problems
SOLUTIONS: A-B-C-A-A B-C-D-B-C D-A-A-D-C B-C-C-A-B
1. Which of these is a quantitative variable?
A. The length of your right foot.
B. The month you were born.
C. The political party you belong to.
D. The newspaper you prefer to read.
2. Researchers were interested in the relationship between whether students attended a public or private high
school and involvement in activities at college. Fifty public school students and 50 private school students
were randomly chosen to participate in the study. Which of the following statements is true?
A. Type of school and involvement in activities are both response variables.
B. Type of school is an explanatory variable and involvement in activities is a response variable.
C. Type of school is a response variable and involvement in activities is an explanatory variable.
D. There is not enough information given about the study to classify the variables.
3. Which of the following can be used to summarize the two random variables in question 2?
A. Stem and leaf plot
B. Dot plot
C. Two-way table
D. Pie Chart
__________________________________________________________________________
Questions 4 and 5: A sample of students was asked if they believe that it is right or wrong to have sex before
marriage. Below is a table summarizing the responses for males and females in the sample.
Rows: Gender
Columns: SexB4Mar
Right
Wrong
All
122
43
165
96
20
116
63
Count
281
female
male
All
218
Cell Contents –
4. What percent of males in the sample believe it is wrong to have sex before marriage?
A. 0.17
B. 0.21
C. 0.07
D. 0.83
5. What percent of all those surveyed are females who believe it is wrong to have sex before marriage?
A. 0.15
B. 0.68
C. 0.26
D. 0.59
6. The shape of the data below is described as __________________.
B. left-skewed C. right-skewed
D. normal
10
Frequency
A. bell-shaped
5
0
50
55
60
65
70
75
80
85
90
scores
-3-
95
100
________________________________________________________________________________
Questions 7 and 8: The following are the responses of 11 people when asked how much change (in cents)
they had in their wallet.
12
14
15
16
22
26
46
54
61
64
85
7. What is the median for this data?
A. 22
B. 36
C. 26
D. 24
8. Using the data in the previous question, what is the IQR?
A. 6
B. 35
C.73
D. 46
_________________________________________________________________________________
9. A data value that is inconsistent with the bulk of the data is a(n):
A. minimum
B. outlier
C. maximum
D. placebo effect
10. The third quartile is:
A. 75
B. the data value that is .75 standard deviations from the mean
C. the data value below which 75% of the values fall
D. the data value below which 25% of the values fall
11. A researcher wants to determine the effect of a new drug on growth. He selects 40 adolescents for the
study and randomly divides all subjects into two groups. He will treat one of the groups using the new drug
and the other group placebo. This is an example of a(n):
A. Observational study
B. Matched pair design
C. Latin square design
D. Randomized design
12. In reference to a five-number summary, fifty percent of the data values fall between what two numbers?
A. first quartile and third quartile
B. third quartile and maximum
C. first quartile and maximum
D. minimum and maximum
13. A variable that affects the response variable and is related to the explanatory variable is known as:
A. confounding
B. skewing
C. discrete
D. random
14. Which of the following is a discrete random variable?
A.
B.
C.
D.
The time in seconds that an elephant can hold its breath underwater.
The height of an ant on an elephant in Zimbabwe.
The weight of an adult, ant free, female elephant in Zimbabwe.
The number of ants on an elephant in Zimbabwe.
-4-
15. The payoff (X) for a lottery game has the following probability distribution.
k
P(X=k)
$0
0.8
$5
0.2
What is the expected value of X= payoff?
A. $0
B. $0.50
C. $1.00
D. $2.50
16. A birth is selected at random. Define events A=baby is a boy and B=the mother had the flu during her
pregnancy. The events A and B are
A. Disjoint but not independent.
B. Independent but not disjoint.
C. Disjoint and independent.
D. Neither disjoint nor independent.
17. What is the probability that Z is between 1 and 1, P(1  Z  1)? (Z is a standard normal random variable.)
A. 0.1587
B. 0.3174
C. 0.6826
D. 0.8413
18. The time taken for a computer to boot up, X, follows a normal distribution with mean 30 seconds and standard
deviation 5 seconds. What is the standardized score (z-score) for a boot up of time x =35 seconds?
A. 2.0
B. 0.0
C. 1.0
D. 2.0
_______________________________________________________________________________________
Questions 19 and 20: Suppose that P (  Z   )   , where Z is a standard normal random variable, 
is a positive number and  is a number between 0 and 1. (e.g for P(2  Z  2)  .95,   2 and
  .95 ). Find the following probabilities in terms of  : (it might be helpful to draw the standard normal
curve and draw the desired areas for each question.)
19. P(0  Z   )
A.

2
B.
1 
2
C. 1  
D. 2 
B.
1 
2
C. 1  
D. 2 
20. P ( Z   )
A.

2
-5-