Download Answers to Confidence Interval

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Taylor's law wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Opinion poll wikipedia , lookup

Student's t-test wikipedia , lookup

German tank problem wikipedia , lookup

Misuse of statistics wikipedia , lookup

Transcript
Answers to “Confidence Interval Learning Task” or as we see it “The Trouble With
Confidence Intervals:
1. Review
a) We can use Z-scores and the normal curve to approximate that, by typing
normalcdf((700-550)/50, 99999999) = .0013499672. And, although I would accept
.1350% on a test, I plan on storing that actual probability into my calculator as P for the
rest of this task.
b) This means 5 or 6 or 7 or 8 or…  C(10, 5)p5(1-p)5 + C(10, 6)p6(1-p)4 + C(10,
7)p7(1-p)3 + C(10, 8)p8(1-p)2 + C(10, 9)p9(1-p)1 + C(10, 10)p10(1-p)0 =
sum(seq((10nCrR)P^R*(1-P)^(10-R), R, 5, 10) = 1.125 or by using 1 – binomcdf(10, P, 4).
2. The Empirical Rule
a) The distribution of the sample mean scores is approximately normal.
b) The mean of the sample mean scores is 500 and the sample standard deviation is
50/10 = 5.
c) 68% of the sample means should fall between 545 and 555.
95% of the sample means should fall between 540 and 560.
99.7% of the sample means should fall between 535 and 565.
The mean score of the sample will fall within 2 standard deviations or 10 points from
the mean.
3. Confidence Intervals
a) The middle 95% of our data should lie within 1.96 standard deviations from the
mean.
b) The middle 90% of our data should lie within 1.645 standard deviations of the
mean.
c)
4.
Confidence Level
z*
90%
1.645
95%
1.96
99%
2.576
President’s Approval Ratings: Part 1
a) Between 43% and 49% of US citizens approved of Obama yesterday.
b) Margin of error = z*∙standard error, so for a 95% confidence interval and a
3% margin of error, .03 = SE ∙ 1.96  the Standard Error is .015306.
c) Using the same logic, .03 = SE ∙ 2.576  the Standard Error is .011646 for a
99% confidence level.
d) The standard deviation for sample proportions is
.
The standard error is an approximation for the SD and is equal to
e) Knowing from part b that the Standard Error for a 95% confidence interval is
.015306, and using the formula from part d, we get
.46(.54)
 .015306 . Solving
n
this equation, we get n = 1060. For a 99% confidence level, we get:
.46(.54)
 .011646 , and that means n = 1831. **Note: We got different answers in
n
class, and it’s because we used .0153 and .0116 for our standard errors. In
that case, we got 1062 or 1068 and 1846 or 1858 – depending on how we
rounded the standard error. It’s important to show your work, so I can tell
where your numbers are coming from!
In actuality 1067 people were polled, so the confidence level can be found using:
z*
.46(.54)
 .03 . This means z* = 1.9661999 and the confidence level can be found
1067
using normalcdf(-1.966199, 1.966199) = 95.07%, which still rounds to a 95%
confidence level.
f) If we maximize
by looking at the graph of y = x(1-x), we see that the
expression has a maximum value when p = .5, so p* = 0.5 .
If
, then n ≥ 1067, so more than 1067 people should have been
surveyed. ***Note that
for the margin of error last year! Solving
which is what you might have used
would give us n ≥ 1111.
g) Adding 500 people increases the sample size needed for a 95% confidence
interval to 1560, 1562 or 1586 (depending on the class). The new margin of error
is 1.96
.5(.5)
 .02481  2.48% - or 2.46%
1560
Adding 500 people increases the sample size needed for a 99% confidence interval
to 2331, 2346 or 2358. The new margin of error is 2.576
.5(.5)
 .026678  2.67% 2331
or 2.66% or 2.65%. The new margins of error are smaller, by .52% or .33 %
respectively. (You can find these other differences.)
h) Subtracting 500 people from 1060 creates a new margin of error of:
1.96
2.576
.5(.5)
 .041413  4.14% (4.13%, 4.05%) for the 95% confidence interval or
560
.5(.5)
 .0353 3.53% (3.51%, 3.50%) for the 99% confidence interval. These
1331
intervals are 1.14% and .53% larger. (Do the math!)
i) Sample sizes of 200 yield margins of error of
and
for the 95%
for the 99% confidence intervals.
j) With p* and the confidence level remaining the same, as n increases, the margin
of error decreases, and vice versa. This is true because if we are only changing n,
is just a transformation of y =
.
k) As the confidence level increases, the margin of error increases, too. This
makes sense because as we get more confident, we must have a larger range to pull
our proportion from. (I’m 10% confident that between 70% to 71% of you play a
musical instrument, but I’m 90% confident that between 40% and 90% of you do.)
5. Presidential Approval
a) If the margin of error is to be  5%, then we need to solve 1.96
.5 2
 .05 for n.
n
Doing so tells us that we must choose 385 people or more to construct a 95%
confidence interval with no more than 5% margin of error.
b) Since 34/50 = 68% approve, we assume that our proportion is 68%. So a 90%
confidence interval can be found using: .068  1.645
.68(.32)
 (.5715, .7885),
50
and the margin of error is 10.85%. The 95% confidence level means
.068  1.96
.68(.32)
 (.5507, .8093) and that’s a margin of error of 12.93%.
50
The 99% confidence level means the margin of error is .068  2.576
.68(.32)
=
50
16.99% so the confidence interval is (.5101, .8499).
c) If we could take all the samples of size 50 in our population and create a
histogram of the sample proportions that we found, that distribution would have a
mean close to the actual proportion. Furthermore, 95% of these sample
proportions should fall between 1.96 standard deviations of that true proportion.
Since we have created an interval by going 1.96 standard errors of our sample
proportion, we believe that our interval - from 55% to 81% - will contain the true
proportion – unless our sample is one of the 5% that doesn’t contain the true
proportion to begin with.
d) “Between 55% and 81% of the student body approves of the job that President
Obama is doing based on recent survey of 50 randomly selected high school
students. We can state this with a 95% confidence level.”
1-PropZint probably means “1 proportion interval based on z scores”. x represents the
number of successes, n represents the number of data entries and C-level represents the
confidence level. This worked when x = 34, n = 50 and C-level was .95
e) Press STAT, toggle over to TESTS, then arrow down to A: 1-PropZInt. The
value of x must be 34, n = 50, and the C-Level = .95. We get (.5507, .8093), which
is exactly what we got earlier…
f) Press STAT, toggle over to TESTS, then arrow down to A: 1-PropZInt. The
value of x must be 58% of 350, so type in .58*350 and 203 will appear. Let n = 350
and change C-level to .90. Press Calculate and you should get (.53661, .62339).
(In order to find the margin of error, just subtract the lower bound from the
upper bound to get the width of the interval. Divide that number by 2. For the
.78257  .57743
 .10257 , so our margin of
interval given, (.57743, .78257), find
2
error is 10.23%).
g) The 88% confidence interval is (.57743, .78257). The margin of error is half of
the difference between these two numbers = .10257. This means another way to
state this confidence interval is to say it is “68% ± 10.257”
h) randBin would randomly return 0 or 1, but on average, a 1 would be returned
55% of the time. This best represents a random sample of our population. The sum
represents the total of students who approve out of a random sample of size 50.
i) My interval was (.38152, .65848), and, yes, 55% lies in this interval. Considering
my proportion was 26/50 = 55%, of course I expected it. But this should happen
95% of the time, no matter what.
j) (.34152, .61848), (.57298, .82702) (.48546, .75454) (.5287, .7913)
k) This must be done in class, but I suspect that about 95% of the intervals will
contain 55%. Make a listing of all your intervals on the chalkboard to see what
proportion really does.
l) We could reduce the margin of error by gathering more than 50 students’
opinions or else accepting a lower level of confidence.
5. Racing Hearts
a) The estimate of the population mean is the same as the mean of the samples.
And the standard deviation of the means is the sample standard deviation divided
by
n . i.e.
 x   and  x 

n
.
b) The confidence interval should be   z *

n
.
c) X is the estimate for . So the approximate confidence interval is x  z *
s
n
.
d) The T distributions approximate the normal distribution as n increases, but at
small values of n, the T distribution is wider, meaning that a 95% confidence
interval would be wider, too.
e) We will find the resting heart rate of a typical high school student, so your
data ready to share in class.
f) Try this using 78, 72, 66, 70, and 62. You should get (62.068, 77.132), with a
mean of 69.6 and a standard deviation of 6.066. (Compare this to a Zinterval.)
g) Use {72, 56, 60, 64, 68} as your other 5 pieces of data.
h) Use {72, 68, 68, 58, 60, 60, 64, 72, 68, 76, 56, 72, 56, 60, 64, 68, 64, 66, 70,
54} as your other 20.
i)
We should use a Z* score instead of a t* score, because t* scores change
depending on the sample size. Z* scores don’t depend on a sample size.
4.9
Solve 2.576 
 3 and get n  18 students.
n
j) The interval is (57.717, 59.483). I am 99% confident that the average resting
heart rate of a high school student is between 57.7 and 59.5.
6. Synthesis of Confidence Intervals