Download Chapter 11 - Mr. Casas` Weebly

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Notation - Comparing Two Means
Notation - Comparing Two Means
Chapter 11
Mean
Standard
Value Variance Deviation
Comparing Two Populations
or Treatments
1
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
2
1
σx −x =
1
4
2
2
σ12 σ22
+
n1 n 2
Population or Treatment 1
n1
x1
s12
s1
Population or Treatment 2
µ2
σ22
σ2
Population or Treatment 2
n2
x2
s 22
s2
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
5
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Hypothesis Test
Hypothesis Tests Comparing Two Means
against one of the usual alternate
hypotheses using the statistic
7
( x1 − x 2 ) − hypothesized value
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Sampling Distribution for
Comparing Two Means
If n1 and n2 are both large or the population
distributions are (at least approximately)
normal, then the distribution of
z=
8
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
σ12 σ22
+
n1 n 2
6
Example
A sample of 35 cans from machine 1 gave a
mean of 16.031 and a sample of 31 cans from
machine 2 gave a mean of 16.009. State,
perform and interpret an appropriate
hypothesis test using the 0.05 level of
significance.
x1 − x 2 − (µ1 − µ 2 )
Is described (at least approximately) by the
standard normal (z) distribution.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Hypothesis Test
Example
Comparing Two Means
µ1 = mean fill from machine 1
We would like to compare the mean fill of 32
ounce cans of beer from two adjacent filling
machines. Past experience has shown that the
population standard deviations of fills for the
two machines are known to be σ1 = 0.043 and
σ2 = 0.052 respectively.
H0: µ1 - µ2 = hypothesized value
σ12 σ 22
+
n1 n 2
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Comparing Two Means
Large size sample techniques allow
us to test the null hypothesis
z=
3
This implies that the sampling distribution of
x1 − x 2 is also normal or approximately
normal.
σ12 σ22
+
n1 n 2
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
σ1
If n1 and n2 are both large or the population
distributions are (at least approximately)
normal then x1 and x 2 each have (at least
approximately) a normal distribution.
1. µ x −x = µ x − µ x = µ1 − µ 2
1
σ12
Sampling Distribution for Comparing
Two Means
If the random samples on which x1 and x 2
are based are selected independently of
one another, then
2. σ2x1 − x2 = σ2x1 + σ2x2 =
µ1
2
Sampling Distribution for
Comparing Two Means
Sample
Standard
Size
Mean Variance Deviation
Population or Treatment 1
µ2 = mean fill from machine 2
H0: µ1 - µ2 = 0
Ha: µ1 - µ2
Significance level: α = 0.05
Test statistic:
z=
9
x 1 − x 2 − hypothesized value
σ12 σ 22
+
n1 n 2
=
x1 − x 2 − 0
σ12 σ22
+
n1 n 2
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
1
Hypothesis Test
Example
Hypothesis Test
Comparing Two Means
Since n1 and n2 are both large (> 30) we do not
have to make any assumptions about the nature of
the distributions of the fills.
z=
(16.031 − 16.009 )
0.0432 0.0522
+
35
31
The p-value of the test is 0.0628. There
is insufficient evidence to support a claim
that the two machines produce bottles
with different mean fills at a significance
level of 0.05.
= 1.86
Equivalently, we might say.
P-value = 2P(z > 1.86) = 2P(z < -1.86)
With a p-value of 0.063 we have been
unable to show the difference in mean
fills is statistically significant at the 0.05
significance level.
= 2(0.0314) = 0.0628
11
Two-Sample t Test for Comparing
Two Population Means
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
12
df =
x1 − x 2 − (µ1 − µ 2 )
s12 s 22
+
n1 n 2
( V1 + V2 )
Alternate hypothesis and finding the P-value:
1. Ha: µ 1 - µ 2 > hypothesized value
P-value = Area under the
z curve to the right
of the calculated z
2
V12
V2
+ 2
n1 − 1 n 2 − 1
where V1 =
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Two-Sample t Tests for Difference
of Two Means
Two-Sample t Test for Comparing
Two Population Means
If n1 and n2 are both large or if the population
distributions are normal and when the two random
samples are independently selected, the
standardized variable
t=
σ12 σ 22
+
n1 n 2
=
P-value:
Accept this example for what it is, just a sample of
the calculation. Generally this statistic is used
when dealing with “what if” type of scenarios and
we will move on to another technique that is
somewhat more commonly used when σ1 and σ2
are not known.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
X1 − X 2 − 0
Example
Comparing Two Means
Calculation:
This example is a bit of a stretch, since knowing
the population standard deviations (without
knowing the population means) is very unusual.
10
Hypothesis Test
Example
Comparing Two Means
s12
s2
and V2 = 2
n1
n2
2. Ha: µ1 - µ2 < hypothesized value
P-value = Area under the
z curve to the left
of the calculated z
Has approximately a t distribution with
df should be truncated to an integer.
13
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
14
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Hypothesis Test
Two-Sample t Tests for Difference
of Two Means
15
Hypothesis Test
Example
i. 2•(area to the right of z) if z is positive
ii. 2•(area to the left of z) if z is negative
Brand A
517, 495, 503, 491
503, 493, 505, 495
498, 481, 499, 494
H0: µ1 = µ2 (µ1 - µ2 = 0)
Ha: µ1 ≠ µ2 (µ1 - µ2 ≠ 0)
Significance level: α = 0.01
Brand B
493, 508, 513, 521
541, 533, 500, 515
536, 498, 515, 515
Test statistic:
t=
State and perform an appropriate hypothesis test.
16
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
17
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Example
µ1 = the mean amount of acetaminophen in cold
tablet brand A
µ2 = the mean amount of acetaminophen in cold
tablet brand B
In an attempt to determine if two competing
brands of cold medicine contain, on the
average, the same amount of acetaminophen,
twelve different tablets from each of the two
competing brands were randomly selected and
tested for the amount of acetaminophen each
contains. The results (in milligrams) follow. Use
a significance level of 0.01.
3. Ha: µ 1 - µ 2 ≠ hypothesized value
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
18
x1 − x 2 − hypothesized value
σ12 σ 22
+
n1 n 2
=
x1 − x 2 − 0
σ12 σ 22
+
n1 n 2
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
2
Hypothesis Test
Example
Hypothesis Test
Example
Hypothesis Test
Example
Assumptions (continued):
Assumptions:The samples were selected
independently and randomly. Since the
samples are not large, we need to be able to
assume that the populations (of amounts of
acetaminophen are both normally distributed.
Calculation:
n1 = 12, x1 = 497.83, s1 = 8.8300
n 2 = 12, x 2 = 515.67, s 2 = 15.144
t=
As we can see from the normality plots and the
boxplots, the assumption that the underlying
distributions are normally distributed appears to be
quite reasonable.
19
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Hypothesis Test
20
Example
Hypothesis Test
Calculation:
s 2 8.83002
V1 = 1 =
= 6.4974
n1
121
V2 =
df =
2
1
2
2
2
V
V
+
n1 − 1 n 2 − 1
=
We truncate the degrees of freedom to give df = 17.
22
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Confidence Intervals
23
df =
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Confidence Intervals
Comparing Two Means
2
V12
V2
+ 2
n1 − 1 n 2 − 1
s12
s2
and V2 = 2
n1
n2
df should be truncated to an integer. The t
critical values are in the table of t critical
values (table III).
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
8.83002 15.144 2
+
12
12
Comparing Two Means
The general formula for a confidence interval
for µ1 – µ2 when
1. The two samples are independently
chosen random samples, and
2. The sample sizes are both large
(generally n1 ≤ 30 and n2 ≤ 30) OR the
population distributions are
approximately normal is
(X
1
− X 2 ) ± (t critical value)
24
s12 s 22
+
n1 n 2
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Confidence Intervals
Example
Example
Comparing Two Means
s12 5.62 2
=
= 0.632
n1
50
Construct a 98% confidence interval for the
difference of the population means.
V2 =
s 22 6.312
=
= 0.796
n2
50
Sample
Sample Sample Standard
Size
mean Deviation
50
78.3
5.62
50
87.2
6.31
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
= −3.52
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
V1 =
Thread A
Thread B
26
497.83 − 515.67 − 0
Two kinds of thread are being compared for
strength. Fifty pieces of each type of thread
are tested under similar conditions. The
sample data is given in the following table.
where V1 =
25
=
Confidence Intervals
Comparing Two Means
The t critical value is based on
( V1 + V2 )
Example
Conclusion: Since P-value = 0.002 < 0.01 = α,
H0 is rejected. The data provides strong
evidence that the mean amount of
acetaminophen is not the same for both brands.
Specifically, there is strong evidence that the
average amount per tablet for brand A is less
than that for brand B.
(6.4974 + 19.112) 2
= 17.7
6.49742 19.1122
+
11
11
σ12 σ 22
+
n1 n 2
21
P-value: From the table of tail areas for t curve
(Table IV) we look up a t value of 3.5 with df = 17
to get 0.001. Since this is a two-tailed alternate
hypothesis, P-value = 2(0.001) = 0.002.
s 22 15.1442
=
=19.112
n2
12
( V1 + V2 )
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
x1 − x 2 − 0
df =
( 0.632 + 0.796 )
2
0.632 0.796
+
49
49
2
2
= 96.7
Truncating, we have df = 96.
27
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
3
Confidence Intervals
Confidence Intervals
Example
5.62 2 6.312
+
50
50
−8.9 ± 2.82
87 Octane
26.4, 27.6, 29.7
28.9, 29.3, 28.8
The 98% confidence interval estimate for the
difference of the means of the tensile strengths is
x1 =28.45, s1 = 1.228, x 2 =30.73, s 2 = 1.392
V1 =
90 Octane
30.5, 30.9, 29.2
31.7, 32.8, 29.3
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Confidence Intervals
29
Example
Looking on the table under 95% with 9 degrees of
freedom, the critical value of t is 2.26.
x1 - x 2 ± (t critical value)
= 28.45 − 30.73 ± 2.26
2
1
2
2
s
s
+
n1 n 2
The 95% confidence interval for the true difference of
the mean mileages is (-3.99, -0.57).
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
35
2
= 9.8
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Comments: We had to assume that the samples
were independent and random and that the
underlying populations were normally
distributed since the sample sizes were small.
By looking at the following normality plots, we see
that the assumption of normality for each of the two
populations of mileages appears reasonable.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
33
1.
The samples are paired.
2.
The n sample differences can be viewed as a
random sample from a population of
differences.
3.
The number of sample differences is large
(generally at least 30) OR the population
distribution of differences is approximately
normal.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Given the small sample sizes, the assumption of normality
is very important, so one would be a bit careful utilizing
this result.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Comparing Two Population or
Treatment Means – Paired t Test
Alternate hypothesis and finding the P-value:
1. Ha: µd > hypothesized value
P-value = Area under the appropriate t
curve to the right of the calculated t
2. Ha: µd < hypothesized value
P-value = Area under the appropriate t
curve to the left of the calculated t
3. Ha: µd ≠ hypothesized value
i. 2•(area to the right of t) if t is positive
ii. 2•(area to the left of t) if t is negative
Assumptions:
Where n is the number of sample differences and x d
and sd are the sample mean and standard deviation of the
sample differences. This test is based on df = n-1
34
30
Comparing Two Population or
Treatment Means – Paired t Test
x d − hypothesized value
sd
n
( 0.2512 + 0.3231)
Confidence Intervals
Example
Comments about assumptions
Comparing Two Population or
Treatment Means – Paired t Test
t=
s 22 1.3922
=
= 0.3231
n2
6
Confidence Intervals
Example
Comments about assumptions
32
Null hypothesis: H0:µd = hypothesized value
Test statistic:
V2 =
0.2512 2 0.32312
+
5
5
Truncating, we have df = 9.
If we randomized the order of the tankfuls of
the two different types of gasoline we can
reasonably assume that the samples were
random and independent. By using all of the
observations from one car we are simply
controlling the effects of other variables such as
year, model, weight, etc.
1.2282 1.392 2
+
6
6
-2.28 ± 1.71
31
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
s12 1.2282
=
= 0.2512
n1
6
df =
(-11.72, -6.08)
28
Example
Let the 87 octane fuel be the first group and the 90
octane fuel the second group, so we have
n1 = n2 = 6 and
A student recorded the mileage he obtained
while commuting to school in his car. He
kept track of the mileage for twelve different
tankfuls of fuel, involving gasoline of two
different octane ratings. Compute the 95%
confidence interval for the difference of
mean mileages. His data follow:
Looking on the table of t critical values (table
III) under 98% confidence level for
df = 96, (we take the closest value for df ,
specifically df = 120) and have the t critical
value = 2.36.
( 78.3 − 87.2 ) ± 2.36
Confidence Intervals
Example
Comparing Two Means
Comparing Two Means
36
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
4
Paired Sample Example
Paired t Test Example
A weight reduction center advertises that
participants in its program lose an average of
at least 5 pounds during the first week of the
participation. Because of numerous
complaints, the state’s consumer protection
agency doubts this claim. To test the claim
at the 0.05 level of significance, 12
participants were randomly selected. Their
initial weights and their weights after 1 week
in the program appear on the next slide. Set
up and perform an appropriate hypothesis
test.
37
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Paired Sample Example
Member
Initial
Weight
One Week
Weight
1
195
195
2
153
151
3
174
170
4
125
123
5
149
144
6
152
149
7
135
131
8
143
147
9
139
138
10
198
192
11
215
211
12
153
152
38
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Paired Sample Example
continued
Paired Sample Example
continued
This is equivalent to the difference of means:
µd = µ1 – µ2 = µinitial weight - µ1 week weight
H0: µd = 5
Ha: µd < 5
Member
Initial
Weight
1
195
2
153
151
2
3
174
170
4
4
125
123
2
5
149
144
5
6
152
149
7
135
131
4
8
143
147
-4
9
139
138
1
10
198
192
6
11
215
211
4
12
153
152
1
39
40
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Paired Sample Example
3
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
continued
Calculations: According to the statement of the
example, we can assume that the sampling is
random. The sample size (10) is small, so
n = 12, x d = 2.333, s d = 2.674
t=
Significance level: α = 0.05
Test statistic:
x − hypothesized value x d − 5
t= d
=
sd
sd
n
n
One Week Difference
Initial -1week
Weight
0
195
Paired Sample Example
continued
Assumptions: According to the statement of the
example, we can assume that the sampling is random.
The sample size (12) is small, so from the boxplot we
see that there is one outlier but never the less, the
distribution is reasonably symmetric and the normal
plot confirms that it is reasonable to assume that the
population of differences (weight losses) is normally
distributed.
µd = mean of the individual weight changes
(initial weight–weight after one week)
continued
x d − 5 2.333 − 5
=
= −3.45
sd
2.674
12
n
P-value: This is an lower tail test, so looking up the t value
of 3.0 under df = 11 in the table of tail areas for t curves
(table IV) we find that the P-value = 0.002.
41
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Paired Sample Example
continued
42
continued
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Large-Sample Inferences
Difference of Two Population (Treatment) Proportions
Minitab returns the following when asked to
perform this test.
Conclusions: Since P-value = 0.002 < 0.05 = α,
we reject H0.
Some notation:
This is substantially the same result.
Population
Sample
Sample Proportion of Proportion of
Size
Successes Successes
Paired T-Test and CI: Initial, One week
We draw the following conclusion.
There is strong evidence that the mean weight
loss for those who took the program for one
week is less than 5 pounds.
Paired T for Initial - One week
Initial
One week
Difference
N
12
12
12
Mean
160.92
158.58
2.333
StDev
28.19
27.49
2.674
95% upper bound for mean difference: 3.720
T-Test of mean difference = 5 (vs < 5): T-Value = -3.45
43
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
44
Population or
treatment 1
Population or
treatment 2
SE Mean
8.14
7.93
0.772
n1
π1
p1
n2
π2
p2
P-Value = 0.003
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
45
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
5
Properties: Sampling Distribution of p1- p2
Large-Sample z Tests for π1 – π2 = 0
If two random samples are selected
independently of one another, the following
properties hold:
The combined estimate of the common
population proportion is
1. µ p − p = π1 − π2
1
2
z=
1
2
π1 (1 − π1 ) π 2 (1 − π2 )
+
and
n1
n2
2
=
3. If both n1 and n2 are large [n1 p1 ≥ 10,
n1(1- p1) ≥ 10, n2p2 ≥ 10, n2(1- p2) ≥ 10],
then p1 and p2 each have a sampling
distribution that is approximately normal
46
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
total sample size
47
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Example - Student Retention
Gender
Male
Female
n1 = number of males surveyed
n2 = number of females surveyed
p1 = x1/n1 = sample proportion of males who plan on
returning
p2 = x2/n2 = sample proportion of females who plan on
returning
Null hypothesis: H0: π1 – π2 = 0
Alternate hypothesis: Ha: π1 – π2 ≠ 0
51
Example - Student Retention
Significance level: α = 0.05
Calculations:
Test statistic:
pc =
p1 − p 2
pc (1 − p c ) p c (1 − p c )
+
n1
n2
P-value:
+
182
Conclusion:
0.76727 − 0.77473
275
53
P-value = 2(0.4247) = 0.8494
p c (1 − p c )
0.77024(1 − 0.77024)
=
The P-value for this test is 2 times the area
under the z curve to the left of the computed
z = -0.19.
p1 − p 2
p c (1 − p c )
=
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Example - Student Retention
n1p1 + n 2 p 2 211 + 141 352
=
=
= 0.7702
n1 + n 2
275 + 182 457
z=
Assumptions: The two samples are independently
chosen random samples. Furthermore, the sample sizes
are large enough since
n1 p1 = 211 ≥ 10, n1(1- p1) = 64 ≥ 10
n2p2 = 141 ≥ 10, n2(1- p2) = 41 ≥ 10
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
π2 = true proportion of females who plan on returning
Response
Yes
No Maybe
211
45
19
141
32
9
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
275
52
Example - Student Retention
Test to see if the proportion of students planning on returning is
the same for both genders at the 0.05 level of significance?
50
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
π1 = true proportion of males who plan on returning
The responses along with the gender of the person
responding are summarized in the following table.
Example - Student Retention
z=
48
A group of college students were asked what they
thought the “issue of the day”. Without a pause the
class almost to a person said “student retention”. The
class then went out and obtained a random sample
(questionable) and asked the question, “Do you plan
on returning next year?”
Alternate hypothesis and finding the P-value:
1. Ha: π1 - π2 > 0
P-value = Area under the z curve to the
right of the calculated z
2. Ha: π1 - π2 < 0
P-value = Area under the z curve to the
left of the calculated z
3. Ha: π1 - π2 ≠ 0
i. 2•(area to the right of z) if z is positive
ii. 2•(area to the left of z) if z is negative
p1 − p 2
p c (1 − p c ) p c (1 − p c )
+
n1
n2
Assumptions:
1. The samples are independently chosen random
samples OR treatments are assigned at random to
individuals or objects (or vice versa).
2. Both sample sizes are large:
n1 p1 ≥ 10, n1(1- p1) ≥ 10, n2p2 ≥ 10, n2(1- p2) ≥ 10
total number of successes in two samples
Large-Sample z Tests for π1 – π2 = 0
49
n1p1 + n 2p 2
n1 + n 2
pc =
π1 (1 − π1 ) π 2 (1 − π2 )
+
n1
n2
σ p −p =
1
Test statistic:
2
2
2
2
2. σ p −p = σ p + σ p =
1
Large-Sample z Tests for π1 – π2 = 0
Null hypothesis: H0: π1 – π2 = 0
+
Since P-value = 0.849 > 0.05 = α, the hypothesis H0 is
not rejected at significance level 0.05.
0.77024(1 − 0.77024)
182
-0.0074525
= -0.19
0.040198
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
There is no evidence that the return rate is different for
males and females..
54
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
6
Example
Example
A consumer agency spokesman stated that he
thought that the proportion of households having
a washing machine was higher for suburban
households then for urban households. To test to
see if that statement was correct at the 0.05 level
of significance, a reporter randomly selected a
number of households in both suburban and
urban environments and obtained the following
data.
Suburban
Urban
55
Number
surveyed
Number
having
washing
machines
Proportion
having
washing
machines
300
250
243
181
0.810
0.724
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Ha: π1 - π2 > 0
56
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Large-Sample Confidence
Interval for π1 – π2
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
1. The samples are independently selected random
samples OR treatments that were assigned at
random to individuals or objects (or vice versa), and
The P-value = 1 - 0.9916 = 0.0084
Conclusion:
2. Both sample sizes are large:
Since P-value = 0.0084 < 0.05 = α, the hypothesis H0 is
rejected at significance level 0.05. There is sufficient
evidence at the 0.05 level of significance that the
proportion of suburban households that have washers is
more that the proportion of urban households that have
washers.
59
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Example
Example
A student assignment called for the students to survey
both male and female students (independently and
randomly chosen) to see if the proportions that
approve of the College’s new drug and alcohol policy.
A student went and randomly selected 200 male
students and 100 female students and obtained the
data summarized below.
For a 90% confidence interval the z value to use is
1.645. This value is obtained from the bottom row of
the table of t critical values (Table III).
n1 p1 ≥ 10, n1(1- p1) ≥ 10, n2p2 ≥ 10, n2(1- p2) ≥ 10
A large-sample confidence interval for π1 – π2 is
(p1 − p2 ) ± ( z critical value )
60
p1 (1 − p1 ) p 2 (1 − p 2 )
+
n1
n2
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
We use p1 to be the female’s sample approval
proportion and p2 as the male’s sample approval
proportion.
Number Number that Proportion
surveyed
approve
that approve
100
43
0.430
200
61
0.305
(0.430 − 0.305) ± 1.645
0.430(1 − 0.430) 0.305(1 − 0.305)
+
100
200
(0.125) ± 0.097
Use this data to obtain a 90% confidence interval estimate
for the difference of the proportions of female and male
students that approve of the new policy.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
When
The P-value for this test is the area under the z
curve to the right of the computed z = 2.39.
0.810 − 0.742
1
1 
0.7709(1 − 0.7709) 
+

 300 250 
= 2.390
61
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Example
=
Female
Male
57
P-value:
p1 − p2
pc (1 − pc ) pc (1 − pc )
+
n1
n2
p1 − p 2
p c (1 − p c ) p c (1 − p c )
+
n1
n2
Assumptions: The two samples are independently
chosen random samples. Furthermore, the sample sizes
are large enough since
n1 p1 = 243 ≥ 10, n1(1- p1) = 57 ≥ 10
n2p2 = 181 ≥ 10, n2(1- p2) = 69 ≥ 10
H0: π1 - π2 = 0
n1p1 + n 2 p 2 243 + 181 424
=
=
= 0.7709
n1 + n 2
300 + 250 550
58
z=
π1 - π2 is the difference between the proportions
of suburban households and urban
households that have washing machines.
Example
z=
Significance level: α = 0.05
Test statistic:
π2 = proportion of urban households having
washing machines
Calculations:
pc =
Example
π1 = proportion of suburban households having
washing machines
62
or
(0.028,0.222)
Based on the observed sample, we believe that the
proportion of females that approve of the policy exceeds the
proportion of males that approve of the policy by
somewhere between 0.028 and 0.222.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
7
Related documents