Download Chapter 9

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Definitions
Section 9.3
Two samples are independent if the sample
values selected from one population are not
related to or somehow paired or matched with
the sample values from the other population
Inferences About Two Means
(Independent)
Objective
Examples:
Compare the proportions of two independent
means using two samples from each population.
Flipping two coins (Independent)
Hypothesis Tests and Confidence Intervals of
two proportions use the t-distribution
Drawing two cards (not independent)
1
2
Notation
Notation
First Population
μ1
First population mean
σ1
First population standard deviation
n1
First sample size
x1
First sample mean
s1
First sample standard deviation
Second Population
1
μ2
Second population mean
σ2
Second population standard deviation
n2
Second sample size
x2
Second sample mean
s2
Second sample standard deviation
3
Requirements
4
Tests for Two Independent Means
The goal is to compare the two Means
(1) Have two independent random samples
(2) σ1 and σ2 are unknown and no assumption is
made about their equality
(3) Either or both the following holds:
Both sample sizes are large (n1>30, n2>30)
H0 : μ1 = μ2
H0 : μ1 = μ2
H0 : μ1 = μ2
H1 : μ1 ≠ μ2
H1 : μ1 < μ2
H1 : μ1 > μ2
Two tailed
or
Left tailed
Right tailed
Both populations have normal distributions
Note: We only test the relation between μ1 and μ2
(not the actual numerical values)
All requirements must be satisfied to make a
Hypothesis Test or to find a Confidence Interval
5
6
Test Statistic
Finding the Test Statistic
t
x  x  
1
2
1
 2

s12 s22

n1 n2
Note: 1 –
2 =0 according to H0
Degrees of freedom
df = min(n1 – 1, n2 – 1)
Degrees of freedom: df = smaller of n1 – 1 and n2 – 1.
This equation is an altered form of the test statistic
for a single mean when σ unknown (see Ch. 8-5)
Note: Hypothesis Tests are done in same way as
in Ch.8 (but with different test statistics)
7
Steps for Performing a Hypothesis
Test on Two Independent Means
•
Write what we know
•
State H0 and H1
•
Draw a diagram
•
Find the Test Statistic
•
Find the Degrees of Freedom
•
Find the Critical Value(s)
•
State the Initial Conclusion and Final Conclusion
8
Example 1
2
A headline in USA Today proclaimed that “Men,
women are equal talkers.” That headline referred to
a study of the numbers of words that men and
women spoke in a day.
Use a 0.05 significance level to test the claim that
men and women speak the same mean number of
words in a day.
Note: Same process as in Chapter 8
9
n1 = 186
x1 = 15668.5
s1 = 8632.5
Example 1
H0 : µ1 = µ2
H1 : µ1 ≠ µ2
Two-Tailed
H0 = Claim
n2 = 210
x2 = 16215.0
s2 = 7301.2
t-dist.
df = 185
t = 7.602
-tα/2 = -1.97
Test Statistic
α = 0.05
Claim: μ1 = μ2
tα/2 = 1.97
10
Example 1
H0 : µ1 = µ2
H1 : µ1 ≠ µ2
n1 = 186
x1 = 15668.5
s1 = 8632.5
Two-Tailed
H0 = Claim
α = 0.05
Claim: μ1 = μ2
Stat → T statistics → Two sample → With summary
Sample 1:
Using StatCrunch
(Be sure to not use pooled variance)
n2 = 210
x2 = 16215.0
s2 = 7301.2
Sample 2:
Mean 15668.5
Std. Dev. 8632.5
Size
186
Mean 16215.0
Std. Dev. 7301.2
Size
210
● Hypothesis Test
Null: prop. diff.=
Alternative
0
≠
(No pooled variance)
P-value = 0.4998
Degrees of Freedom
df = min(n1 – 1, n2 – 1) = min(185, 209) = 185
Critical Value
tα/2 = t0.025 = 1.97
(Using StatCrunch)
Initial Conclusion: Since t is not in the critical region, accept H0
Initial Conclusion: Since P-value > α (0.05), accept H0
Final Conclusion: We accept the claim that men and women speak the
same average number of words a day.
11
Final Conclusion: We accept the claim that men and women speak the
same average number of words a day.
12
Example 2
Confidence Interval Estimate
Use the same sample data in Example 1 to construct a
95% Confidence Interval Estimate of the difference
between the two population proportions (µ1–µ2)
We can observe how the two proportions relate by
looking at the Confidence Interval Estimate of μ1–μ2
n1 = 186
n2 = 210
df = min(n1–1, n2–1) = min(185, 210) = 185
df = min(n1–1, n2–1) = min(185, 210) = 185
x1 = 15668.5
x2 = 16215.0
tα/2 = t0.05/2 = t0.025 = 1.973
tα/2 = t0.1/2 = t0.05 = 1.973
s1 = 8632.5
CI = ( (x1–x2) – E, (x1–x2) + E )
2
s2 = 7301.2
x1 - x2 = 15668.5 – 16215.0 = -546.5
x1 - x2 = 15668.5 – 16215.0 = -546.5
2
Where
(x1 - x2) + E = -546.5 + 1596.17 = 1049.67
(x1 - x2) – E = -546.5 – 1596.17 = -2142.67
df = min(n1–1, n2–1)
CI = (-2142.7, 1049.7)
13
14
Example 2
Example 3
Use the same sample data in Example 1 to construct a
95% Confidence Interval Estimate of the difference
between the two population proportions (µ1–µ2)
Consider two different classes. The students in the first
class are thought to generally be older than those in the
second. The students’ ages for this semester are summed
as follows:
n1 = 186
x1 = 15668.5
s1 = 8632.5
n2 = 210
x2 = 16215.0
s2 = 7301.2
Stat → T statistics → Two sample → With summary
Sample 1:
Sample 2:
Using StatCrunch
Mean 15668.5
Std. Dev. 8632.5
Size
186
Mean 16215.0
Std. Dev. 7301.2
Size
210
n1 = 93
x1 = 21.2
s1 = 2.42
● Confidence Interval
Level:
3
0.95
(No pooled variance)
n2 = 67
x2 = 19.8
s2 = 4.77
(a) Use a 0.1 significance level to test the claim that the
average age of students in the first class is greater than the
average age of students in the second class.
(b) Construct a 90% confidence interval estimate of the
difference in average ages.
Note: slightly different because
of rounding errors
CI = (-2137.4, 1044.4)
Example 3a
H0 : µ1 = µ2
H1 : µ1 > µ2
n1 = 93
x1 = 21.2
n2 = 67
x2 = 19.8
s1 = 2.42
s2 = 4.77
15
α = 0.1
Claim: µ1 > µ2
16
Example 3a
H0 : µ1 = µ2
Right-Tailed
H1 = Claim
t-dist.
df = 66
H1 : µ1 > µ2
n1 = 93
x1 = 21.2
n2 = 67
x2 = 19.8
s1 = 2.42
s2 = 4.77
Right-Tailed
H1 = Claim
Stat → T statistics → Two sample → With summary
Sample 1:
Mean
Std. Dev.
Size
Sample 2:
Mean
Std. Dev.
Size
Using StatCrunch
Test Statistic
𝒕=
𝒙1 −𝒙𝟐
𝒔1 𝒔2
+
𝒏1 𝒏2
=
21.2−19.8
(2.42)𝟐 (4.77)𝟐
+
93
67
= 2.207
tα/2 = 1.668
t = 7.602
(Be sure to not use pooled variance)
α = 0.1
Claim: µ1 > µ2
21.2
2.42
93
19.8
4.77
67
● Hypothesis Test
Null: prop. diff.=
Alternative
0
≠
(No pooled variance)
P-value = 0.0299
Degrees of Freedom
df = min(n1 – 1, n2 – 1) = min(92, 66) = 66
Critical Value
tα/2 = t0.05 = 1.668
(Using StatCrunch)
Initial Conclusion: Since t is in the critical region, reject H0
Initial Conclusion: Since P-value < α (0.1), reject H0
Final Conclusion: We accept the claim that the average age of students
in the first class is greater than that in the second.
17
Final Conclusion: We accept the claim that the average age of students
in the first class is greater than that in the second.
18
Example 3b
(90% Confidence Interval)
n1 = 93
x1 = 21.2
s1 = 2.42
n2 = 67
x2 = 19.8
s2 = 4.77
α = 0.1
Example 3b
(90% Confidence Interval)
df = min(n1–1, n2–1) = min(92, 66) = 66
𝒔1
𝒏1
Sample 1:
Mean
Std. Dev.
Size
Sample 2:
Mean
Std. Dev.
Size
Using StatCrunch
x1 - x2 = 21.2 – 19.8 = 1.4
2
𝒔
mp
+ 𝒏1 = 1.668
1
n2 = 67
x2 = 19.8
s2 = 4.77
α = 0.1
Stat → T statistics → Two sample → With summary
tα/2 = t0.1/2 = t0.05 = 1.668
𝑬 = 𝒕𝜶
n1 = 93
x1 = 21.2
s1 = 2.42
(Be sure to not use pooled variance)
2.42 𝟐
93
+
4.77 𝟐
67
21.2
2.42
93
19.8
4.77
67
● Hypothesis Test
Null: prop. diff.=
Alternative
0
≠
(No pooled variance)
= 1.058
(x1 - x2) + E = 1.4 + 1.058 = 2.458
(x1 - x2) – E = 1.4 – 1.058 = 0.342
CI = (0.34, 2.46)
CI = (0.35, 2.45)
19
20
4
Related documents