Download Lecture Notes 22

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Lecture 22
Dustin Lueker


Similar to testing one proportion
Hypotheses are set up like two sample mean
test
◦ H0:p1-p2=0
 Same as H0: p1=p2

Test Statistic
z
( pˆ 1  pˆ 2 )  ( p1  p2 )
pˆ 1 (1  pˆ 1 ) pˆ 2 (1  pˆ 2 )

n1
n2
STA 291 Summer 2008 Lecture 21
2

Hypothesis involves 2 parameters from 2
populations
◦ Test statistic is different
 Involves 2 large samples (both samples at least 30)
 One from each population

H0: μ1-μ2=0
◦ Same as H0: μ1=μ2
◦ Test statistic
z
( x1  x2 )  ( 1   2 )
2
1
2
2
s
s

n1 n2
STA 291 Summer 2008 Lecture 21
3

Used when comparing means of two samples
where at least one of them is less than 30
◦ Normal population distribution is assumed for both
samples

Equal Variances
◦ Both groups have the same variability

Unequal Variances
 
2
1
2
2
◦ Both groups may not have the same variability
 
2
1
2
2
STA 291 Summer 2008 Lecture 21
4

Test Statistic
t
( x1  x2 )  ( 1   2 )
(n1  1) s  (n2  1) s
n1  n2  2
2
1
2
2
1 1
  
 n1 n2 
◦ Degrees of freedom
 n1+n2-2
STA 291 Summer 2008 Lecture 21
5
( x1  x2 )  t / 2,n1  n2 2
(n1  1) s  (n2  1) s
n1  n2  2
2
1
2
2
1 1
  
 n1 n2 
◦ Degrees of freedom
 n1+n2-2
STA 291 Summer 2008 Lecture 21
6

Test statistic
t
( x1  x2 )  ( 1   2 )
2
1
2
2
s
s

n1 n2

Degrees of freedom
2
s
s 
  
n1 n2 

df 
2 2
2 2
 s1   s2 
   
 n1    n2 
n1  1 n2  1
2
1
STA 291 Summer 2008 Lecture 21
2
2
7
2
1
2
2
s
s

n1 n2
( x1  x2 )  t / 2,df
2
s
s 
  
n1 n2 

df 
2 2
2 2
 s1   s2 
   
 n1    n2 
n1  1 n2  1
2
1
2
2
STA 291 Summer 2008 Lecture 21
8

How to choose between Method 1 and Method
2?
◦ Method 2 is always safer to use
◦ Definitely use Method 2
 If one standard deviation is at least twice the other
 If the standard deviation is larger for the sample with the
smaller sample size
◦ Usually, both methods yield similar conclusions
STA 291 Summer 2008 Lecture 21
9

Comparing dependent means
◦ Example
 Special exam preparation for STA 291 students
 Choose n=10 pairs of students such that the students
matched in any given pair are very similar given
previous exam/quiz results
 For each pair, one of the students is randomly selected
for the special preparation (group 1)
 The other student in the pair receives normal
instruction (group 2)
STA 291 Summer 2008 Lecture 21
10

“Matches Pairs” plan
◦ Each sample (group 1 and group 2) has the same
number of observations
◦ Each observation in one sample ‘pairs’ with an
observation in the other sample
◦ For the ith pair, let
Di = Score of student receiving special preparation
– score of student receiving normal instruction
STA 291 Summer 2008 Lecture 21
11

The sample mean of the difference scores is
an estimator for the difference between the
population means
xD



n
D
i 1 i
n
D1  D2    Dn

n
We can now use exactly the same methods as
for one sample
◦ Replace Xi by Di
STA 291 Summer 2008 Lecture 21
12

Small sample confidence interval
xD  tn 1
Note:
sD 

n
i 1
sD
n
( Di  xD )
2
n 1
◦ When n is large (greater than 30), we can use the zscores instead of the t-scores
STA 291 Summer 2008 Lecture 21
13

Small sample test statistic for testing
difference in the population means
xD   D
t
sD n
◦ For small n, use the t-distribution with df=n-1
◦ For large n, use the normal distribution instead (z
value)
STA 291 Summer 2008 Lecture 21
14


Ten college freshman take a math aptitude
test both before and after undergoing an
intensive training course
Then the scores for each student are paired,
as in the following table
Student
1
2
3
4
5
6
7
Before
60 73 42 88 66 77 90 63 55 96
After
70 80 40 94 79 86 93 71 70 97
STA 291 Summer 2008 Lecture 21
8
9
10
15
STA 291 Summer 2008 Lecture 21
16

Student
1
2
3
4
5
6
7
8
9
10
Before
60 73 42 88 66 77 90 63 55 96
After
70 80 40 94 79 86 93 71 70 97
Compare the mean scores after and before the
training course by
◦ Finding the difference of the sample means
◦ Find the mean of the difference scores
◦ Compare


Calculate and interpret the p-value for testing
whether the mean change equals 0
Compare the mean scores before and after the
training course by constructing and interpreting a
90% confidence interval for the population mean
difference
STA 291 Summer 2008 Lecture 21
17
Output from Statistical Software Package SAS
N
Mean
Std Deviation
10
7
5.24933858
Tests for Location: Mu0=0
Test
-Statistic-
-----p Value------
Student's t
Sign
Signed Rank
t
M
S
Pr > |t|
Pr >= |M|
Pr >= |S|
4.216901
4
25.5
STA 291 Summer 2008 Lecture 21
0.0022
0.0215
0.0059
18

Variability in the difference scores may be
less than the variability in the original scores
◦ This happens when the scores in the two samples
are strongly associated
◦ Subjects who score high before the intensive
training also dent to score high after the intensive
training
 Thus these high scores aren’t raising the variability for
each individual sample
STA 291 Summer 2008 Lecture 21
19