Download Slide 25

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Psychometrics wikipedia , lookup

History of statistics wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
Chapter 25
Paired Samples
and Blocks
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Paired Data


Data are paired when the observations are collected in
pairs or the observations in one group are naturally
related to observations in the other group.
Paired data arise in a number of ways. Perhaps the most
common is to compare subjects with themselves before
and after a treatment.
 When pairs arise from an experiment, the pairing is a
type of blocking.
 When they arise from an observational study, it is a
form of matching.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 2
Paired Data (cont.)


If you know the data are paired, you can (and must!) take
advantage of it.
 To decide if the data are paired, consider how they
were collected and what they mean (check the W’s).
 There is no test to determine whether the data are
paired.
Once we know the data are paired, we can examine the
pairwise differences.
 Because it is the differences we care about, we treat
them as if they were the data and ignore the original
two sets of data.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 3
Paired Data (cont.)


Now that we have only one set of data to
consider, we can return to the simple one-sample
t-test.
Mechanically, a paired t-test is just a one-sample
t-test for the mean of the pairwise differences.
 The sample size is the number of pairs.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 4
Assumptions and Conditions



Paired Data Assumption: The data must be
paired.
Independence Assumption: The differences must
be independent of each other. Check the:
 Randomization Condition
Normal Population Assumption: We need to
assume that the population of differences follows
a Normal model.
 Nearly Normal Condition: Check this with a
histogram or Normal probability plot of the
differences.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 5
The Paired t-Test


When the conditions are met, we are ready to test
whether the paired differences differ significantly
from zero.
We test the hypothesis H0: d = 0, where the d’s
are the pairwise differences and 0 is almost
always 0.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 6
The Paired t-Test (cont.)

We use the statistic
d  0
tn 1 
SE  d 
where n is the number of pairs.

SE  d  
sd
is the ordinary standard error for the mean
n
applied to the differences.

When the conditions are met and the null hypothesis is true, this
statistic follows a Student’s t-model on n – 1 degrees of freedom, so
we can use that model to obtain a P-value.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 7

The table below represents mileage driven by 11
employees during an ordinary five day work week
and a flexible four day work week. Was the 4 day
work week successful in reducing miles driven?
Name
5 Day Mileage
4 Day Mileage
Jeff
2798
2914
Betty
7724
6112
Roger
7505
6177
Tom
838
1102
Aimee
4592
3281
Greg
8107
4997
Larry G
1228
1695
Tad
8718
6606
Larry M
1097
1063
Leslie
Lee
8089
3807
6392
3362
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 8

Calculator – just like a one sample t-test

Stats-Tests-2:T-test
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 9
Confidence Intervals for Matched Pairs


When the conditions are met, we are ready to find the confidence
interval for the mean of the paired differences.
The confidence interval is
d t

n 1
 SE  d 
where the standard error of the mean difference is
sd
SE  d  
n
The critical value t* depends on the particular confidence level, C, that you
specify and on the degrees of freedom, n – 1, which is based on the number
of pairs, n.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 10
Paired T-Interval Example

Find a 95% confidence interval for the difference
mean age between 170 married wives and
husbands. The mean difference is 2.2 years with
a standard deviation of 4.1 years.

Calc: Stats- Tests-8:TInterval
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 11
Blocking


Consider estimating the
mean difference in age
between husbands and
wives.
The following display is
worthless. It does no good
to compare all the wives
as a group with all the
husbands—we care about
the paired differences.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 12
Blocking (cont.)

In this case, we have paired data—each husband is
paired with his respective wife. The display we are
interested in is the difference in ages:
Insert histogram from page 579 of the text.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 13
Blocking (cont.)


Pairing removes the extra variation that we saw in
the side-by-side boxplots and allows us to
concentrate on the variation associated with the
difference in age for each pair.
A paired design is an example of blocking.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 14
1-sample t-test or 2-sample t-test or
paired t-method?



1) random samples of 50 men and 50 women are
asked to imagine buying a birthday present for
their best friend. We want to estimate the
difference in how much they are will to spend.
2) Mothers of twins were surveyed and asked
how often in the past month strangers has asked
whether the twins were identical.
3) Are parents equally strict with boys and girls?
In a random sample of families, researchers
asked a brother and a sister from each family to
rate how strict their parents were.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 15
What Can Go Wrong?




Don’t use a two-sample t-test for paired data.
Don’t use a paired-t method when the samples
aren’t paired.
Don’t forget outliers—the outliers we care about
now are in the differences.
Don’t look for the difference in side-by-side
boxplots.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 16
What have we learned?


Pairing can be a very effective strategy.
 Because pairing can help control variability between
individual subjects, paired methods are usually more
powerful than methods that compare individual groups.
Analyzing data from matched pairs requires different
inference procedures.
 Paired t-methods look at pairwise differences.


We test hypotheses and generate confidence intervals based
on these differences.
We learned to Think about the design of the study that
collected the data before we proceed with inference.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
Slide 25- 17