Download Two-Sample Inference Procedures

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Sufficient statistic wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Taylor's law wikipedia , lookup

Regression toward the mean wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
Suppose we have a population of
adult men with a mean height of
71 inches and standard deviation
of 2.6 inches. We also have a population of
adult women with a mean height of 65 inches
and standard deviation of 2.3 inches. Assume
heights are normally distributed.
Describe the distribution of the difference in
heights between males and females (malefemale).
Normal distribution with
mx-y =6 inches & sx-y =3.471 inches
Female
65
Male
71
Difference = male - female
6
m  m m
x y
s
x y
x
y
 s s
2
2
x
y
We will
be
interested
in the
difference
of means,
so we will
use this to
find
standard
error.
The goal of these inference
procedures is to compare the
responses to two treatments or to
compare the characteristics of
two populations.
 We have INDEPENDENT samples
from each treatment or population





Have two SRS’s from the
populations or two randomly
assigned treatment groups
Samples are independent
Both populations are normally
distributed
 Have large sample sizes
 Graph BOTH sets of data
s’s known/unknown
Since in real-life, we
will NOT know both s’s,
we will do t-procedures.
Option 1: use the smaller of the two
values n1 – 1 and n2 – 1
This will produce conservative results
– higher p-values & lower
Calculator
confidence.
Option 2: approximation used bydoes this
automatically!
s s 
technology
  
n n 

df 
1 s 
1 s 
  
 
n  1 n  n  1 n 
1
2
2
1
2
1
2
2
2
2
1
2
1
2
2
Called
standard
error
CI  statistic  critical value SD of statistic
s
s
x  x   t *

n n
1
2
2
1
2
1
2
2




Used for two populations with the
same variance
When you pool, you average the
two-sample variances to estimate
the common population variance.
DO NOT use on AP Exam!!!!!
We do NOT know the variances of the population,
so ALWAYS tell the calculator NO for pooling!
Two competing headache remedies claim to give fastacting relief. An experiment was performed to
compare the mean lengths of time required for bodily
absorption of brand A and brand B. Assume the
absorption time is normally distributed. Twelve people
were randomly selected and given an oral dosage of
brand A. Another 12 were randomly selected and given
an equal dosage of brand B. The length of time in
minutes for the drugs to reach a specified level in the
blood was recorded. The results follow:
mean
SD
n
Brand A
20.1
8.7
12
Brand B
18.9
7.5
12
Describe the shape & standard error for sampling
distribution of the differences in the mean speed of
absorption. (answer on next screen)
Describe the sampling distribution of the differences
in the mean speed of absorption.
Normal distribution with S.E. = 3.316
Find a 95% confidence interval difference in
mean lengths of time required for bodily
absorption of each brand. (answer on next screen)
State assumptions!
Assumptions: Think “Price is Right”!
Have 2 independent SRS from volunteers
Closest
going
Given the absorption
rate without
is normally
distributed
over
s’s unknown s 2 s 2
Formula & calculations
1
2
x1  x2   t *
n1

n2

df  21.53
8.7 2 7.52
20.1  18.9  2.080

 (5.685,8.085)
12
12
From calculator df =
Conclusion in context
We are 95% confident that the true difference in mean
21.53, use t* for df =
lengths of time required for bodily absorption of each
21 & 95% confidence
brand is between –5.685 minutes and 8.085 minutes.
level
 Matched
pairs – refer to
“mean difference”
 Two-Sample
– refer to
“difference of means”
In a recent study on
biofeedback, it was reported
that meditation could alter the
alpha & beta waves in the brain
thus changing the rate at which
the heart beats. This is
important for relieving the
effects of stress.
Let’s test this!
H0H
: 0m: 1m-1 =
m2m=2 0
HaH
: m: 1m- <m2m < 0
a
1
2
Ha: m1 - m2 > 0
Ha: m1> m2
Ha: m1 - m2 ≠ 0
Ha: m1 ≠ m2
Be sure
to define
BOTH m1
and m2!
Test statistic 
Since we usually
assume H0 is true,
statistic
parameter
then this equals 0 –
can usually
SDsoofwestatistic
leave it out
 x  x   m  m 
t
1
2
1
2
2
1
2
1
2
s s

n n
2
The length of time in minutes for the drugs
to reach a specified level in the blood was
recorded. The results follow:
Brand A
Brand B
mean
20.1
18.9
SD
8.7
7.5
n
12
12
Is there sufficient evidence that these
drugs differ in the speed at which they
enter the blood stream?
Assump.: Have 2 independent SRS from volunteers
State assumptions!
Given the absorption rate is normally
distributed
s’s unknown
H0: mA= mB
Hypotheses & define variables!
Where mA is the true mean absorption time
for Brand A & mB is the true mean
absorption time for Brand B
Ha:mA= mB
x1  x2
20.1  18.9
t

 .361& calculations
Formula
s12 s22
8.7 2 7.52


n1 n2
12
12
Conclusion in context
p  value  .7210 df  21.53 α  .05
Since p-value > a, I fail to reject H0. There is not
sufficient evidence to suggest that these drugs differ in
the speed at which they enter the blood stream.
Suppose that the sample mean of Brand
B is 16.5, then is Brand B faster?
t
x1  x2
s12 s22

n1 n2

20.1  16.5
8.7 2 7.52

12
12
 1.085
p  value  .2896 df  21.53 α  .05
No, I would still fail to reject the null
hypothesis.


Two-sample procedures are more
robust than one-sample procedures
BEST to have equal sample sizes! (but
not necessary)

Chapter 10 – Section 2
Text Problems #33 – 54 by 3’s and 57 – 62 all
Worksheet 10.2

Chapter 10 AP Review (3/17)

Chapter 10 Test (3/18)

