Download 9.4 Day 2

Document related concepts

Psychometrics wikipedia , lookup

Transcript
Section 9.4
Day 2
To Pool or Not to Pool?
To Pool or Not to Pool?
Almost always select the unpooled option.
To Pool or Not to Pool?
Almost always select the unpooled option.
The only situation in which the pooled
procedure has definite advantage over
unpooled is when the population
standard deviations are equal but the
sample sizes are unequal.
To Pool or Not to Pool?
Almost always select the unpooled option.
The only situation in which the pooled
procedure has definite advantage over
unpooled is when the population standard
deviations are equal but the sample sizes
are unequal.
Population standard deviations usually
unknown
Significance test for the
difference of two means
Components of a significance test
for the difference of two means
1)
2)
3)
4)
Components of a significance test
for the difference of two means
1) Name test and check conditions
2)
3)
4)
Components of a significance test
for the difference of two means
1) Name test and check conditions
2) State hypotheses
3)
4)
Components of a significance test
for the difference of two means
1) Name test and check conditions
2) State hypotheses
3) Compute test statistic, find P-value, and
draw sketch
4)
Components of a significance test
for the difference of two means
1) Name test and check conditions
2) State hypotheses
3) Compute test statistic, find P-value, and
draw sketch
4) Write conclusion, linked to computations
and in context of problem
Name Test
Name:
One-sided significance test for the difference
of two means
or
Two-sided significance test for the difference
of two means
Check Conditions
1)
For survey,
Check Conditions
1)
For survey, two samples randomly and
independently selected from two
different populations.
Check Conditions
1)
For survey, two samples randomly and
independently selected from two
different populations.
For experiment,
Check Conditions
1)
For survey, two samples randomly and
independently selected from two
different populations.
For experiment, two treatments randomly
assigned to available experimental units.
Check Conditions
2) normality: two samples must look like
they came from normally distributed
populations
or
Check Conditions
2) normality: two samples must look like
they came from normally distributed
populations
or
Sample sizes are large enough that
sampling distributions of sample means
will be approximately normal
Check Conditions
15/40 guideline can be applied to each
sample or treatment group, although it is a
bit conservative.
What does this mean?
Check Conditions
15/40 guideline can be applied to each
sample or treatment group, although it is a
bit conservative.
What does this mean?
You can get by with smaller sample sizes
when taking a difference of two means
Check Conditions
You can get by with smaller sample sizes
when taking a difference.
Subtracting two sample means brings in the
tails.
With skewed populations, the sampling
distribution of the difference of two means
tends to be more symmetric than the two
separate sampling distributions of the
sample means.
Check Conditions
3) For survey, population sizes should be
at least ten times larger than sample sizes
for both samples.
Check Conditions
3) For survey, population sizes should be at
least ten times larger than sample sizes
for both samples.
Remember, this condition does not apply to
experiment.
Check Conditions
Conditions: same conditions as for
confidence interval for difference of two
means
State Hypotheses
Null hypothesis is usually that the two
population means are equal.
H o: μ 1 = μ 2
or
H o: μ 1 - μ 2 = 0
where μ1 is mean of first population and μ2 is
mean of second population
State Hypotheses
Three forms of alternative hypothesis:
State Hypotheses
Three forms of alternative hypothesis:
1) Ha: 1   2 or Ha: 1   2  0
State Hypotheses
Three forms of alternative hypothesis:
1) Ha: 1   2 or Ha: 1   2  0
or
2) Ha: 1   2 or Ha: 1   2  0
State Hypotheses
Three forms of alternative hypothesis:
1) Ha: 1   2 or Ha: 1   2  0
or
2) Ha: 1   2 or Ha: 1   2  0
or
3) Ha: 1   2 or Ha: 1   2  0
Compute test statistic, find P-value,
and draw sketch
Compute difference between sample means
(because hypothesized mean difference is
zero), measured in estimated standard
errors.
Compute test statistic, find P-value,
and draw sketch
Write conclusion, linked to computations and
in context of problem
a) when do you reject the null hypothesis?
b) when do you not reject the null
hypothesis?
c) when do you accept the null hypothesis?
Write conclusion, linked to computations and
in context of problem
If you are using fixed-level testing:
a) when do you reject the null hypothesis?
When P-value is less than significance
level
b) when do you not reject the null
hypothesis?
c) when do you accept the null hypothesis?
Write conclusion, linked to computations and
in context of problem
If you are using fixed-level testing:
a) when do you reject the null hypothesis?
When P-value is less than significance level
b) when do you not reject the null
hypothesis? When P-value is greater
than or equal to significance level
c) when do you accept the null hypothesis?
Write conclusion, linked to computations and
in context of problem
If you are using fixed-level testing:
a) when do you reject the null hypothesis?
When P-value is less than significance level
b) when do you not reject the null
hypothesis? When P-value is greater than
or equal to significance level
c) when do you accept the null hypothesis?
never
Page 632, P29
Page 632, P29
Name: Two-sided significance test for the
difference in two means
Two-sided because we are testing to see if
there is evidence of a statistically
significant difference.
Page 632, P29
Check conditions:
(1) We are told this class is a random
sample taken from all students in this
course.
Page 632, P29
Check conditions:
(1) We are told this class is a random
sample taken from all students in this
course.
Samples are independent as one is females
and other is males and no pairing of
subjects is being done.
Page 632, P29
Check conditions:
(2) Data appears fairly symmetric for each
sample so reasonable to assume both
samples came from normal populations.
However, each contains one unusually large
value, which may have great influence on
the results.
Page 632, P29
Check conditions:
(2) Data appears symmetric for each sample so
reasonable to assume both samples came from
normal populations.
However, each contains one unusually large value,
which may have great influence on the results.
Two analyses will be done: once with all data
and once without outliers.
Page 632, P29
Check conditions:
(3) It is reasonable to assume that there are
more than 460 females and 150 males in
the populations of students who take this
course.
Page 632, P29
State hypotheses:
Ho: μf = μm, where μf is the mean study time
of all female students taking this course
and μm is the mean study time of all male
students taking this course.
Page 632, P29
State hypotheses:
Ho: μf = μm, where μf is the mean study time
of all female students taking this course
and μm is the mean study time of all male
students taking this course.
H a : μf ≠ μ m
Computations
2-SampTTest
Inpt: Stats
x1: 10.93
sx1: 6.22
n1: 46
x2: 8.20
sx2: 5.94
n2: 15
μ 1: ≠ μ 2
Pooled: No Yes??
Calculate
Computations
2-SampTTest
Inpt: Stats
x1: 10.93
sx1: 6.22
n1: 46
x2: 8.20
sx2: 5.94
n2: 15
μ 1: ≠ μ 2
Pooled: No
Calculate
Page 632, P29
Compute test statistic, find P-value, and
draw sketch
Use 2-SampTTest
If reverse males and females:
t  ±1.53
t  ± 1.53
P-value  0.1392
P-value  0.1392
Page 632, P29
Write a conclusion in context, linked to your
computations
Page 632, P29
Write a conclusion in context, linked to your
computations
I do not reject the null hypothesis because
the P-value of 0.1392 is greater than the
significance level of 0.05.
Page 632, P29
Write a conclusion in context, linked to your
computations
I do not reject the null hypothesis because
the P-value of 0.1392 is greater than the
significance level of 0.05.
There is not sufficient evidence to support
the claim that there is a statistically
significant difference in mean weekly study
hours for females and males.
Page 632, P29
Are we finished with the problem?
Page 632, P29
Are we finished with the problem?
Need to conduct test again with outliers
removed.
Page 632, P29
Removing outliers:
Name of test:
Page 632, P29
Removing outliers:
Name of test: same
Page 632, P29
Removing outliers:
Name of test: same
Conditions:
Page 632, P29
Removing outliers:
Name of test: same
Conditions: still met
Page 632, P29
Removing outliers:
Name of test: same
Conditions: still met
Hypotheses:
Page 632, P29
Removing outliers:
Name of test: same
Conditions: still met
Hypotheses: same
Page 632, P29
Removing outliers:
Name of test: same
Conditions: still met
Hypotheses: same
Computations:
Page 632, P29
Removing outliers:
Name of test: same
Conditions: still met
Hypotheses: same
Computations: need to do without outliers
What do you get for t and P-value now?
Page 632, P29
Page 632, P29
t ≈ ± 2.657
P-value ≈ 0.012
Page 632, P29
Removing the outliers:
Reject the null hypothesis because the
P-value of 0.012 is less than 0.05.
There is sufficient evidence to support the
claim that there is a statistically significant
difference in mean weekly study hours for
females and males.
Page 632, P29
Removing the outliers:
Reject the null hypothesis because the P-value of
0.012 is less than 0.05.
There is sufficient evidence to support the claim
that there is a statistically significant difference in
mean weekly study hours for females and
males.
So, now do we reject or not reject the null
hypothesis?
Page 632, P29
So, now do we reject or not reject the null
hypothesis?
Since we have a split-decision, we
should:
Page 632, P29
So, now do we reject or not reject the null
hypothesis?
Since we have a split-decision, we should:
(1) get more data
(2)
Page 632, P29
So, now do we reject or not reject the null
hypothesis?
Since we have a split-decision, we should:
(1) get more data
(2) recheck the accuracy of the outliers (if
possible)
Power
Power of a significance test is the probability
of rejecting the null hypothesis.
Best way to get more power to reject a
false null hypothesis is to increase
sample sizes.
Power
If we have reason to believe the population
standard deviations are about equal,
make the sample sizes the same.
Power
If we have reason to believe the population
standard deviations are about equal, make
the sample sizes the same.
If we have reason to believe that one
population’s standard deviation is
larger than the other’s, allocate our
resources so you take a larger sample
from the population with the larger
standard deviation.
Questions?