Download 9.3 Notes for SUB

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Foundations of statistics wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Degrees of freedom (statistics) wikipedia , lookup

Psychometrics wikipedia , lookup

Taylor's law wikipedia , lookup

Omnibus test wikipedia , lookup

Misuse of statistics wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
Section 9-3
Inferences About Two Means:
Independent Samples
Key Concept
This section presents methods for using sample data
from two independent samples to test hypotheses
made about two population means or to construct
confidence interval estimates of the difference between
two population means.
Key Concept
In Part 1 we discuss situations in which the standard
deviations of the two populations are unknown and are
not assumed to be equal. In Part 2 we discuss two
other situations: (1) The two population standard
deviations are both known; (2) the two population
standard deviations are unknown but are assumed to
be equal. Because is typically unknown in real
situations, most attention should be given to the
methods described in Part 1.
Part 1: Independent Samples with σ1
and σ2 Unknown and Not Assumed
Equal
Definitions
Two samples are independent if the sample values
selected from one population are not related to or
somehow paired or matched with the sample
values from the other population.
Two samples are dependent if the sample values
are paired. (That is, each pair of sample values
consists of two measurements from the same
subject (such as before/after data), or each pair of
sample values consists of matched pairs (such as
husband/wife data), where the matching is based
on some inherent relationship.)
Notation
1 = population mean
σ1 = population standard deviation
n1 = size of the first sample
x1 = sample mean
s1 = sample standard deviation
***When we have SAMPLE st. dev. (s)
we use Student T Distributions! (t* and t)
Corresponding notations for 2, σ2, s2, x2 , and n2 apply to
population 2.
Hypothesis Test for Two Means:
Independent Samples
Test Statistic for Two Means: Independent Samples
x1  x2    1  2 

t
2
1
2
2
s
s

n1 n2
(where 1 – 2 is often assumed to be 0)
Hypothesis Test - cont
Degrees of freedom: In this book we use this simple and conservative estimate: df = the
smaller of n1 – 1 and n2 – 1. In other words, use whichever sample is smaller, and take one
away.
Critical values: invT(area to the left, df), with degrees of freedom (df) = the smaller of n1 –
1 and n2 – 1.
*left tailed test, α is in the left tail: invT(, df)
*right tailed test, α is in the right tail: invT(1- , df)
𝛼
*two tailed test, α is divided equally between the two tails: invT( 2 , df)
P-values: Use the tcdf feature on your calculator, with degrees of freedom (df) = the smaller
of n1 – 1 and n2 – 1.
*For right-tailed tests: To find this probability in your calculator, type:
tcdf(t test statistic, 99999999, df)
*For left-tailed tests: To find this probability in your calculator, type:
tcdf(–99999999, –t test statistic, df)
***Don’t forget if your test is two-sided, double your P-value.***
Example 1: A headline in USA Today proclaimed that “Men, women are equal talkers.” That
headline referred to a study of the numbers of words that samples of men and women spoke
in a day. Given below are the results from the study. Use a 0.05 significance level to test the
claim that men and women speak the same mean number of words in a day. Does there
appear to be a difference?
a) State the null hypothesis and
the alternative hypothesis.
H0: μ1 = μ2
H1: μ1 ≠ μ2
b) What is/are the critical value(s)?
.05
= 0.025
2
The degrees of freedom is 185 (186 is the smaller sample. n – 1 = 185)
Type into your calculator: invT(0.025, 185)
z* = ±1.973
𝛼
Since this test is two-tailed, find 2 .
c) Calculate the test statistic.
Use the TMEAN program: The calculator
t = –0.676
will ask for these
6 pieces of data
d) What is the P-value?
Type into your calculator: tcdf(–99999, –0.68, 185)
Since this test is two-tailed, don’t forget to DOUBLE the p –value!
= 0.250×2 = 0.500
e) What is the conclusion?
Compare the p-value to . 0.5 > 0.05
Fail to reject H0. There is NOT enough evidence to suggest that men and
women DO NOT speak the same mean number of words in a day.
Make sure your context matches your alternative hypothesis; not the null hypothesis!
Example 2: A researcher wishes to determine whether people with high blood pressure can
reduce their blood pressure, measured in mm/Hg, by following a particular diet. Use a
significance level of 0.01 to test the claim that the treatment group is from a population with
a smaller mean than the control group.
Treatment Group
Control Group
n1 = 101
n2 = 105
x1  120.5
x2  149.3
s1 = 17.4
s2 = 30.2
a) State the null hypothesis and the alternative hypothesis.
H0: μ1 = μ2
H1: μ1 < μ2
b) What is/are the critical value(s)?
Since this test is left-tailed,  is in the left tail
The degrees of freedom is 100 (101 is the smaller sample. n – 1 = 100)
Type into your calculator: invT(0.01, 100) t* = –2.364
c) Calculate the test statistic. The calculator
Use the TMEAN program:
will ask for these
6 pieces of data
Treatment Group
Control Group
n1 = 101
n2 = 105
x1  120.5
s1 = 17.4
x2  149.3
s2 = 30.2
t = –8.426
d) What is the P-value?
Type into your calculator: tcdf = (–99999, –8.426, 100) = 0
e) What is the conclusion?
Compare the p-value to . 0 < 0.01
Reject H0. There IS enough evidence to suggest that the treatment group
is from a population with a smaller mean than the control group.
Example 3: A researcher was interested in comparing the resting pulse rates of people who
exercise regularly and of those who do not exercise regularly. Independent simple random
samples of 16 people who do not exercise regularly and 12 people who exercise regularly
were selected, and the resting pulse rates (in beats per minute) were recorded. The summary
statistics are as follows.
Do not exercise regularly
Exercise regularly
n1 = 16
n2 = 12
x1  73.4 beats/min
x2  68.1 beats/min
s1 = 10.9 beats/min
s2 = 8.2 beats/min
Use a 0.025 significance level to test the claim that the mean resting pulse rate of people who
do not exercise regularly is larger than the mean resting pulse rate of people who exercise
regularly.
a) State the null hypothesis and the alternative hypothesis.
H0: μ1 = μ2
H1: μ1 > μ2
b) What is/are the critical value(s)?
Since this test is right-tailed,  is in the right tail
The degrees of freedom is 11 (12 is the smaller sample. n – 1 = 11)
Type into your calculator: invT(1 – 0.025, 11) t* = 2.201
c) Calculate the test statistic.
Use the TMEAN
The calculator
will ask for these
program: 6 pieces of data
t = 1.468
Do not exercise regularly
Exercise regularly
n1 = 16
n2 = 12
x1  73.4 beats/min
s1 = 10.9 beats/min
s2 = 8.2 beats/min
x2  68.1 beats/min
d) What is the P-value?
Type in your caclulator: tcdf = (1.468, 99999, 11)
= 0.085
e) What is the conclusion?
Compare the p-value to . 0.085 > 0.025
Fail to reject H0. There IS NOT enough evidence to suggest that the
mean resting pulse rate of people who do not exercise regularly is larger
than the mean resting pulse rate of people who exercise regularly.