Download Two Sample Problems

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Psychometrics wikipedia , lookup

Foundations of statistics wikipedia , lookup

Degrees of freedom (statistics) wikipedia , lookup

History of statistics wikipedia , lookup

Taylor's law wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

German tank problem wikipedia , lookup

Statistical inference wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Misuse of statistics wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
Chapter 19: Two-Sample Problems
STAT 1450
19.0 Two-Sample Problems
Connecting Chapter 18 to our
Current Knowledge of Statistics
Population
Parameter
Point Estimate
Confidence
Interval
ΞΌ (Οƒ known)
π‘₯
π‘₯ ± π‘§βˆ—
𝜎
𝑛
𝑧=
π‘₯ βˆ’ πœ‡0
𝜎 𝑛
ΞΌ (Οƒ unknown)
s
π‘₯ ± π‘‘βˆ—
𝑠
𝑛
𝑑=
π‘₯ βˆ’ πœ‡0
𝑠 𝑛
Test Statistic
β–Έ Remember that these formulas are only valid when appropriate simple
conditions apply!
19.0 Two-Sample Problems
Connecting Chapter 18 to our
Current Knowledge of Statistics
β–Έ Matched pairs were covered at the end of Chapter 18.
A common situation requiring matched pairs is when before-and-after
measurements are taken on individual subjects.
β–Έ Example: Prices for a random sample of tickets to a 2008 Katy Perry
concert were compared with the ticket prices (for the same seats) to
her 2013 concert..
 The data could be consolidated into 1 column of differences in ticket prices.
 A test of significance, or, a confidence interval would then occur for
β€œ1 sample of data.”
19.1 The Two-Sample Problem
The Two-Sample Problems
β–Έ Two-sample problems require us to compare:
 the response to two treatments
- or  the characteristics of two populations.
β–Έ We have a separate sample from each treatment or population.
19.1 The Two-Sample Problem
The Two-Sample Problem
β–Έ Example: Suppose a random samples of ticket prices for concerts by
the Rolling Stones was obtained. For comparison purposes another
random sample of Coldplay ticket prices was obtained. Note these are
not necessarily the same seats or even the same venues.
β–Έ Question: Are these samples more likely to be independent or
dependent?
a)
Independent
b)
Dependent
c)
Not sure
19.1 The Two-Sample Problem
The Two-Sample Problem
β–Έ Example: Suppose a random samples of ticket prices for concerts by
the Rolling Stones was obtained. For comparison purposes another
random sample of Coldplay ticket prices was obtained. Note these are
not necessarily the same seats or even the same venues.
β–Έ Question: Are these samples more likely to be independent or
dependent?
a)
Independent
b)
Dependent
c)
Not sure
19.1 The Two-Sample Problem
Two-Sample Problems
β–Έ The end of Chapter 18 described inference procedures for the mean
difference in two measurements on one group of subjects (e.g., pulse
rates for 12 students before-and-after listening to music).
β–Έ Given our answer from above, and the likelihood that each sample has
different sample sizes, variances, etc… Chapter 19 focuses on the
difference in means for 2 different groups.
Population
Parameter
Point Estimate
πœ‡1 βˆ’ πœ‡2
π‘₯1 βˆ’ π‘₯2
Confidence
Interval
Test Statistic
19.2 Comparing Two Population Means
Sampling Distribution of Two Sample Means
β–Έ Recall that for a single sample mean π‘₯
 The standard deviation of a statistic is estimated from data the
result is called the standard error of the statistic.
 The standard error of π‘₯ is 𝑠
𝑛
.
Inference in the two-sample problem will
require the standard error of the difference
of two sample means π’™πŸ βˆ’ π’™πŸ .
x12
19.2 Comparing Two Population Means
Sampling Distribution of Two Sample Means
β–Έ The following table stems from the above comment on standard error
and statistical theory.
Variable
Parameter Point Estimate
Population
Standard Deviation
Standard Error
x1
m1
π‘₯1
s1
𝑠1
𝑛1
x2
m2
π‘₯2
s2
𝑠2
𝑛2
π‘₯1 βˆ’ π‘₯2
𝜎12 𝜎22
+
𝑛1 𝑛2
𝑠12 𝑠22
+
𝑛1 𝑛2
Diff =
x1 - x2
m1 - m2
19.2 Comparing Two Population Means
Example: SSHA Scores
β–Έ The Survey of Study Habits and Attitudes (SSHA) is a psychological
test designed to measure various academic behaviors (motivation,
study habits, attitudes, etc…) of college students. Scores on the SSHA
range from 0 to 200. The data for random samples 17 women
(**the outlier from the original data set was removed**) and 20 men
yielded the following summary statistics.
β–Έ Is there a difference in SSHA performance based upon gender?
19.2 Comparing Two Population Means
Example: SSHA Scores
β–Έ Summary statistics for the two groups are below:
Group
Sample
Mean
Sample Standard
Deviation
Sample
Size
Women**
139.588
20.363
17
Men
122.5
32.132
20
 There is a difference in these two groups.
The women’s average was 17.5 points > than the men’s average.
19.2 Comparing Two Population Means
Example: SSHA Scores
β–Έ Summary statistics for the two groups are below:
Group
Sample
Mean
Sample Standard
Deviation
Sample
Size
Women**
139.588
20.363
17
Men
122.5
32.132
20
 There is a difference in these two groups.
The women’s average was 17.5 points > than the men’s average.
 Yet, the standard deviations are larger than this sample difference,
and the sample sizes are about the same.
19.2 Comparing Two Population Means
Example: SSHA Scores
β–Έ Summary statistics for the two groups are below:
Group
Sample
Mean
Sample Standard
Deviation
Sample
Size
Women**
139.588
20.363
17
Men
122.5
32.132
20
 There is a difference in these two groups.
The women’s average was 17.5 points > than the men’s average.
 Yet, the standard deviations are larger than this sample difference,
and the sample sizes are about the same.
 Is this difference significant enough to conclude
that πœ‡women is larger than πœ‡men?
19.2 Comparing Two Population Means
Example: SSHA Scores
β–Έ Summary statistics for the two groups are below:
Group
Sample
Mean
Sample Standard
Deviation
Sample
Size
Women**
139.588
20.363
17
Men
122.5
32.132
20
 There is a difference in these two groups.
The women’s average was 17.5 points > than the men’s average.
 Yet, the standard deviations are larger than this sample difference,
and the sample sizes are about the same.
 Is this difference significant enough to conclude
that πœ‡women is larger than πœ‡men?
Let’s learn more!
19.3 Two-Sample t Procedures
The Two-sample t Procedures: Derived
β–Έ Now that we have a point estimate and a formula for the standard error,
we can conduct statistical inference for the difference in two population
means.
Chapter
Parameter of Interest
Point
Estimate
Standard
Error
18
m
(Οƒ unknown; 1-sample)
π‘₯
𝑠
𝑛
19
ΞΌ 1 - ΞΌ2
(Οƒ1, Οƒ2 unknown;
2-samples)
Confidence Interval
π‘₯ ± π‘‘βˆ—
𝑠
𝑛
pt. estimate ± t*(standard error)
π‘₯1 βˆ’ π‘₯2
𝑠12 𝑠22
+
𝑛1 𝑛2
19.3 Two-Sample t Procedures
The Two-sample t Procedures: Derived
β–Έ Now that we have a point estimate and a formula for the standard error,
we can conduct statistical inference for the difference in two population
means.
Chapter
Parameter of Interest
Point
Estimate
Standard
Error
18
m
(Οƒ unknown; 1-sample)
π‘₯
𝑠
𝑛
19
ΞΌ 1 - ΞΌ2
(Οƒ1, Οƒ2 unknown;
2-samples)
Confidence Interval
π‘₯ ± π‘‘βˆ—
𝑠
𝑛
pt. estimate ± t*(standard error)
π‘₯1 βˆ’ π‘₯2
𝑠12 𝑠22
+
𝑛1 𝑛2
(π‘₯1 βˆ’ π‘₯2 ) ± t*
𝑠12
𝑛1
+
𝑠22
𝑛2
19.3 Two-Sample t Procedures
The Two-sample t Procedures: Derived
Chapter
Parameter of
Interest
Point
Estimate
Standard
Error
Test Statistic
18
ΞΌ
(Οƒ unknown;
1-sample)
π‘₯
𝑠
𝑛
π‘₯ βˆ’ πœ‡0
𝑑=
𝑠/ 𝑛
m 1 - ΞΌ2
19
(Οƒ1, Οƒ2
unknown;
2-samples)
π‘₯1 βˆ’ π‘₯2
𝑠12 𝑠22
+
𝑛1 𝑛2
pt. estimate – m0
standard error
Note: H0 for our purposes will be that m1=m2;
which is equivalent to there being a mean difference of β€˜0.’
19.3 Two-Sample t Procedures
The Two-sample t Procedures: Derived
Chapter
Parameter of
Interest
Point
Estimate
Standard
Error
Test Statistic
18
ΞΌ
(Οƒ unknown;
1-sample)
π‘₯
𝑠
𝑛
π‘₯ βˆ’ πœ‡0
𝑑=
𝑠/ 𝑛
m 1 - ΞΌ2
19
(Οƒ1, Οƒ2
unknown;
2-samples)
π‘₯1 βˆ’ π‘₯2
𝑠12 𝑠22
+
𝑛1 𝑛2
pt. estimate – m0
standard error
𝑑=
(π‘₯1 βˆ’ π‘₯2 ) βˆ’ 0
𝑠12 𝑠22
𝑛1 + 𝑛2
Note: H0 for our purposes will be that m1=m2;
which is equivalent to their being a mean difference of β€˜0.’
19.3 Two-Sample t Procedures
The Two-sample t Procedures
β–Έ Now we can complete the table from earlier:
Population
Parameter
Point Estimate
πœ‡1 βˆ’ πœ‡2
π‘₯1 βˆ’ π‘₯2
Confidence Interval
Test Statistic
t* is the critical value for confidence level C for the
t distribution with df = smaller of (n1-1) and (n2-1).
Find P-values from the t distribution with df = smaller of (n1-1) and (n2-1).
19.3 Two-Sample t Procedures
The Two-sample t Procedures
β–Έ Now we can complete the table from earlier:
Population
Parameter
πœ‡1 βˆ’ πœ‡2
Point Estimate
π‘₯1 βˆ’ π‘₯2
Confidence Interval
(π‘₯1 βˆ’ π‘₯2 ) ± t*
𝑠12
𝑛1
+
Test Statistic
𝑠22
𝑛2
t* is the critical value for confidence level C for the
t distribution with df = smaller of (n1-1) and (n2-1).
Find P-values from the t distribution with df = smaller of (n1-1) and (n2-1).
19.3 Two-Sample t Procedures
The Two-sample t Procedures
β–Έ Now we can complete the table from earlier:
Population
Parameter
πœ‡1 βˆ’ πœ‡2
Point Estimate
π‘₯1 βˆ’ π‘₯2
Confidence Interval
(π‘₯1 βˆ’ π‘₯2 ) ± t*
𝑠12
𝑛1
+
𝑠22
𝑛2
Test Statistic
𝑑=
(π‘₯1 βˆ’ π‘₯2 ) βˆ’ 0
𝑠12 𝑠22
+
𝑛1 𝑛2
t* is the critical value for confidence level C for the
t distribution with df = smaller of (n1-1) and (n2-1).
Find P-values from the t distribution with df = smaller of (n1-1) and (n2-1).
19.3 Two-Sample t Procedures
The Two-sample t Procedures: Confidence
Intervals
β–Έ Draw an SRS of size n1 from a large Normal population with unknown mean
πœ‡1 , and draw an independent SRS of size n2 from another large Normal
population with unknown mean πœ‡2 . A level C confidence interval for πœ‡2 -πœ‡1 is
given by
(π‘₯1 βˆ’ π‘₯2 ) ± t*
𝑠12
𝑛1
𝑠2
+ 𝑛2
2
β–Έ Here t* is the critical value for confidence level C for the t distribution with
degrees of freedom from either Option 1(computer generated) or
Option 2 (the smaller of n1 – 1 and n2 – 1).
19.3 Two-Sample t Procedures
The Two-sample t Procedures: Significance
Tests
β–Έ To test the hypothesis H0: ΞΌ1 - ΞΌ2 , calculate the two-sample t statistic
𝑑=
(π‘₯1 βˆ’ π‘₯2 )
𝑠12 𝑠22
𝑛1 + 𝑛2
β–Έ Find p-values from the t distribution with df = smaller of (n1-1) and (n2-1).
19.0 Two-Sample Problems
Conditions for Inference Comparing TwoSample Means and Robustness of t Procedures
β–Έ The general structure of our necessary conditions is an extension of
the one-sample cases.
 Simple Random Samples:
 Do we have 2 simple random samples?
 Population : Sample Ratio:
 The samples must be independent and from two large populations of
interest.
19.0 Two-Sample Problems
Conditions for Inference Comparing TwoSample Means and Robustness of t Procedures
 Large enough sample:
Both populations will be assumed to be from a Normal distribution and
 when the sum of the sample sizes is less than 15, t procedures can
be used if the data close to Normal (roughly symmetric, single peak, no
outliers)? If there is clear skewness or outliers then, do not use t.
 when the sum of the sample sizes is between 15 and 40, t procedures
can be used except in the presences of outliers or strong skewness.
 when the sum of the sample sizes is at least 40, the t procedures can
be used even for clearly skewed distributions.
19.0 Two-Sample Problems
Conditions for Inference Comparing TwoSample Means and Robustness of t Procedures
β–Έ Note: In practice it is enough that the two distributions have similar
shape with no strong outliers. The two-sample t procedures are even
more robust against non-Normality than the one-sample procedures.
β–Έ Now that we have a point estimate and a formula for the standard error,
we can conduct statistical inference for the difference in two population
means.
19.3 Two-Sample t Procedures
Poll: SSHA Scores
β–Έ Suppose we have a goal of measuring the mean difference in SSHA
between women and men. Which seems more plausible?
a.
µWomen-µMen = 0
(There is no difference.)
b.
µWomen - µMen β‰  0
(There is some difference.)
19.3 Two-Sample t Procedures
Example: SSHA Scores
β–Έ The summary statistics for the SSHA scores for random samples of
men and women are below. Use this information to construct a 90%
confidence interval for the mean difference.
Group
Sample
Mean
Sample Standard
Deviation
Sample
Size
Women
139.588
20.363
17
Men
122.5
32.132
20
18.3 One-Sample t Confidence Intervals
Example: 90% CI for SSHA Scores
1. Components
1.
οƒΌDo we have two simple random samples?
Yes. It was stated.
οƒΌLarge enough population: sample ratio?
Yes.
NW > 20*17 = 340
NM > 20*20 = 400
οƒΌLarge enough sample?
Yes.
nW + nM =37 < 40
but outlier has been removed.
No skewness.
Steps for SuccessConstructing Confidence Intervals
for m1 - m2 .
Confirm that the 3 key conditions are satisfied
(SRS?, N:n?, t-distribution?).
18.3 One-Sample t Confidence Intervals
Example: 90% CI for SSHA Scores
2. Components.
π’™π’˜ = 139.588, sw = 20.363, nw = 17
π’™π’Ž = 122.5, sm = 32.132, nm = 20
Steps for SuccessConstructing Confidence Intervals
for m1 - m2 .
1.
2.
3.
4.
5.
Confirm that the 3 key conditions are satisfied
(SRS?, N:n?, t-distribution?).
Identify the 3 key components of the
confidence interval (means, s.ds., n1 , n2 ).
Select t*.
Construct the confidence interval.
*Interpret* the interval.
18.3 One-Sample t Confidence Intervals
Example: 90% CI for SSHA Scores
2. Components.
π’™π’˜ = 139.588, sw = 20.363, nw = 17
π’™π’Ž = 122.5, sm = 32.132, nm = 20
3. Select t*.
df =min{(nw -1), (nm -1)}=16
t*(90%, 16) = 1.746
Steps for SuccessConstructing Confidence Intervals
for m1 - m2 .
1.
2.
3.
4.
5.
Confirm that the 3 key conditions are satisfied
(SRS?, N:n?, t-distribution?).
Identify the 3 key components of the
confidence interval (means, s.ds., n1 , n2 ).
Select t*.
Construct the confidence interval.
*Interpret* the interval.
18.3 One-Sample t Confidence Intervals
Example: 90% CI for SSHA Scores
2. Components.
π’™π’˜ = 139.588, sw = 20.363, nw = 17
π’™π’Ž = 122.5, sm = 32.132, nm = 20
Steps for SuccessConstructing Confidence Intervals
for m1 - m2 .
1.
2.
3.
4.
5.
3. Select t*.
df =min{(nw -1), (nm -1)}=16
t*(90%, 16) = 1.746
Confirm that the 3 key conditions are satisfied
(SRS?, N:n?, t-distribution?).
Identify the 3 key components of the
confidence interval (means, s.ds., n1 , n2 ).
Select t*.
Construct the confidence interval.
*Interpret* the interval.
4. Interval.
139.588 βˆ’ 122.5 ± 1.746
20.3632
17
+
32.1322
20
17.088 ± 15.222 = 1.866 to 32.31
18.3 One-Sample t Confidence Intervals
Example: 90% CI for SSHA Scores
2. Components.
π’™π’˜ = 139.588, sw = 20.363, nw = 17
π’™π’Ž = 122.5, sm = 32.132, nm = 20
Steps for SuccessConstructing Confidence Intervals
for m1 - m2 .
1.
2.
3.
4.
5.
3. Select t*.
df =min{(nw -1), (nm -1)}=16
t*(90%, 16) = 1.746
Confirm that the 3 key conditions are satisfied
(SRS?, N:n?, t-distribution?).
Identify the 3 key components of the
confidence interval (means, s.ds., n1 , n2 ).
Select t*.
Construct the confidence interval.
*Interpret* the interval.
4. Interval.
139.588 βˆ’ 122.5 ± 1.746
20.3632
17
+
32.1322
20
17.088 ± 15.222 = 1.866 to 32.31
5. Interpret.
We are 90% confident that the mean women’s SSHA score is
between 1.866 and 32.31 points higher than men’s.
19.3 Two-Sample t Procedures
Example: SSHA Scores
β–Έ Let’s continue with this example by now conducting a test of
significance for the mean difference in SSHA by gender at a=0.10.
Does our decision align with the results from the earlier poll?
Group
Sample
Mean
Sample Standard
Deviation
Sample
Size
Women
139.588
20.363
17
Men
122.5
32.132
20
19.3 Two-Sample t Procedures
Example: SSHA Scores
State: Is there a difference in the mean SSHA scores between men and women?
(i.e., mDiff β‰  0, mWomen βˆ’ mMen β‰  0, mWomen β‰  mMen )
Plan:
a.) Identify the parameter.
19.3 Two-Sample t Procedures
Example: SSHA Scores
State: Is there a difference in the mean SSHA scores between men and women?
(i.e., mDiff β‰  0, mWomen βˆ’ mMen β‰  0, mWomen β‰  mMen )
Plan:
a.) Identify the parameter.
mDiff =mWomen - mMen.
b) List all given information from the data collected.
19.3 Two-Sample t Procedures
Example: SSHA Scores
State: Is there a difference in the mean SSHA scores between men and women?
(i.e., mDiff β‰  0, mWomen βˆ’ mMen β‰  0, mWomen β‰  mMen )
Plan:
a.) Identify the parameter.
mDiff =mWomen - mMen.
b) List all given information from the data collected. π’™π’˜ = 139.588, sw = 20.363, nw = 17
π’™π’Ž = 122.5,
c) State the null (H0) and alternative (HA) hypotheses.
sm = 32.132, nm = 20
19.3 Two-Sample t Procedures
Example: SSHA Scores
State: Is there a difference in the mean SSHA scores between men and women?
(i.e., mDiff β‰  0, mWomen βˆ’ mMen β‰  0, mWomen β‰  mMen )
Plan:
a.) Identify the parameter.
mDiff =mWomen - mMen.
b) List all given information from the data collected. π’™π’˜ = 139.588, sw = 20.363, nw = 17
π’™π’Ž = 122.5,
c) State the null (H0) and alternative (HA) hypotheses.
sm = 32.132, nm = 20
H0: mDiff = 0
Ha : mDiff β‰  0
19.3 Two-Sample t Procedures
Example: SSHA Scores
State: Is there a difference in the mean SSHA scores between men and women?
(i.e., mDiff β‰  0, mWomen βˆ’ mMen β‰  0, mWomen β‰  mMen )
Plan:
a.) Identify the parameter.
mDiff =mWomen - mMen.
b) List all given information from the data collected. π’™π’˜ = 139.588, sw = 20.363, nw = 17
π’™π’Ž = 122.5,
c) State the null (H0) and alternative (HA) hypotheses.
sm = 32.132, nm = 20
H0: mDiff = 0
Ha : mDiff β‰  0
d) Specify the level of significance. a =.10
e) Determine the type of test.
Left-tailed
Right-tailed
Two-Tailed
19.3 Two-Sample t Procedures
Example: SSHA Scores
Plan:
f) Sketch the region(s) of β€œextremely unlikely” test statistics.
19.3 Two-Sample t Procedures
Example: SSHA Scores
Solve:
a)
Check the conditions for the test you plan to use.
 Two Simple Random Samples?
 Large enough population: sample ratios?
 Large enough samples?
19.3 Two-Sample t Procedures
Example: SSHA Scores
Solve:
a)
Check the conditions for the test you plan to use.
 Two Simple Random Samples?
Yes.
Stated as a random sample.
 Large enough population: sample ratios?
Yes. Both populations are arbitrarily large;
much greater than, NW > 20*17 = 340; NM > 20*20 = 400
 Large enough samples?
Yes. nW + nM =37 < 40 outlier has been removed. No skewness.
19.3 Two-Sample t Procedures
Example: SSHA Scores
Solve:
b)
c)
Calculate the test statistic
t=
π‘₯𝑀 βˆ’π‘₯π‘š
𝑠𝑀 2 π‘ π‘š 2
+
𝑛𝑀
π‘›π‘š
=
Determine (or approximate) the P-Value.
19.3 Two-Sample t Procedures
Example: SSHA Scores
Solve:
b)
c)
Calculate the test statistic
t=
π‘₯𝑀 βˆ’π‘₯π‘š
𝑠𝑀 2 π‘ π‘š
+
𝑛𝑀
π‘›π‘š
=
2
Determine (or approximate) the P-Value.
139.588βˆ’122.5
20.3632 32.1322
+ 20
17
=
17.088
8.719
= 1.96
19.3 Two-Sample t Procedures
Example: SSHA Scores
Solve:
b)
c)
Calculate the test statistic
t=
π‘₯𝑀 βˆ’π‘₯π‘š
𝑠𝑀 2 π‘ π‘š
+
𝑛𝑀
π‘›π‘š
=
2
Determine (or approximate) the P-Value.
139.588βˆ’122.5
20.3632 32.1322
+ 20
17
1.96
 1.746 < 1.96 < 2.12
 .05 < P-value < .10
P-value
=
17.088
8.719
DF = 17 - 1
= 1.96
19.3 Two-Sample t Procedures
Example: SSHA Scores
Conclude:
a) Make a decision about the null hypothesis (Reject H0 or Fail to reject H0).
19.3 Two-Sample t Procedures
Example: SSHA Scores
Conclude:
a) Make a decision about the null hypothesis (Reject H0 or Fail to reject H0).
Because the approximate P-value is smaller than 0.10,
we reject the null hypothesis.
b) Interpret the decision in the context of the original claim.
19.3 Two-Sample t Procedures
Example: SSHA Scores
Conclude:
a) Make a decision about the null hypothesis (Reject H0 or Fail to reject H0).
Because the approximate P-value is smaller than 0.10,
we reject the null hypothesis.
b) Interpret the decision in the context of the original claim.
There is enough evidence (at a=.10)
that there is a difference in the mean SSHA score
between men and women.
19.3 Two-Sample t Procedures
Example: SSHA Scores
β–Έ Let’s continue with this example by now conducting a test of
significance for the mean difference in SSHA by gender at a=0.10.
Does our decision align with the results from the earlier poll?
________
Group
Sample
Mean
Sample Standard
Deviation
Sample
Size
Women
139.588
20.363
17
Men
122.5
32.132
20