Download Document

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Chapter 17
Inference about a
Population Mean
5/24/2017
Inference about µ
1
σ not known
In practice, we do not usually know
population standard deviation σ
Therefore, we cannot calculate σx-bar
Instead, we calculate this standard error
of the mean:
SE x  s
5/24/2017
n
Inference about µ
2
t Procedures
Because σ is now known, we do NOT use z
statistics. Instead, we use this t statistic
x  μ0
z
σ
n

x  μ0
t
s
n
T procedures are based on
Student’s t distribution
5/24/2017
Inference about µ
3
Student’s t Distributions
• A “family” of distributions
• Each family member has different
degrees of freedom (df)
• More area in their tails than Normal
distributions (fatter tails)
• As df increases, s becomes a better
estimate of σ and the t distributions
becomes more Normal
• t with more than 30 df  very similar to z
5/24/2017
Inference about µ
4
t Distributions
5/24/2017
Inference about µ
5
Table C “t Table”
Table entries = t* critical values
Rows = df; Columns = probability levels
Familiarize yourself with the t table in the
“Tables and Formulas for Moore” handout
5/24/2017
Inference about µ
6
Using Table C
Question: What t critical value should I use for
95% confidence when df = 7?
Answer: t* = 2.365
5/24/2017
Inference about µ
7
Confidence Interval for μ
s
x t 
n

t* is the critical value with df = n−1
and C level of confidence
Lookup in Table C
5/24/2017
Inference about µ
8
Example
Statement : What is the population mean µ birth
weight of the SIDS population?
Data: We take an SRS of n = 10 from the population of
SIDS babies and retrieve their birth certificates. This was
their birth weights (grams): 2998, 3740, 2031, 2804,
2454, 2780, 2203, 3803, 3948, 2144
Plan: We will calculate the sample mean and
standard deviation. We will then calculate and
interpret the 95% CI for µ.
9
Example (Solution)
x  2890.5 grams s  720 grams
df  n  1  10  1  9; For 95% confidence : t *  2.262 (Table C)
s
95% CI for   x  t 
n
720
 2890.5  2.262 
10
 2890.5 ± 515.1
*
= (2375 to 3406) grams
We are 95% confident population mean µ is
between 2375 and 3406 gms.
10
One-Sample t Test (Hypotheses)
• Draw simple random sample of size n from a
large population having unknown mean µ
• Test null hypothesis H0: μ = μ0
where μ0 ≡ stated value for the population mean
– μ0 changes from problem to problem
– μ0 is NOT based on the data
– μ0 IS based on the research question
• The alternative hypothesis is:
– Ha: μ > μ0 (one-sided looking for a larger value) OR
– Ha: μ < μ0 (one-sided looking for a smaller value) OR
– Ha: μ ≠ μ0 (two-sided)
5/24/2017
Inference about µ
11
One-Sample t Test
One-sample t statistic: t 
x  μ0
s
n
with df  n  1
P-value = tail
beyond tstat
(use Table C)
5/24/2017
Inference about µ
12
P-value: Interpretation
• P-value (interpretation) Smaller-and-smaller Pvalues indicate stronger-and-stronger evidence
against H0
• Conventions:
.10 < P < 1.0  evidence against H0 not significant
.05 < P ≤ .10  evidence against H0 marginally signif.
.01 < P ≤ .05  evidence against H0 significant
P ≤ .01  evidence against H0 highly significant
Basics of Significance Testing
13
Example: “Weight Gain”
Statement: We want to know whether there is good
evidence for weight change in a particular population. We
take an SRS on n = 10 from this population and find the
following changes in weight (lbs).
2.0, 0.4, 0.7, 2.0, −0.4, 2.2, −1.3, 1.2, 1.1, 2.3
Calculate: x  1.020 lbs.; s  1.196 lbs.
Do data provide significant evidence for a
weight change?
5/24/2017
Inference about µ
14
Example “Weight Gain” (Hypotheses)
• Under null hypothesis, no weight gain in population
H0: μ = 0
Note: µ0 = 0 in this particular example
• One-sided alternative, weight gain in population.
Ha: μ > 0
• Two-sided alternative hypothesis, weight change:
Ha: μ ≠ 0
5/24/2017
Inference about µ
15
Example (Test Statistic)
t 
x  μ0
s

1.020  0
 2.70
1.196
n
10
df  10  1  9
5/24/2017
Inference about µ
16
Example (P-value)
• Table C, row for 9 df
• t statistic (2.70) is between
t* = 2.398 (P = 0.02) and t* = 2.821 (P = 0.01)
• One-sided P-value is between .01 and .02:
.01 < P < .02
5/24/2017
Inference about µ
17
Two-tailed P-value
• For two-sided Ha,
P-value = 2 × onesided P
• In our example, the
one-tailed P-value was
between .01 and .02
• Thus, the two-tailed P
value is between .02
and .04
18
Interpretation
• Interpret P-value in
context of claim made
by H0
• In our example,
H0: µ = 0
(no weight gain)
• Two-tailed P-value
between .02 and .04
• Conclude: significant
evidence against H0
19
Paired Samples
Responses in matched pairs
Parameter μ now represents the
population mean difference
5/24/2017
Inference about µ
20
Example: Matched Pairs
• Pollution levels in
two regions (A & B)
on 8 successive days
• Do regions differ
significantly?
• Subtract B from A =
last column
• Analyze differences
Day
1
A
2.92
B
1.84
A–B
1.08
2
3
4
1.88
5.35
3.81
0.95
4.26
3.18
0.93
1.09
0.63
5
6
7
4.69
4.86
5.81
3.44
3.69
4.95
1.25
1.17
0.86
8
5.55
4.47
1.08
x  1.0113 and s  0.1960
5/24/2017
Inference about µ
21
Example: Matched Pairs
Hypotheses:
H0: μ = 0 (note: µ0 = 0, representing no mean
difference)
Ha: μ > 0 (one-sided)
Ha: μ ≠ 0 (two-sided)
Test Statistic:
t 
x  μ0
s
 14.59
0.1960
n
5/24/2017

1.0113  0
8
Inference about µ
df  n  1  8  1  7
22
Illustration (cont.)
P-value:
• Table C  7 df row
• t statistic is greater than largest value in
table: t* = 5.408 (upper p = 0.0005).
• Thus, one-tailed P < 0.0005
• Two-tailed P = 2 × one-tailed P-value:
P < 0.001
• Conclude: highly significant evidence
against H0
5/24/2017
Inference about µ
23
95% Confidence Interval for µ
Air pollution data:
n = 8, x-bar = 1.0113, s = 0.1960
df = 8  1 = 7
For 95% confidence, use t* = 2.365 (Table C)
x t

s
n
 1.0113  2.365
0.1960
 1.0113  0.1639
8
 0.8474 to 1.1752
95% confidence population mean difference µ is between 0.847 and 1.175
5/24/2017
Inference about µ
24
Interpreting the Confidence Interval



The confidence interval seeks population mean
difference µ (IMPORTANT)
Recall the meaning of “confidence,” i.e., the
ability of the interval to capture µ upon
repetition
Recall from the prior chapter that the
confidence interval can be used to address a
null hypothesis
5/24/2017
Inference about µ
25
Normality Assumption
• t procedures require Normality, but they are
robust when n is “large”
• Sample size less than 15: Use t procedures if
data are symmetric, have a single peak with no
outliers. If data are highly skewed, avoid t.
• Sample size at least 15: Use t procedures
except in the presence of strong skewness.
• Large samples: Use t procedures even for
skewed distributions when the sample is large (n
≥ ~40)
5/24/2017
Inference about µ
26
Can we use a t procedure?
Moderately sized dataset (n = 20) w/strong skew.
t procedures cannot be trusted
5/24/2017
Inference about µ
27
Word lengths in Shakespeare’s
plays (n ≈ 1000)
The data has a strong positive skew but since the
sample is large, we can use t procedures.
5/24/2017
Inference about µ
28
Can we use t?
The distribution has no clear violations of
Normality. Therefore, we trust the t procedure.
5/24/2017
Inference about µ
29
Related documents