Download Confidence Intervals for two means

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Degrees of freedom (statistics) wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Confidence interval wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
Section 10.2 – Confidence Intervals for πœ‡1 βˆ’ πœ‡2
Conditions:
ο‚· Random: Data come from two independent random samples or from two groups in a randomized
experiment.
1
1
o 10%: When sampling without replacement, check that 𝑛1 ≀ 10 𝑁1 and 𝑛2 ≀ 10 𝑁2 .
ο‚·
Normal/Large Sample: Satisfied if:
o Both Populations are Normal OR
o Both Samples are β‰₯ 30 OR
o Graphs of the data show NO outliers or strong skewness (t-procedures)
The Two-Sample t Statistic
It is very unlikely that we will be given the population standard deviation (Οƒ) for each population, so we will be
using t – procedures.
Μ…πŸ βˆ’ 𝒙
Μ…πŸ
Standard Error of 𝒙
𝑆𝐸π‘₯Μ… 1 βˆ’π‘₯Μ…2 = √
𝑠12 𝑠22
+
𝑛1 𝑛2
Two-Sample t Interval for a Difference Between Two Means
When conditions are met, an approximate C% confidence interval for π‘₯Μ…1 βˆ’ π‘₯Μ…2 is
𝑠12
(π‘₯Μ…1 βˆ’ π‘₯Μ…2 ) ± 𝑑 βˆ— √
𝑛1
𝑠2
+ 𝑛2
2
Where t* is the critical value with C% of its area between –t* and t* for the t distribution with degrees of
freedom technology or smaller than n1 – 1 and n2 – 1.
***Using the table with degrees of freedom, we will use the smaller d.f. from the samples.
Example, pg. 641 Big Trees, Small Trees, Short Trees, Tall Trees The Wade Tract Preserve in Georgia is
an old-growth forest of longleaf pines that has survived in a relatively undisturbed state for hundreds of years.
One question of interest to foresters who study the area is β€œHow do the sizes of longleaf pine trees in the
northern and southern halves of the forest compare?” To find out, researchers took random samples of 30
trees from each half and measured the diameter at breast height (DBH) in centimeters. The summary statistics
are as follows:
Descriptive Statistics:
Variable
North
South
North, South
N
Mean
30
23.70
30
34.53
St Dev
17.50
14.26
Construct and interpret a 90% confidence interval for the difference in the mean DBH of longleaf pines in the
northern and southern halves of the Wade Tract Preserve.
State: We want to estimate the difference πœ‡1 βˆ’ πœ‡2 where πœ‡1 = the true mean DBH of all trees in the southern
half of the forest and πœ‡2 = the true mean DBH of all trees in the northern half of the forest at a 90%
confidence level.
Plan: Two-sample t interval for πœ‡1 βˆ’ πœ‡2
1. Random: The data come from independent random samples of 30 trees each from the northern and
southern halves of the forest.
1
1
ο‚· 10%: 30 ≀ 10 (𝐴𝑙𝑙 π‘‘π‘Ÿπ‘’π‘’π‘  𝑖𝑛 π‘‘β„Žπ‘’ π‘›π‘œπ‘Ÿπ‘‘β„Žπ‘’π‘Ÿπ‘› β„Žπ‘Žπ‘™π‘“), 30 ≀ 10 (𝐴𝑙𝑙 π‘‘π‘Ÿπ‘’π‘’π‘  𝑖𝑛 π‘‘β„Žπ‘’ π‘ π‘œπ‘’π‘‘β„Žπ‘’π‘Ÿπ‘› β„Žπ‘Žπ‘™π‘“)
Fairly safe to assume there are more than 300 trees in each half of the forest.
2. Normal/Large Sample: Because both sample sizes are 30, it is safe to use t-procedures by the Central
Limit Theorem.
Do:
Method 1: Using a table
We’ll use d.f. = 29 οƒ  t* = 1.699
𝑠12
(π‘₯Μ…1 βˆ’ π‘₯Μ…2 ) ± 𝑑 βˆ— √
𝑛1
𝑠2
+ 𝑛2 = (34.53 βˆ’ 23.70) ± 1.699√
2
14.262
30
+
17.52
30
= 10.83 ± 7.00
= (3.83, 17.83)
Method 2: Using the calculator
**CALCULATOR: Use 2-SampTInt: Stats οƒ 
Put in π‘₯Μ… 1: 34.53 Sx1: 14.26, n1: 30, π‘₯Μ… 2: 23.7 Sx2: 17.5, n2: 30 and C-Level: 90 Pooled: NO
Calculate
YOU WRITE: df = 55.728
(3.9362, 17.724)
Conclude: We are 90% confident that the interval from 3.9362 to 17.724 centimeters captures the true
difference in the actual mean DBH of the southern trees and the actual mean DBH of the northern trees.