Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
MGT 2120: Chapter 10 Statistical Inference about Means and Proportions with Two Populations Two population means Notations: ο1 = Mean of population 1 ο³1 = Standard deviation of population 1 ο2 = Mean of population 2 ο³2 = Standard deviation of population 2 Sample 1 = Sample taken from population 1 n1 = Sample size of sample 1 π₯Μ 1 = Mean of sample 1 s1 = Standard deviation of sample 1 Sample 2 = Sample taken from population 2 n2 = Sample size of sample 2 π₯Μ 2 = Mean of sample 2 s2 = Standard deviation of sample 2 §10.1 Inferences about the difference between two population means: ο³1 and ο³2 are known Parameter of interest = ο1 β ο2 Point estimate of ο1 β ο2 = π₯Μ 1 - π₯Μ 2 Expected value of π₯Μ 1 - π₯Μ 2 = E(π₯Μ 1 - π₯Μ 2 ) = ο1 β ο2 π2 π2 1 2 Standard error of π₯Μ 1 - π₯Μ 2 = β n1 + n2 If n1 and n2 are both large (i.e. > 30), then π₯Μ 1 - π₯Μ 2 will follow an approximately normal distribution. Confidence interval for ο1 β ο2: π2 π2 1 2 Use formula 10.4, page 410: (π₯Μ 1 - π₯Μ 2) ± Zο‘/2 β n1 + n2 Hypothesis testing: Ho: ο1 β ο2 β₯ D0 Ha: ο1 β ο2 < D0 One-tail (left) test Test statistic zcalc = Ho: ο1 β ο2 β€ D0 Ha: ο1 β ο2 > D0 One-tail (right) test Ho: ο1 β ο2 = D0 Ha: ο1 β ο2 β D0 Two-tail test (π₯Μ 1 βπ₯Μ 2 )βπ·0 2 2 n1 n2 π π β 1+ 2 ο³1 and ο³2 are rarely known if ever, so we will not work out an example. §10.2 Inferences about the difference between two population means: ο³1 and ο³2 are unknown Case 1: ο³1 β ο³2 Parameter of interest = ο1 β ο2 Point estimate of ο1 β ο2 = π₯Μ 1 - π₯Μ 2 Expected value of π₯Μ 1 - π₯Μ 2 = E(π₯Μ 1 - π₯Μ 2 ) = ο1 β ο2 π2 π2 1 2 Standard error of π₯Μ 1 - π₯Μ 2 = βn1 + n2 with df = Formula 10.7, page 416 If n1 and n2 are both large (i.e. > 30), then π₯Μ 1 - π₯Μ 2 will follow an approximately normal distribution. Confidence interval for ο1 β ο2: π2 π2 1 2 Use formula 10.6, page 416: (π₯Μ 1 - π₯Μ 2) ± tο‘/2,df βn1 + n2 Note: df is given by the formula 10.7, page 416; Data Analysis will provide this number for us. Hypothesis testing: Ho: ο1 β ο2 β₯ D0 Ha: ο1 β ο2 < D0 One-tail (left) test Test statistic tcalc = Ho: ο1 β ο2 β€ D0 Ha: ο1 β ο2 > D0 One-tail (right) test Ho: ο1 β ο2 = D0 Ha: ο1 β ο2 β D0 Two-tail test (π₯Μ 1 βπ₯Μ 2 )βπ·0 2 2 n1 n2 π π β 1+ 2 p-value: T.DIST.RT(ABS(tcalc,df) for both the one-tail tests T.DIST.2T(ABS(tcalc,df) for the two-tail test See Formula 10.7, page 416, for df We will use Excel Data Analysis command for determining the p-value. Case 2: ο³1 = ο³2 = ο³ Parameter of interest = ο1 β ο2 Point estimate of ο1 β ο2 = π₯Μ 1 - π₯Μ 2 Expected value of π₯Μ 1 - π₯Μ 2 = E(π₯Μ 1 - π₯Μ 2 ) = ο1 β ο2 Pooled variance estimate (ππ2 ) = (n1 β1)s12 +(n2 β1)S22 n1 +n2 β2 1 1 1 2 Then, the standard error of π₯Μ 1 - π₯Μ 2 = ππ βn + n with df = n1 + n2 - 2 If n1 and n2 are both large (i.e. > 30), then π₯Μ 1 - π₯Μ 2 will follow an approximately normal distribution. Confidence interval for ο1 β ο2: 1 1 1 2 Use the formula: π₯Μ 1 - π₯Μ 2 ± tο‘/2,df ππ βn + n Hypothesis testing: Ho: ο1 β ο2 β₯ D0 Ha: ο1 β ο2 < D0 One-tail (left) test Test statistic tcalc = with df = n1 + n2 - 2 Ho: ο1 β ο2 β€ D0 Ha: ο1 β ο2 > D0 One-tail (right) test Ho: ο1 β ο2 = D0 Ha: ο1 β ο2 β D0 Two-tail test (π₯Μ 1 βπ₯Μ 2 )βπ·0 1 1 n1 n2 ππ β + p-value: T.DIST.RT(ABS(tcalc,df) for both the one-tail tests T.DIST.2T(ABS(tcalc,df) for the two-tail test df = n1 + n2 - 2 We will use Excel Data Analysis command for determining the p-value. §10.3 Inferences about the difference between two population means: Matched Samples Sample 1 = Observations of a sample prior to the event Sample 2 = Observations from the same subjects as sample 1 taken after the event n = Sample size of samples 1 and 2 x1i = ith observation from sample 1 x2i = ith observation from sample 2 Define: Sample difference = di = x1i β x2i οd = Average of the population of differences ο³d = Standard deviation of population of differences βπ πΜ = Mean of the sample differences = π π β(ππ βπΜ )2 sd = Standard deviation of sample differences = β πβ1 Confidence interval for οd: πΜ ± π‘πΌ/2 ππ ββπ with df = n - 1 Hypothesis testing: Ho: οd β₯ ο0 Ha: οd < ο0 One-tail (left) test Ho: οd β€ ο0 Ha: οd > ο0 One-tail (right) test Ho: οd = ο0 Ha: οd β ο0 Two-tail test πΜ βπ0 π ββπ Test statistic tcalc = π p-value: T.DIST.RT(ABS(tcalc,df) for both the one-tail tests T.DIST.2T(ABS(tcalc,df) for the two-tail test df = n - 1 We will use Excel Data Analysis command for determining the p-value. §10.4 Inferences about the difference between two population proportions Notations: p1 = Proportion of βsuccessβ in population 1 p2 = proportion of βsuccessβ in population 2 Sample 1 = Sample taken from population 1 Sample 2 = Sample taken from population 2 n1 = Sample size of sample 1 n2 = Sample size of sample 2 πΜ 1 = Proportion of βsuccessβ in sample 1 πΜ 2 = Proportion of βsuccessβ in sample 2 Parameter of interest = p1 β p2 Point estimate of p1 β p2 = πΜ 1 - πΜ 2 Expected value of πΜ 1 - πΜ 2 = E(πΜ 1 - πΜ 2 ) = p1 β p2 π1 (1βπ1 ) Standard error of πΜ 1 - πΜ 2 = ο³p1 β p2 = β n1 + π2 (1βπ2 ) n2 πΜ 1 (1βπΜ 1 ) Estimated standard error of πΜ 1 - πΜ 2 = Sp1 β p2 = β n1 + πΜ 2 (1βπΜ 2 ) n2 πΜ 1 - πΜ 2 will follow an approximately normal distribution if all the following four conditions are true: n1p1 β₯ 5; n1(1- p1) β₯ 5; n2p2 β₯ 5; n2(1- p2) β₯ 5 Confidence interval for p1 β p2 πΜ 1 (1βπΜ 1 ) Use the formula 10.13, page 430: πΜ 1 - πΜ 2 ± Zο‘/2 β Hypothesis testing: Ho: p1 β p2 β₯ 0 Ha: p1 β p2 < 0 One-tail (left) test n1 πΜ 2 (1βπΜ 2 ) + Ho: p1 β p2 β€ 0 Ha: p1 β p2 > 0 One-tail (right) test n2 Ho: p1 β p2 = 0 Ha: p1 β p2 β 0 Two-tail test All three Ho includes p1 β p2 = 0, i.e. p1 = p2 = p, a pooled estimate for πΜ can be found for p using the following formula. Pooled estimate πΜ = n1 πΜ 1 +n2 πΜ 2 n1 +n2 1 1 1 2 Estimated standard error of πΜ 1 - πΜ 2 = Sp1 β p2 = βπΜ (1 β πΜ ) (n + n ) Test statistic zcalc = (πΜ 1 βπΜ 2 ) 1 1 n1 n2 βπΜ (1βπΜ )( + ) p-value: 1 - NORM.DIST(ABS(zcalc,1) for both the one-tail tests 2*(1 - NORM.DIST(ABS(zcalc,1)) for the two-tail test Summary of formulas Confidence interval Values of ο³1 and ο³2 are known Values of ο³1 and ο³2 are unknown Hypothesis testing π12 π22 1 2 (π₯Μ 1 - π₯Μ 2 ) ± Zο‘/2 β n + n (π₯Μ 1 β π₯Μ 2) ± tο‘/2,df β π12 n1 + Z= (π₯Μ 1 βπ₯Μ 2 )βπ·0 π22 n2 t= 1 1 1 2 π₯Μ 1 - π₯Μ 2 = ππ βn + n 2 n1 n2 and (π₯Μ 1 βπ₯Μ 2 )βπ·0 2 2 n1 n2 df = π π β 1+ 2 df from results of Data Analysis command Values of ο³1 and ο³2 are unknown; but ο³1 = ο³2 = ο³ 2 π π β 1+ 2 t= (π₯Μ 1 βπ₯Μ 2 )βπ·0 1 1 n1 n2 ππ β + Sp2 = πΜ ± π‘πΌ/2 ππ ββπ t= ο³ 12 ο³ 22 ο« = Standard error of π₯Μ 1 - π₯Μ 2 n1 n 2 ο¦ S12 S 22 οΆ ο§ο§ ο·ο· ο« ο¨ n1 n2 οΈ 1 ο¦ S12 ο§ n1 ο 1 ο§ο¨ n1 2 2 2 οΆ 1 ο¦ S 22 οΆ ο·ο· ο« ο§ ο· n2 ο 1 ο§ο¨ n2 ο·οΈ οΈ Sp = Pooled estimate of the common ο³ if ο³1 = ο³2 =ο ο³ df = n1 + n2 - 2 Matched sample Comments Where, = π₯Μ 1 - π₯Μ 2 = Point estimate for ο1-ο2 (n1 β1)S12 +(n2 β1)S22 n1 +n2 β2 with df = n1 + n2 β 2 Where d = Mean of paired differences, and Sd = Standard deviation of paired differences πΜ βπ0 ππ ββπ df = n - 1 Two population proportions Z== πΜ 1 - πΜ 2 ± Zο‘/2 β πΜ 1 (1βπΜ 1 ) n1 + πΜ 2 (1βπΜ 2 ) (πΜ 1 βπΜ 2 ) 1 1 n1 n2 βπΜ (1βπΜ )( + ) n2 where, πΜ = n1 πΜ 1 +n2 πΜ 2 n1 +n2 Where, p1 ο p 2 = Point estimate for p1 β p2 1 1 1 2 βπΜ (1 β πΜ ) (n + n ) = Standard error of p1 ο p 2 for hypothesis testing where we assume p1 = p2 = p