Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Sample Size Determination In the Context of Hypothesis Testing Sample Size2: 1 Recall, in context of Estimation, Sample Size is based upon: • the width of the Confidence interval: • The confidence level (1 – a) confidence coefficient, z1-a/2 • The population standard error, s/n w ) ( x – z1-a/2(s/n) x + z1-a/2(s/n) x 4 z1a / 2 s 2 n w2 2 Sample Size2: 2 Sample size in Context of Hypothesis Testing: • Need to consider POWER as well as confidence level Example: Suppose we have a hypothesis on one mean: Ho: mo = 100 vs. Ha: mo 100 s = 10 a = .05 If the true mean is in fact ma = 105, what size sample is required so that the power of the test is (1b) = .80 ? Sample Size2: 3 For our hypothesis test, we will reject Ho for x greater than C1 or less than C2 a/2 = .025 a/2 = .025 mo=100 C2 moZ1-a/2(s/n) C1 mo+Z1-a/2(s/n) Sample Size2: 4 Let’s look at these decision points (C1 and C2) relative to a specific alternative. Suppose, in fact, that ma = 105. We will reject Ho • if x is greater than C1 • or x is less than C2 b Pr( fail to reject H o | m 105) C2 Distribution based on Ha C1 ma=105 Sample Size2: 5 b Pr( fail to reject H o | ma ) .20 Pr(C2 x C1 ) | ma ) We want b = .20 for power=.80 Pr( x C1 ) | ma ) Pr( x C2 ) | ma ) C1 ma C2 ma Pr zb Pr zb s/ n s/ n We want b = .20 zb = -.842 C2 C1 ma Sample Size2: 6 Note for sample size determination: a, b are set by the investigator Both a specific null (mo) and a specific alternative (ma) must be specified we assume that the same variance s2 holds for both the null and alternative distributions a/2 a/2 b m0 C1 mo+z1-a/2(s/n) ma C1 maz1b (s/n) Sample Size2: 7 We now have: C1 mo + z1a / 2 C1 in terms of s n both the Ho and Ha distributions: z1 b C1 ma s C1 ma z1 b s/ n n Setting these equal: mo + z1a / 2 s n ma z1 b s n Then solve for n. Sample Size2: 8 Sample Size is then: 2 z + z s 1a / 2 1b 2 n m a mo 2 Note: • Always use positive values for z1-a/2 and z1 b we defined CI using positive z) Sample Size2: 9 In our example: s = 10 a = .05 z1-a/2 = 1.96 b = .20 z1 b = .842 mo = 100 ma = 105 2 z + z s 1a / 2 1b 2 n m a mo 2 2 1.96 + .842 10 2 n 105 100 2 31.39 Or a sample size of n=32 is needed. Sample Size2: 10 If we change the desired power to 1b = .90: b = .10 z1 b = 1.28 2 1.96 + 1.28 10 2 n 105 100 2 41.99 Or a sample size of n=42 is needed. Sample Size2: 11 In the context of hypothesis testing, sample size is a function of: s2, the population variance a = .05 z1-a/2 , Type I error rate b = .20 z1 b , Type II error rate Distance between mo , hypothesized mean and ma , a specific alternative 2 z + z s 1a / 2 1b 2 n m a mo 2 Sample Size2: 12 Using Minitab to estimate Sample Size: Stat Power and Sample Size 1-Sample Z Difference between mo and ma s Desired power (separate by spaces if entering several) 2-sided test Sample Size2: 13 Power and Sample Size 1-Sample Z Test Testing mean = null (versus not = null) Calculating power for mean = null + difference Alpha = 0.05 Sigma = 10 Sample Difference Size 5 32 5 43 Target Power 0.8000 0.9000 Actual Power 0.8074 0.9064 Sample Size2: 14 Sample size and power for comparing means of 2 independent groups. In the example comparing LOS for elective vs. emergency patients, we observed a difference between sample means of 3.3 days – but found that this was NOT statistically significantly different from zero. However 3.3 days is a large, expensive difference in length of stay. Our data had relatively large observed variance, and small n. • What was the power of our study to detect a difference of 3.3 days? • What sample size would be needed per group to find a difference of 3 days or more significantly different from zero? Sample Size2: 15 In Minitab: Stat Power and Sample Size 2-sample t To evaluate power: 1. Enter sample sizes 2. Enter observed difference in means 3. Enter standard deviation s Set a and 1or 2- sided test, using options menu. Sample Size2: 16 In Minitab: Stat Power and Sample Size 2-sample t Power and Sample Size 2-Sample t Test Testing mean 1 = mean 2 (versus not =) Calculating power for mean 1 = mean 2 + 3.3 Alpha = 0.05 Sigma = 10 Sample Size Power 14 0.1342 11 0.1142 Note: Minitab assumes equal n’s for the 2 groups, and only gives space for one value of s Clearly, our power to detect a difference as large as 3.3 days was only about 12% -- not very good! Sample Size2: 17 In Minitab: Stat Power and Sample Size 2-sample t To estimate sample size: 1. Enter desired power s 2. Enter desired significant difference in means 3. Enter standard deviation Sample Size2: 18 Power and Sample Size 2-Sample t Test Testing mean 1 = mean 2 (versus not =) Calculating power for mean 1 = mean 2 + 3 Alpha = 0.05 Sigma = 10 Sample Size Target Power Actual Power 176 0.8000 0.8014 235 0.9000 0.9007 Note: sample sizes are per group. If this seems excessive – is your estimate of the standard deviation reasonable? I used the larger of the 2 observed SD here. You might want to compute a pooled SD, and try that. Sample Size2: 19