Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Case-Control Studies Lecture 5 June 14, 2006 K. Schwartzman MD Case Control Studies Readings • Fletcher, chapter 6 • Walker, chapter 6 [Case-Control Studies] from Observation and Inference, 1991 [course pack] Case-Control Studies - Slide 1 Objectives Students will be able to: 1. Define the term “case-control study” 2. Explain the relationship between case-control and cohort studies 3. Understand the difference between cumulative incidence and incidence density designs Case-Control Studies - Slide 2 Objectives 4. Calculate parameters which may be validly obtained from case-control studies, namely: a. Odds parameters: - odds of exposure in cases - odds of exposure in controls - odds ratio b. Risk parameters: - approximation of relative risk - attributable fraction c. Incidence rate parameters: - incidence rate ratio - attributable fraction among the exposed - attributable fraction for the population Case-Control Studies - Slide 3 Objectives 5. Indicate situations in which case-control studies permit estimation of rate differences between exposure groups 6. Highlight advantages and disadvantages of case-control studies, including key biases 7. List possible sources of controls in case-control studies 8. Identify biases which may result from different types of control selection Case Control Studies - Slide 4 Case-Control Studies Fletcher, p. 92: “Two samples are selected: patients who have developed the disease in question, and otherwise similar people who do not have not developed this disease. The researchers then look backward in time to measure the frequency of exposure to a possible risk factor in the two groups.” In other words, a study population is first assembled based on a determination as to whether subjects have or have not developed an outcome of interest. Subjects (or person-time) are then classified as to whether an exposure of interest took place. Data on other variables (e.g. potential confounders) are also obtained. Case-Control Studies - Slide 5 Walker, 1991: “Case-control studies constitute the major advance in epidemiologic methods of our time” Classic example: Doll & Hill, relationship between lung cancer and cigarette smoking (1950) Case-Control Studies - Slide 6 Advantages Useful for study of conditions that are rare and/or characterized by a long latency between exposure(s) and outcomes of interest. May be useful in evaluating the impact of multiple types of exposure. Disadvantages May be particularly vulnerable to biases arising from selection of subjects (most often of the control group), and measurement (estimation) of exposure Case-Control Studies - Slide 7 In case-control studies, data about exposure status is calculated after first determining outcome status. However, subjects may be recruited “prospectively” (concurrently), e.g.: - All persons aged 30-50 who are diagnosed with hypertension on the island of Montreal during 2006, within 2 weeks of diagnosis. - Controls recruited among persons of the same age who are newly diagnosed with appendicitis in Montreal during the same time period. Case Control Studies - Slide 8 Often, outcome status is already available for all subjects (“historical”) at the time of initiation, e.g.: - During 2006, a researcher identifies all women aged 40-50 who were diagnosed with breast cancer on the island of Montreal in 2004. - In 2006, she recruits a control group among women of the same age who had negative screening mammograms in Montreal in 2004. Case-Control Studies - Slide 9 Note that the terms “prospective” and “retrospective” are not very useful with respect to case-control studies, since data about exposure status is always retrospective (by definition). Case-Control Studies - Slide 10 Cohort and Case-Control Studies Every case control study corresponds to an underlying cohort study, which is (ordinarily) hypothetical. Example (from Doll & Hill, 1950): _____________________________________________________ Women diagnosed with lung cancer vs other diseases at 20 London hospitals Smokers Non-Smokers Total Lung cancer cases 41 19 60 No lung cancer (controls) 28 32 60 Total 69 51 120 _________________________________________________________ Crude odds ratio = odds of exposure in cases/odds of exposure in controls = (a/b)/(c/d) = ad/bc = (41x32) / (19x28) = 2.5 Case-Control Studies - Slide 11 In the corresponding cohort study, women from the same geographic area would be recruited and classified as to smoking status, then followed for the development vs non-development of lung cancer. Case-Control Studies - Slide 12 Assuming all cases of lung cancer during the period of interest were detected, one possible 2x2 table would be Lung cancer No lung cancer Total OR = 2.5 but it could also be: Lung cancer No lung cancer Total OR = 2.5 Smokers 41 859 Non-Smokers 19 981 Total 60 1,840 900 1000 1,900 Smokers 41 70 Non-Smokers 19 81 Total 60 151 111 100 211 Case-Control Studies - Slide 13 • The cases diagnosed and included, and the controls sampled, relate to the exposure experience of an underlying source population. • In each scenario, the estimated odds of cigarette smoking among women with lung cancer are 2.5 times those among women without lung cancer. • In each scenario, all cases of lung cancer were included. The size of the source population (and hence the number of non-cases) was varied. Case-Control Studies - Slide 14 Cumulative incidence case-control studies Goal is to derive estimate of relative risks (relative cumulative incidences) of outcomes among exposed vs. unexposed Design: - Cases are ascertained during a defined observation period - Controls are persons who did not become cases during the period of observation. - The underlying cohort is a fixed one (not open or dynamic). Case-Control Studies - Slide 15 Doll and Hill, 1950 Assume that the source population was as follows: 900 smokers & 1000 non smokers - followed 5 years Then the 2x2 table would be: Smokers Non-Smokers Total Cancer + Cancer - 41 859 19 981 60 1,840 Total 900 1,000 2,000 ________________________________________________ Risk of cancer in smokers: 41/900 = 0.046 Risk of cancer in non smokers: Risk ratio: 19/1000 = 0.019 0.046/0.019 = 2.4 Odds of smoking in women with cancer: Odds of smoking in women without cancer: Odds ratio 41/19 = 2.2 859/981 = 0.88 = 2.5 Case-Control Studies - Slide 16 In the corresponding case control study we take 100% of cases, but sample the controls (60/1840 or 3.3% of all potential controls - those who happened to be admitted to hospital for some other reason). Hence the new table is: Cancer + Cancer - Smokers Non smokers 100% x 41 = 41 3.3% x 859 = 28 100% x 19 = 19 3.3% x 981 = 32 Total 60 60 Total 69 51 120 _________________________________________________________ “Risk” of cancer in smokers: “Risk” of cancer in non smokers: 41/69 = 0.59 INVALID 19/51 = 0.37 INVALID The “risk ratio” from this 2x2 table is also invalid Odds of smoking among cases: Odds of smoking among controls: Odds ratio: 41/19 = 2.2 (as before) 28/32 = 0.88 (as before) 2.2/0.88 = 2.5 (as before) Case-Control Studies - Slide 17 General Form: Cumulative incidence case-control studies outcome + outcome - exposure + a c ___________ exposure b d _____________ | || total cases total controls total exposed total unexposed | total subjects Odds of exposure in cases = a/b Odds of exposure in controls = c/d Odds ratio = odds of exposure in cases ______________________ odds of exposure in controls = a/b ___ c/d = ad __ bc but: Odds of disease among exposed = a/c Odds of disease among unexposed = b/d Odds ratio = odds of disease among exposed = a/c ___________________________ ___ odds of disease among unexposed b/d = ad __ bc Case-Control Studies - Slide 18 Risk parameter estimation in cumulative incidence case-control studies: Recall that relative risk = risk of disease in exposed ______________________ risk of disease in unexposed From our 2x2 table, this is: a/(a+c) _______ = b/(b+d) a(b+d) ______ b(a+c) If the disease is rare, then a<<c and b<<d among the source population then a+c ~ c then a(b+d) ______ ~ ad __ b(a+c) bc and b+d ~ d Case-Control Studies - Slide 19 In a case-control study, it is then possible to estimate the attributable risk (fraction) among the exposed, even if the risk for the population is unknown. In a cohort study, the attributable risk fraction is: R exp - Runexp __________ Rexp = (R exp/Runexp) - (Runexp/Runexp) _______________________ Rexp/Runexp = RR-1 _____ RR In a case-control study, this is estimated by (OR-1)/OR This is the proportion of disease among exposed persons, which is attributable to the exposure Case-Control Studies - Slide 20 Hence, from Doll and Hill (1950), the estimated fraction of lung cancer among female smokers which is attributable to smoking is: 2.5 -1 ______ 2.5 = 0.6 or 60% In other words, there would have been 60% fewer lung cancers among those women, had they never smoked Case-Control Studies - Slide 21 Incidence Density Case-Control Studies The incidence density case-control study involves the implicit comparison of the person-time experience of cases and controls with respect to the exposure(s) of interest. The absolute quantity of person-time sampled - and hence the sampling fraction - is unknown. This is analogous to the situation with respect to persons in a cumulative incidence case-control study. Case-Control Studies - Slide 22 Hence the underlying (hypothetical) cohort is an open or dynamic one. Persons considered controls at one point in time may then become cases; they can then appear twice in the 2x2 table. For this cohort, the general form of the 2x2 table is: exposure + a Pe outcome + person-time Where Pe = person-time among exposed Po = person-time among unexposed IRe = a/Pe IRR = exposure b Po aPo ____ bPe and IRo = b/Po Case-Control Studies - Slide 23 Suppose that all cases are counted, but the controls are sampled with respect to person-time, with sampling fraction ”f” generating the incidence density case-control study. Then the 2x2 table is: outcome + outcome - exposure + a c = fPe Then OR = ad = afPo ___ _____ bc bfPe exposure b d = fPo = aPo ____ bPe which is equivalent to the IRR above. Case-Control Studies - Slide 24 Note that this formulation does not involve any assumptions about disease rarity. It requires that the likelihood of being sampled from the source “population” of person-time varies as a proportion of the person-time potentially “contributed” by each individual. For example: A potential control subject who was absent from the geographic area of interest during most of the accrual period should have less chance of being selected than a potential subject who was present throughout. As with the cumulative incidence design, validity hinges on the assumption that f (the sampling fraction) does not vary with exposure status. Case-Control Studies - Slide 25 Relationship between “open” cohort and incidence density case-control studies • A researcher wishes to evaluate the association between the use of nonsteroidal anti-inflammatory drugs (NSAIDS) and ventricular tachycardia (VT) • In an open cohort study lasting 2 years, subjects are recruited and classified as to exposure status (NSAID use), then followed for development of VT • In principle, it is possible to document periods of exposure and non-exposure for individuals, e.g. months on/off medication, as long as exposure is somehow reassessed Case-Control Studies - Slide 26 Then for the cohort, incidence rates and an incidence rate ratio can be calculated for the exposed vs unexposed person-time experience, e.g. VT, cases Person-years Incidence NSAID No NSAID Total 80 800 0.1/p-y 40 1200 0.033/p-y 120 2000 0.06/p-y The estimated incidence rate ratio is: 80/800 _______ 40/1200 =3 So, assuming no confounding, we estimate that the incidence of ventricular tachycardia among NSAID users is 3 times that among non-users Case-Control Studies - Slide 27 Suppose we instead devise a case-control study. Here, cases will be defined by a first diagnosis of VT at Montreal hospitals, and controls will be recruited among persons who visit the eye clinics of the same hospitals: both over a 2-year accrual period. They will be compared with respect to use of NSAIDS within the last 24 hours prior to presentation. If sampling is done correctly (e.g. the probability of selection is unrelated to NSAID use) then the controls should represent the person-time experience of the source population Case-Control Studies - Slide 28 • If a possible control spent half the accrual period on NSAIDS, and half off, he has a 50% chance of contributing to the “exposed” group and a 50% chance of contributing to the “unexposed” group • This individual will contribute one or the other, depending on the date of the visit chosen as control; but in a larger group of people, the control days sampled will reflect the proportion of exposed person-time • A person can be a control early in the accrual period and a case later • In principle, a single person can also be sampled repeatedly as a control if the time window for exposure definition is short (more complicated in terms of analysis) Case-Control Studies - Slide 29 Suppose that the case-control study includes all cases which would have been detected with the open cohort design. Two controls are recruited per case. This (unbeknownst to the researchers) corresponds to a sampling fraction for controls of 0.12 person-day sampled per person-year of follow-up that would have occurred in the open cohort. Then the 2x2 table is: NSAID No NSAID Total VT, cases No VT(controls) 80 40 120 800*0.12 1200*0.12 2000*0.12 = 96 = 144 = 240 _____________________________________________ Total 176 OR = (80x144)/(40x96) = 3.0 184 360 same as earlier IRR Case-Control Studies - Slide 30 Another example of an incidence density design: • Bronchodilators are used for the treatment of asthma • There is concern that overuse may be associated with an increased risk of adverse events, including death • Side effects can include arrhythmias, which may lead to sudden death • Suissa et al conducted a case-control study using the Saskatchewan health insurance database • They identified 30 persons prescribed anti-asthma medications who died of cardiovascular events, rather than of asthma; the date of death was termed the index date Case-Control Studies - Slide 31 • 4080 control days were then sampled randomly from the 574,103 person-months of follow-up for the entire asthmatic group; each such day was also an index date • Cases and controls were then compared as to use of theophylline and beta-agonists during the 3 months preceding the index date • These were the main exposures of concern Case-Control Studies - Slide 32 Questions for discussion: • Why do you think the researchers chose this study design? • What would have been the corresponding cohort study? Case-Control Studies - Slide 33 With respect to the relationship between theophylline use and sudden cardiac death, the authors found the following: Theophylline in last 3 months Cardiac Death Yes No Yes 17 956 No 13 3124 | | | Total 30 4080 Note that numbers in table refer to person-days (not to persons) OR (crude) = ad __ = bc 17 x 3124 ________ 13 x 956 IRR (crude) = 4.3 (2.1 - 8.8) = 4.3 (2.1 - 8.8) Case-Control Studies - Slide 34 The odds of recent theophylline use among persons aged 5-54 years prescribed anti-asthma drugs who died of cardiovascular events were 4.3 times those among other persons in the same age range who were also prescribed anti-asthma drugs, but did not die. “Asthmatics” aged 5-54 who are prescribed theophylline have an estimated 4.3 fold increase in incidence of fatal cardiovascular events, compared with “asthmatics” who are not prescribed theophylline. Case-Control Studies - Slide 35 As with the cumulative incidence design, an attributable rate fraction can be estimated for exposed persons: It is: I____ e-Io, Ie where = IRR -1 = ______ IRR Ie = incidence among exposed and Io = incidence among the unexposed OR -1 _____ OR For the Saskatchewan study, the estimated attributable rate fraction among “asthmatics” who were prescribed theophylline is: 4.3 - 1 = 0.77 ______ 4.3 Among “asthmatics” aged 5-54 prescribed theophylline, an estimated 77% of fatal cardiovascular events were related to its prescription. Case-Control Studies - Slide 36 It is also possible to estimate the attributable rate fraction for the entire population (PAR%) In a cohort study, this is simply I_____ t - Io, It where It = incidence among the total population Io = incidence among the unexposed For the corresponding incidence density case-control study, the population attributable rate fraction is IRR ____- 1 x proportion of cases who were exposed, IRR estimated as OR -1 x _____ OR a ____ a+b Similar parameters involving risk can be generated for the cumulative incidence design Case-Control Studies - Slide 37 For the Saskatchewan study, recall the 2 x 2 table Theophylline in last 3 months Cardiac death Yes No Yes 17 956 No 13 3124 | | | Total 30 4080 OR = 4.3 Pexp |case = 17/30 = 0.57 then PAR fraction = OR -1 x Pexp |case _____ OR = 4.3 - 1 x 0.57 = 0.44 ______ 4.3 Among Saskatchewan “asthmatics” aged 5-54, an estimated 44% of cardiovascular deaths relate to theophylline prescriptions. Alternatively, had theophylline never been prescribed, 44% fewer deaths would have occurred among “asthmatics.” Case-Control Studies - Slide 38 Attributable rates (rate difference) The absolute rate difference (i.e., the absolute rate of disease attributable to exposure) is Ie - Io Data from a standard case-control study alone cannot validly be used to estimate absolute rates of disease. Even if case ascertainment is complete, the controls represent an unknown and arbitrary fraction of the true person-time at risk. Hence the rate difference cannot be estimated. Case-Control Studies - Slide 39 However, incidence rates can be estimated if there is additional knowledge about the amount of person-time at risk Exposure Disease (+) Disease (-) (+) (-) a c = f x x Pt b d = f x (1- ) x Pt Then Ie = a = ___________ a _____ x Pt [c/(c+d)] x Pt Then Io = b =___________ b _________ (1- ) x Pt [d/(c+d)] x Pt and the rate difference is Ie-Io where = proportion of person-time which is exposed Case-Control Studies - Slide 40 Example: In this nested case-control study, the researchers knew that in the source cohort (Saskatchewan “asthmatics” aged 5-54), there were 47,842 person-years at risk during the study period The 2x2 table was: Cardiac death Yes No Theophylline in last 3 months Yes 17 956 No 13 3124 | | | Total 30 4080 Case-Control Studies - Slide 41 Then the estimated incidence of cardiac death in “asthmatics” prescribed theophylline (Ie) is: a ___________ [c/(c+d)] x Pt = ________________ 17 956/4080 x 47,842 = 0.0015 per person-year And in “asthmatics” who were not prescribed theophylline the estimated incidence (Io) is: b = ___________ [d/(c+d)] x Pt 13 = 0.00035 per person-year _________________ 3124/4080 x 47,842 The estimated rate difference is therefore 0.0015-0.00035 = 0.00115 per person-year. Note that the IRR computed as Ie/Io remains 4.3 Case-Control Studies - Slide 42 Ie and Io may also be estimated if It is known for the source population Recall that It = (Ie x ) + [Io x (1- )] But Ie = Io x OR Then It = Io [(OR x ) + (1- )] So Io = It = It ______________ ________________________ (OR x ) + (1- ) {OR x [c/(c+d)]} + [d/(c+d)] Then use Ie = Io x OR Then RD = Ie - Io as usual [= Io (OR-1)] Case-Control Studies - Slide 43 Example: The total incidence (It) of cardiovascular death in the Saskatchewan cohort was 30 deaths/47,842 person-years = 0.00063 per person-year. Then Io = 0.00063 ___________________________ [4.3 x (956/4080)] + (3124/4080) and Ie = 0.00036 x 4.3 = 0.0015 RD = 0.0015 - 0.00035 = 0.00115 = 0.00035 Case-Control Studies - Slide 44 Additional points Corresponding estimates of attributable risks and risk differences can be made for cumulative incidence case-control studies, if the corresponding additional data is available Estimates of absolute risks/incidence rates and risk/rate differences can be made only if the total amount of persons/person-time at risk is known, or at least one absolute risk/incidence rate is known (i.e. for the total population, the exposed, or the unexposed) Nested case-control studies are a special type of study where cases and controls are explicitly drawn from a defined larger cohort (as in the Saskatchewan asthma study) Case-Control Studies - Slide 45 Case-Control Studies: Strengths and Limitations Advantages of case-control studies: Efficiency - much less expensive/intensive than cohort studies. Very useful for outcomes that are rare or occur after a long latency period. Most outcomes are relatively rare over short-term follow-up. Permit evaluation of multiple exposures. Can rapidly “accrue” person-time experience. Avoid losses to follow-up inherent in cohort studies. Case-Control Studies - Slide 46 Disadvantages • Not useful/efficient for very rare exposures (may not be present in either cases or controls). • Cannot directly compute incidence rates. • Cannot usually evaluate more than one outcome. • Temporality may be lost or distorted. • Potential for considerable bias, i.e. loss of validity. Bias relates to: - Measurement of exposure status - Selection of subjects (usually controls) Case-Control Studies - Slide 47 With respect to measurement, exposure ascertainment must be consistent for cases and controls. There may be potential for misclassification of exposure in relation to disease status Case-Control Studies - Slide 48 Example 1 Differential recall of exposures among cases vs controls e.g. medication use and congenital malformations - particularly if mothers “attuned” to study hypothesis. If cases more likely to recall exposure, results will be biased toward a positive association between exposure and outcome. The more objective the source of exposure data, the better. Case-Control Studies - Slide 49 Example 2 Different sources of information about exposure e.g. family members asked about alcohol consumption of persons who died of gastric cancer, vs direct questioning of control subjects. If family members tend to underestimate cases’ alcohol consumption, results will be biased against finding a positive association between alcohol and gastric cancer. Case-Control Studies - Slide 50 Example 3 Exposure status changes as a consequence of the outcome e.g. patients with symptoms of lung cancer stop smoking If patients with newly diagnosed lung cancer are compared to controls with respect to current or recent smoking, results may be biased, i.e., the association between smoking and lung cancer will be underestimated. Data collection must reflect relevant person-time experience and temporality of exposure and outcome. Case-Control Studies - Slide 51 Association may also be missed if the exposure of interest is poorly documented (an example of non-differential misclassification) Example: mesothelioma It can be caused by brief, intense exposures to asbestos, with a very long latency period (>30 years). In a case control study, both cases and controls may recall such exposures very poorly, thereby leading to an underestimate of the true association. Case-Control Studies - Slide 52 Control selection in case-control studies Recall that the validity of case-control studies hinges on the assumption that the sampling fraction for cases (which may be 100%) and that for controls (usually unknown) does not vary by exposure status. In other words, controls should represent the source population from which the cases arose, with respect to exposure experience. Case-Control Studies - Slide 53 Example 1 A researcher wishes to test the hypothesis that use of nonsteroidal anti-inflammatory drugs (NSAIDs) is associated with development of gastric cancer. She plans a case-control study comparing gastric cancer patients (cases) with patients seen at the same hospital for peptic ulcer disease (controls). - NSAID use is a known risk factor for ulcers. What will be the effect on her findings: a) if NSAID use is truly a risk factor for gastric cancer? b) if NSAID use is truly unassociated with gastric cancer? Case-Control Studies - Slide 54 Hence, controls should not differ systematically from the population of interest with respect to exposure experience. Sometimes the bias may be less obvious, i.e. unrelated to explicit criteria for control selection. Case-Control Studies - Slide 55 Example 2 A researcher wishes to evaluate the association between cell phone use and brain tumours using a case-control design. Cases are recruited from the brain tumour clinic at the Royal General Hospital, a neurosurgery referral centre. Controls are recruited from the family medicine clinic at the same hospital. This clinic primarily serves a low-income population from the area adjacent to the hospital. This control group is less likely than the general population to own cell phones. Result: The study will be biased toward detecting an association between brain tumours and cell phone use. Case-Control Studies - Slide 56 Controls should be at risk for developing the outcome of interest - otherwise they do not contribute useful data to the study (inefficient) - inclusion of individuals not at risk may also distort the results if the reason they are not at risk relates to the exposure under study. This may not be obvious. Example: Sleep apnea (exposure) Cases: and risk of traffic accidents (outcome) Drivers involved in car accidents. Including non-drivers in the control group would be a waste of time - it could bias the results if persons with severe apnea have chosen not to drive and are over-represented in the control group. Case-Control Studies - Slide 57 Controls should be persons who, had they developed the outcome of interest, would have had the same opportunity as the actual cases to be included as such. Similarly, cases should have had the same opportunity as actual controls to be included, had they not developed the outcome of interest. If this is not the case, controls may not properly represent the source population. e.g., study of brain tumours and cell phone use discussed above Case-Control Studies - Slide 58 Types of controls in case-control studies 1. Population Controls Suitable if cases are a representative sample (or all cases) arising from a well-defined source population. Controls are then randomly sampled from the same population. With the incidence-density design, the probability of being sampled should vary with an individual’s person-time at risk. Often, it is not easy to define the precise source population. Case-Control Studies - Slide 59 2. Neighbourhood Controls May match controls to individual cases with respect to neighbourhood of residence. If cases are from a hospital, their neighbours may or may not be equally likely to be treated at the same hospital should they develop the disease in question. Example: A hospital which caters to a particular group within society. Case-Control Studies - Slide 60 3. Family members or friends as controls May share exposure characteristics with cases as opposed to broader source population (e.g. tobacco and alcohol use, dietary intake, use of household products). This can obscure relevant associations. Depends on information provided by cases; investigator loses control over factors leading to selection. Cases’ friends may overlap, leading to disproportionate probabilities of selection of certain individuals as controls. Case-Control Studies - Slide 61 4. Hospital/clinic based controls Often used when cases accrued at specific hospital(s)/clinic(s). Controls are recruited among persons seen at the same hospitals/clinics for other reasons or conditions. To avoid bias, the basis for control selection cannot be related to the exposure under study. The incidence of the “control” condition(s) determines the sampling fraction. Case-Control Studies - Slide 62 Example: A researcher wishes to examine the relationship between anti-hypertensive medication use and car accidents. What will happen if controls are recruited in the cardiology clinic? Case-Control Studies - Slide 63 The best hospital controls are persons with acute conditions that consistently require hospital care but are not related to the exposure of interest. Example: In a case control study of smoking as a risk factor for colon cancer, a researcher recruits controls who undergo appendectomy, prostatectomy, or hysterectomy at the same hospital as the cases. Supplemental Material - Slide 1 Derivation of formula - Part 1 For the cohort study, the 2 x2 table is: Cases Person-time IRR = Ie ___ Io = exposed a Pe unexposed b Po a/Pe aPo _____ b/Po = a(P t - Pe) _______ bPe = a _ x b = = ____ total a+b Pe + Po = Pt bPe a (1 - Pe/Pt) _________ b (Pe/Pt) (1-) ____ Where = Pe/Pt = the proportion of person-years with exposure among total person-years in the source population Supplemental Material - Slide 2 Furthermore, a _ = b a/(a+b) _______ b/(a+b) = = P exp|case __________ 1- Pexp|case where Pexp|case = proportion of cases exposed Then IRR = P exp|case (1- ) _____________ (1-Pexp|case) Equation 1 Supplemental Material - Slide 3 Derivation of formula - Part 2 if = proportion of person-years with exposure then 1- = proportion of person-years without exposure and It = Ie + Io (1- ) i.e. a weighted average of incidence rates among exposed and unexposed persons Supplemental Material - Slide 4 Then the PAR fraction is: I_____ t - Io It = = (I e ) + [(Io (1- )] - Io ____________________ (Ie ) + [Io (1- )] (Ie/Io) + (1- ) (Io/Io) - Io/Io ______________________________________ (Ie/Io) + (Io/Io) (1- ) = (IRR) + 1 - - 1 ________________ (IRR) + 1 - = (IRR - 1) ____________ (IRR - 1) + 1 Supplemental Material - Slide 5 Derivation - Part 3 = IRR -1 _____________ IRR + (1/ ) - 1 = IRR -1 ____________ IRR + (______ 1- ) = IRR -1 ______________ IRR + IRR (1- ) _________ IRR () Supplemental Material - Slide 6 Substituting equation 1 for IRR, this is IRR -1 ____________________________ IRR + _______________________ IRR (1- ) () (1 - Pexp |case) () (Pexp |case) (1- ) = IRR -1 __________________ IRR + _____________ IRR (1-Pexp |case) Pexp case = IRR -1 ______________________________ IRR (Pexp |case) + IRR - IRR (Pexp |case) ______________________________ Pexp case = IRR -1 x ______ IRR Pexp |case = OR -1 x Pexp |case ____ OR