Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Correlative Studies in Clinical Trials Janet E. Dancey, MD Objectives • To define biomarker and correlative study • To describe the types of biomarkers • To understand the issues in designing appropriate biomarker studies • To understand the roles of biomarkers in phase I, II and III studies • To understand different biomarker trial designs What is a Biomarker? • Biomarker – A characteristic that is objectively measured and evaluated as an indicator of normal biologic processes, pathogenic processes, or pharmacologic responses to a therapeutic intervention Biomarkers Definitions working Group National Institutes of Health 2001 • Assay – A method for determining the presence or quantity of a component • Test – A procedure that makes use of an assay for a particular purpose Good biomarkers ≠ Good Assays ≠ Tests What is a Correlative Study? • A type of study that tests for a relationship between a condition and a potential causal factor of the condition • Addresses the question: – What is the correlation between the X and Y? • • • • Cancer and obesity QoL score and treatment Response and Performance status Prognosis and KRAS plan2005.cancer.gov/glossary.html Types of Clinical Biomarkers Diagnosis •Confirmation •Staging •Subtyping Prediagnosis Pretreatment •Risk •Screening •Early detection •Prognostic •Predictive Intratreatment Posttreatment •Early •Early response endpoint or futility •Recurrence/ •Toxicity progression monitoring monitoring 5 Biomarkers and Correlative Studies • Biomarker is what is measured • Correlative study is what is correlations are made between the biomarker and something else • In recent years many biospecimen based biomarkers have been incorporated into cancer therapeutic trials • Principles of design, measurement, data collection and analysis are the same. – Eg hypothesis, trial design, measurement, collection, analysis Case Study – A Novel Targeted Agent You have been following the preclinical and clinical research. It is your opinion that the development of a class of anticancer drugs could be facilitated by incorporating laboratory and imaging correlates early into drug development. The sponsor wants investigators to develop a trial to rapidly translate exciting results seen in animal models into the clinic and that incorporates laboratory and imaging studies that would define the optimal dose and predict activity of this novel agent. What do you do? Get the Right Team • • • • • • Scientist, Clinician, Pathologist, Radiologist, Trialist, Statistician, • Who is important? – – – – – CRAs Treating nurses Booking team Pathology technicians Scanning technicians Who is critical? The patients! Have the Right Core Facilities • • • • • Clinical trial intervention Imaging Pathology Specimen management Data management Have the Right Tools • • • • • • SOPs and training, Protocol and consent forms, Case report forms, Reagents, technologies for analysis Q/C Etc, etc, etc Know the limitations of Team, Cores and Tools! Considerations for Biomarkers • What is the question? (is it a good one?) • Biomarker(s) • Assay/technology • Specimen • Study/Trial Design • Study Execution – Trial and Biomarker (s) • Study Outcome – Trial and Biomarker(s) • Likely Impact • Economically and practically feasible Biomarker, Assay, Specimen Biomarker is relevant to the intervention outcome Assay reliably measures biomarker in the specimen Specimen is reasonable for measuring the biomarker Integrating Biomarkers into Clinical Research: Biomarker • Why this/these biomarker(s)? – What is the question/purpose? • (Is there one?) • (Is it an important one?) – Determining potential “value-added” • Rationale and supporting data – Laboratory experimentation – relevance to disease and/or therapy – Clinical evaluation – prevalence and significance in normal and cancer patients Integrating Biomarkers into Clinical Research: Samples • Will the Samples Be Adequate? – Consider: • • • • • Type, Number, Timing, handling, shipping, storage Demands on patients/clinic/laboratory staff Technical requirements of the proposed analysis Likely impact/plan for handling missing/inadequate specimens Costs and resources Specimen Characteristics • Format • Serum, FFPE or fresh/frozen tissue, etc. • Biopsy, surgical specimen, etc. • Collection • Preservation • Storage conditions & time prior to assay 16 Integrating Biomarkers into Clinical Research: Assay • Why this technology? • Analytical/Laboratory Validation (Laboratory performance) – How well are you measuring the measurand? • • • • • Precision / Reproducibility Method Comparison LoB, LoD, LoQ Linearity Stability – Clinical Laboratory Standards Institute (CLSI) http://www.nccls.org/ – Is it fit for proposed purpose i.e. on the proposed samples/patients? • Clinical Validation (“Clinical Qualification”) – Does the test have clinical utility? – Does it have added value over standard tests? • FDA guidance : “Statistical Guidance on Reporting Results from Studies Evaluating Diagnostic Tests” issued in final form in March, 2007, http://www.fda.gov/cdrh/osb/guidance/1620.html Assay Methods • Detailed protocol • Reagents or kits • Quantitation method (e.g., manual, image analysis) • Scoring & reporting • QC procedures & reproducibility assessments • Marker evaluations blinded to patient characteristics and clinical endpoints • Number and training of persons executing and reading tests Intratumoral heterogeneity of carbonic anhydrase IX (CAIX) Effect of distributional heterogeneity on the analysis of tumor hypoxia based on carbonic anhydrase IX VV Iakovlev, M Pintilie, A Morrison, et al Laboratory Investigation (2007) 87, 1206–1217 a) Immunoperoxidase staining for CAIX in a single tissue section. Analysis of the entire section gave a value of 10.8% CAIX labeling. The circles limit the analysis to 0.6 mm simulated tissue microarray (TMA) cores, and show a wide range in CAIX (for publication purpose only, the image was digitally enhanced to better visualize CAIX areas). pSer473-Akt antibody in human gastroesophageal tumors and HT-29 colon cancer xenografts measured by immunohistochemical staining. Baker, A. F. et al. Clin Cancer Res 2005;11:4338-4340 A, patient tumor samples. 1 and 2 are two surgically resected specimens and 3 and 4 are two biopsy specimens. B, HT-29 human tumor xenografts excised from scid mice and kept at room temperature for the times shown. Each section also includes in the upper right-hand quadrant an on-slide control of HT-29 colon cancer cells stained for pSer473-Akt. Copyright ©2005 American Association for Cancer Research Integrating Biomarkers into Clinical Research: Samples • Will the samples be adequate? – Consider: • • • • • Type, Number, Timing, handling, shipping, storage Demands on patients/clinic/laboratory staff Technical requirements of the proposed analysis Likely impact/plan for handling missing/inadequate specimens Costs and resources • Can the question still be answered? Considerations • Flow of patients through study • Reasons for patient/specimen dropout • Numbers and events • Number of test outcomes at each analysis stage and in each subgroup & missing values Biomarkers in Clinical Research: Uses • Phase I – proof of target inhibition after reaching biologically active dose/concentration • Phase II – predictive marker assessment after identifying promising level of activity; studies should be larger and randomized • Phase III – prospective testing of biomarker and treatment • Phase IV – prospective testing of biomarker and treatment Markers and Endpoints for Trials of Molecularly Targeted Agents Changes in protein function/phosphorylation Biochemical Modulation of Target Pathway Protein phosphorylation/function Expression array Markers of Biological/Cellular Response Proliferation, Cell cycle phase, Apoptosis, Angiogenesis, Metabolism Clinical Response Objective Tumor Response Prolonged Stable Disease Clinical Benefit Survival, quality of life Phase 3 Biochemical Modulation of Target Phase 2 Blood and tissue levels Phase 1 Pharmacokinetics Phase 1 Trials: Considerations • Primary goal: To identify an appropriate dose/schedule for further evaluation Small patient • Design principles: numbers – Maximize safety – Minimize patients treated at biologically inactive doses – Optimize efficiency • Study population: – Patients for whom no standard therapy Heterogenous Refractory Tumours Expect target modulation but not anti-tumour activity Probability of Success in Assessing Pharmacodynamic Effect in Biopsies • It is certainly not 100%, and may be much lower • Analogy to chemical synthesis: yield < 100% at each step • Recognize potential problems and repair ASAP to maximize probability of success. Acquired on time? Handled Sample acquired? Time documented? properly? Target correct? Assay works? Stored properly? Bad karma? P = 0.92 = 0.81, 0.93 = 0.72, 0.95 = 0.53 … Shipped properly? Biomarkers in Phase 1 Trials: Utility • Proof of mechanism (drug hits proposed target) • When toxicity may be insufficient to determine active dose/schedule – Unlikely to occur at dose/exposure that affects the target – Due to off target effects and effects on target are uncertain – To target a specific degree of target inhibition to avoid significant toxicity • When pharmacokinetics may be insufficient to determine active dose/schedule – Assay lacking – Pharmacokinetics in plasma does not match effect in tissues Phase 1 Trial Assessment of Target Effects Requirements • Agent with acceptable preclinical activity, toxicology, pharmacology • Known association of target effect and tumour activity • Well-characterized assay • • • • • • % change in target or target level associated with efficacy Concentration/exposure required for target effect Time course for effect on target, duration, recovery Threshold of detection and CV of target measurements Target effect on tumour vs other tissues (eg PBMC, Skin, Buccal) Collection, processing, shipping, storage effects known/optimized • Usual target values and variability in human tissue known • Patients’ tumours have relevant target • Commitment from investigators/patients • mandatory requirement (just like PK) Biomarker Studies in Phase 1 Trials • MGMT activity after O6-benzylguanine • Friedman H et al J Clin Oncol 16:3570-5, 1998; Spiro et al. Cancer Res 59:2402-10 1999; Dolan et al Clin Cancer Res 8:2519-23, 2002 • 20S proteosome inhibition after bortezomib • Lightcap E et al. Clin Chem 46:673-683, 2000; Adams J, Oncologist 1: 9-16, 2002; • DCE-MRI after PTK787/valatanib • Galbraith S et al NMR in Biomed 15:132-142, 2002; Morgan, B. et al. J Clin Oncol; 21:3955-3964 2003; • S6K inhibition after everolimus • Tanaka C et al J Clin Oncol 26:1596-1602, 2008 • PARP Inhibition after ABT-888 • Kinders RJ, et al. Clin Cancer Res. 2008 Nov 1;14(21):6877-85 Biomarkers in Phase II/III Prognostic versus Predictive Markers • Prognostic markers provide information about the patient’s cancer outcome – e.g., cytogenetics in acute leukaemia; performance status in lung cancer • Predictive markers provide information regarding the probability of benefit or risk from a specific therapy • Markers may correlate with prognosis and treatment outcome – ER, HER2 in breast carcinoma – KRAS in colorectal carcinoma – MGMT in glioblastoma Prognostic Marker Measurement associated with clinical outcome in absence of therapy or with application of standard therapy that all patients are likely to receive. Examples: Oncotype DX, uPA/PAI-1 by ELISA in breast cancer Correlation with outcome not necessarily sufficient to impact clinical decisions, may suggest targets for therapy Good prognosis don’t need adjuv. Rx Useful prognostic information? Hazard ratio = .56 Hazard ratio = .18 31 Prognostic Marker Study Design Marker + All patients Compare clinical outcomes Marker assay Marker - • Standard statistical methods such as log rank test or Cox PH regression assume that study subjects constitute a random sample. • If sampling is stratified or based on outcome, prognostic effect estimates can be biased. MGMT is Prognostic • KM Estimates of Overall Survival by MGMT Methylation Status • Hegi et al NEJM 352:997-1003 2005 MGMT is Prognostic • Hegi et al NEJM 352:997-1003 2005 Predictive Marker Measurement associated with response or lack of response to a particular therapy. Examples: ER/PgR for endocrine Rx benefit in breast cancer Statistical wisdom: Test for treatment by marker interaction Qualitative interaction • New drug better for M+ (h.r. = 0.44) • Control drug better for M (h.r. = 1.31) • Interaction = 0.44/1.31 = 0.33 Quantitative interaction • New drug better for M+ (h.r. = 0.44) • New drug better for M (h.r. = 0.76) • Interaction = 0.44/0.76 = 0.58 MGMT is Predictive • KM Estimates of Overall Survival by MGMT Methylation Status and Treatment Hegi et al NEJM 352:997-1003 2005 Hegi et al NEJM 352:997-1003 2005 Trials Designs: Prospective and Retrospective Evaluation of Biomarkers Prospective Study Marker + Rx Outcome Rx Outcome Histology Marker - Retrospective Study Marker + Good Outcome Histology Marker - Rx Marker + Poor Outcome Marker - Biomarkers to Select Patients: Prospective Evaluation • Advantage – Fewest numbers of patients – Study design guaranteed to have sufficient power to show treatment effect in marker group • Disadvantage – Must know marker to select patients – Rapid turnaround to determine eligibility Biomarkers to Select Patients: Retrospective Evaluation • Advantages – – – – Maximize accrual Need not know the right marker Allows refinement of marker/assay while trial ongoing Allows assessment in marker+/- groups • Disadvantages – Risk of insufficient numbers within marker group(s) • Prevalence of different marker defined subgroups – Collection of samples compromised • Incomplete submission, suboptimal handling – Results may not be generalizable • Bias sampling Retrospective Analysis with Incomplete Specimen Collection • RTOG study 86 -10. A randomized study of RT versus RT + TAB in prostate cancer patients. • To assess its prognostic value, p53 expression was assessed on 129 of 456 patients entered. • Patients with p53 assessed had similar distributions of treatment and prognostic factors as those who did not have p53 assessed. • Statistically significant associations were found between the presence of abnormal p53 protein expression and poor outcome. Data from Grignon et al. J Natl Cancer Inst. 1997;89:158–165. Pajak et al. Arch Pathol Lab Med. 2000;124:1011–1015 Retrospective Analysis with Incomplete Specimen Collection • The survival of those patients who had a p53 determined on their tumour was statistically worse than those without a p53 determination (P = .03) regardless of the actual p53 assay result. Data from Grignon et al. J Natl Cancer Inst. 1997;89:158–165. Figure from Pajak et al. Arch Pathol Lab Med. 2000;124:1011–1015 Biomarkers for Phase 2/3: Predictive markers • Goal: identification of patients likely to benefit (or elimination of those least likely to benefit) • Considerations: – Drug activity – Treatment effect across patient subsets. – Prevalence of the subset(s) of patients with “sensitive” disease or at risk for toxicity. – Assay performance i.e failure rate, sensitivity/specificity/predictive value. – Samples requirements – Trial design to distinguish treatment and prognostic effects Caveats for Assessing Treatment Effects in Biomarker Defined Groups • Biomarker may define a subgroup with a different prognosis from historical outcome data from trials done in an unselected group • E.g. ER+ is both prognostic and predictive – If the outcome with standard treatment is not well defined and/or the outcome of interest is PFS/OS consider a randomized controlled phase 2 design • If a trial is designed to assess treatment effects in Marker+ and Marker- groups – False positives will dilute effect in marker+ group – False negatives will dilute the apparent differences in treatment effect between marker defined groups. – Specimen loss or assay failure will increase the sample size – Trial may be 2-4x size of a conventional study Phase 2 or 3 Trial – Histologically Defined and Biomarker Defined Patient Populations Initial Selection Strata Agent Marker + Histology Stage Randomize Target Tested Marker - Control Agent Outcome Phase 3: Survival (Phase 2: ORR, TTE) Control • Trial is designed to assess treatment effects in Marker+ and Marker- groups • Marker assessment • If negative within marker groups, – NB: need not be prospective stratification – Assay failure increases number of patients screened – False results will dilute differences in effects between marker groups – Analyze between treatment groups – Analyze for new markers, cut-points etc. Predictive Marker Study Design Marker-guided Vs. Randomized Design Randomize To Use Of Marker Versus No Marker Evaluation Marker Determined Treatment All Patients M+ New Drug M− Control New Drug Randomize Treatment Control • Provides direct measure of patient willingness to follow marker-assigned therapy • Marker guided treatment may be attractive to patients or clinicians • Inefficient compared to completely randomized or randomized block design Predictive Marker Study Design • The randomized designs offer the greatest flexibility to examine multiple markers. • Require no a priori assumptions, and are more efficient than marker-guided designs. • False positive, false negatives will make treatment outcomes in marker defined groups appear alike. • Larger studies Sample Size Considerations for Predictive Marker Studies • Testing for an interaction between marker status and treatment • Power depends on – Number of EVENTS – AND their distribution into marker-by-treatment categories – Magnitude of effect (e.g., ratio of hazard ratios or difference of survival differences) – Significance level (e.g., usually 0.05) • Test of interaction typically requires 2-4 times as many events as test for treatment main effect Power Problems • Embedding prognostic and predictive questions in “large” treatment trials – Events sufficient for answering treatment question, may be insufficient for prognostic or predictive questions in marker defined subgroups – Specimen retrieval and assay failures exacerbate sample size problem • If initial trial does not provide definitive answer, may not be able to prospectively test marker question Overcoming Sample Size Limitations • Combine over multiple small studies – Patients and assays comparable? – Identify relevant studies? • Publication bias • Description limited • REMARK reporting guidelines (McShane et al, 2005: BJC, EJC, JCO, JNCI, NCPO) might help facilitate pooled analyses • Some large prospective marker trials will be needed – MINDACT – TAILORx – N0723 • Reduce noise in marker measurements to lessen attenuation of marker effects – Assay improvements Statistical Analysis Methods • Methods for calculating or comparing measures of diagnostic accuracy – – – – Sensitivity/specificity Likelihood ratios Diagnostic odds ratio Area under ROC curve • Methods used to quantify uncertainty in accuracy measures (e.g., 95%) • Methods for calculating test reproducibility, if any 50 Statistical analysis methods • • • • Model building & assumptions Variable selection Missing data handling Coding of marker values in analyses (e.g., continuous vs. categorized) • Internal or external validation 51 Barriers to successful inclusion of biomarker studies • Often: add-on studies that aren’t necessary for achieving the (primary) objectives of the clinical trial – Clinical trials are hard to do well – If something is not a primary or secondary objective it is even harder to do it well • Clinical trials are not designed to answer the biomarker question • Biomarker studies may use assays that are not well established – It is almost impossible to learn to do an assay while studying a novel treatment • “normal” range, relevant cut-points, reproducibility – Too many variables Biomarkers in Clinical Research: Implications • • • • Compelling BUT Increased burden on patients/clinical staff Significant coordination effort required Significant increased cost over ‘traditional’ treatment trials – Numbers of patients may be increased and cost/patient will increase • Qualification, standardization and QA takes time, energy and money • Have a good question, team of experts, assays, specimens, trial design and analysis • Consider the feasibility, economics and likely impact Think about how you will report the study before you start • STARD: STAndards for Reporting of Diagnostic accuracy (Bossuyt et al.) • Published (2003): Clin Chem, Ann Intern Med, Radiol, BMJ, AJCP, Clin BioChem, Clin Chem Lab Med • Re-published (2003-4): 8 journals • REMARK: REporting recommendations for tumor MARKer prognostic studies (McShane et al.) • Published (2005): BJC, EJC, JCO, JNCI, NCPO • Re-published (2006): BCRT, Exp Oncol Additional Designs and Issues • Sargent D, et al. Clinical Trial Designs for Predictive Marker Validation in Cancer Treatment Trials. J Clin Oncol 23:2020-2027, 2005 • Simon R, Maitournam A. Evaluating the efficiency of targeted designs for randomized clinical trials. Clin Cancer Res; 10: 6759–6763, 2004