Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Basic Statistical Concepts Chapter 2 Reading instructions • • • • • • • • 2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 Introduction: Not very important Uncertainty and probability: Read Bias and variability: Read Confounding and interaction: Read Descriptive and inferential statistics: Repetition Hypothesis testing and p-values: Read Clinical significance and clinical equivalence: Read Reproducibility and generalizability: Read Bias and variability Eˆ Bias: Systemtic deviation from the true value Design, Conduct, Analysis, Evaluation Lots of examples on page 49-51 Bias and variability Larger study does not decrease bias ˆ n ; n Distribution of sample means: Drog X - Placebo -10 -4 n=40 mm Hg -10 Population mean bias = population mean Drog X - Placebo -7 Drog X - Placebo -7 -4 n=200 mm Hg -10 -7 -4 N=2000 mm Hg Bias and variability There is a multitude of sources for bias Publication bias Positive results tend to be published while negative of inconclusive results tend to not to be published Selection bias The outcome is correlated with the exposure. As an example, treatments tends to be prescribed to those thought to benefit from them. Can be controlled by randomization Exposure bias Differences in exposure e.g. compliance to treatment could be associated with the outcome, e.g. patents with side effects stops taking their treatment Detection bias The outcome is observed with different intensity depending no the exposure. Can be controlled by blinding investigators and patients Analysis bias Essentially the I error, but also bias caused by model miss specifications and choice of estimation technique Interpretation bias Strong preconceived views can influence how analysis results are interpreted. Bias and variability Amount of difference between observations True biological: Variation between subject due to biological factors (covariates) including the treatment. Temporal: Variation over time (and space) Often within subjects. Measurement error: Related to instruments or observers Design, Conduct, Analysis, Evaluation Raw Blood pressure data Placebo DBP (mmHg) Drug X Baseline Subset of plotted data 8 weeks Bias and variability Y X Variation in = Explained + Unexplained observations variation variation Bias and variability Is there any difference between drug A and drug B? Drug A Drug B Outcome Bias and variability Model: Yij i xij ij Y=μB+βx μB Y=μA+βx μA x=age Confounding Confounders Predictors of treatment Predictors of outcome A Treatment allocation Outcome B Example Smoking Cigarettes is not so bad but watch out for Cigars or Pipes (at least in Canada) Variable Non smokers Cigarette smokers Cigar or pipe smokers Mortality rate* 20.2 20.5 35.5 *) per 1000 person-years % Cochran, Biometrics 1968 Example Smoking Cigaretts is not so bad but watch out for Cigars or Pipes (at least in Canada) Variable Non smokers Cigarette smokers Cigar or pipe smokers Mortality rate* 20.2 20.5 35.5 Average age 54.9 50.5 65.9 *) per 1000 person-years % Cochran, Biometrics 1968 Example Smoking Cigaretts is not so bad but watch out for Cigars or Pipes (at least in Canada) Variable Non smokers Cigarette smokers Cigar or pipe smokers Mortality rate* 20.2 20.5 35.5 Average age 54.9 50.5 65.9 Adjusted mortality rate* 20.2 26.4 24.0 *) per 1000 person-years % Cochran, Biometrics 1968 Confounding The effect of two or more factors can not be separated Example: Compare survival for surgery and drug Survival Life long treatment with drug R Surgery at time 0 Looks ok but: •Surgery only if healty enough •Patients in the surgery arm may take drug •Complience in the drug arm May be poor Time Confounding Can be sometimes be handled in the design Example: Different effects in males and females Imbalance between genders affects result Stratify by gender A M A R Gender R B Balance on average B A F R Always balance B Interaction The outcome on one variable depends on the value of another variable. Example Interaction between two drugs A B A=AZD1234 Wash out R B B=AZD1234 + Clarithromycin A Interaction Example: Drug interaction Plasma concentration (µmol/L) linear scale Mean 5 AZD1234 4 AZD0865 alone 3 Combination of clarithromycin AZD1234 and AZD0865 2 1 0 0 4 8 12 16 20 24 Time after dose AUC AZD1234: 19.75 (µmol*h/L) AUC AZD1234 + Clarithromycin: Ratio: 36.62 (µmol*h/L) 0.55 [0.51, 0.61] Interaction Example: Treatment by center interaction Treatment difference in diastolic blood pressure 15 10 mmHg 5 0 -5 0 5 10 15 20 25 30 -10 -15 -20 -25 Ordered center number Average treatment effect: -4.39 [-6.3, -2.4] mmHg Treatment by center: p=0.01 What can be said about the treatment effect? Descriptive and inferential statistics The presentation of the results from a clinical trial can be split in three categories: •Descriptive statistics •Inferential statistics •Explorative statistics Descriptive and inferential statistics Descriptive statistics aims to describe various aspects of the data obtained in the study. •Listings. •Summary statistics (Mean, Standard Deviation…). •Graphics. Descriptive and inferential statistics Inferential statistics forms a basis for a conclusion regarding a prespecified objective addressing the underlying population. Confirmatory analysis: Hypothesis Results Conclusion Descriptive and inferential statistics Explorative statistics aims to find interesting results that Can be used to formulate new objectives/hypothesis for further investigation in future studies. Explorative analysis: Results Hypothesis Conclusion? Hypothesis testing, p-values and confidence intervals Objectives Variable Design Estimate p-value Confidence interval Statistical Model Null hypothesis Results Interpretation Hypothesis testing, p-values n X X , X R Statistical model: Observations 1 n from a class of distribution functions P : Hypothesis test: Set up a null hypothesis: H0: 0 and an alternative H1: 1 Reject H0 if X Sc R n P X Sc | 0 Rejection region Significance level p-value: The smallest significance level for which the null hypothesis can be rejected. Confidence intervals Let X, * 1 if H 0 : * rejected 0 if H 0 : * not rejected (critical function) Confidence set: CX : X, 0 The set of parameter values correponding to hypotheses that can not be rejected. A confidence set is a random subset CX covering the true parameter value with probability at least 1 . Example Objective: To compare sitting diastolic blood pressure (DBP) lowering effect of hypersartan 16 mg with that of hypersartan 8 mg Variable: The change from baseline to end of study in sitting DBP (sitting SBP) will be described with an ANCOVA model, with treatment as a factor and baseline blood pressure as a covariate Model: yij = μ + τi + β (xij - x··) + εij Parameter space: R 4 1 2 3 0 treatment effect i = 1,2,3 {16 mg, 8 mg, 4 mg} Null hypoteses (subsets of H01: τ1 = τ2 (DBP) H02: τ1 = τ2 (SBP) H03: τ2 = τ3 (DBP) H04: τ2 = τ3 (SBP) R 4): Example contined Hypothesis Variable LS Mean CI (95%) p-value 1: 16 mg vs 8 mg Sitting DBP -3.7 mmHg [-4.6, -2.8] <0.001 2: 16 mg vs 8 mg Sitting SBP -7.6 mmHg [-9.2, -6.1] <0.001 3: 8 mg vs 4 mg Sitting DBP -0.9 mmHg [-1.8, 0.0] 0.055 4 : 8 mg vs 4 mg Sitting SBP -2.1 mmHg [-3.6, -0.6] 0.005 This is a t-test where the test statistic follows a t-distribution Rejection region: X : T X c -c 0 c P-value: The null hypothesis can pre rejected at -4.6 -2.8 0 T X 0.001 1 2 P-value says nothing about the size of the effect! Example: Simulated data. The difference between treatment and placebo is 0.3 mmHg No. of patients per group Estimation of effect p-value 10 1.94 mmHg 0.376 100 -0.65 mmHg 0.378 1000 0.33 mmHg 0.129 10000 0.28 mmHg <0.0001 100000 0.30 mmHg <0.0001 A statistical significant difference does NOT need to be clinically relevant! Statistical and clinical significance Statistical significance: Is there any difference between the evaluated treatments? Clinical significance: Does this difference have any meaning for the patients? Health ecominical relevance: Is there any economical benefit for the society in using the new treatment? Statistical and clinical significance A study comparing gastroprazole 40 mg and mygloprazole 30 mg with respect to healing of erosived eosophagitis after 8 weeks treatment. Drug Healing rate gastroprazole 40 mg 87.6% mygloprazole 30 mg 84.2% Cochran Mantel Haenszel p-value = 0.0007 Statistically significant! Clinically significant? Health economically relevant?