Download Seung et al, New Engl J Med 2008

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Regression toward the mean wikipedia , lookup

Choice modelling wikipedia , lookup

Data assimilation wikipedia , lookup

Coefficient of determination wikipedia , lookup

Regression analysis wikipedia , lookup

Linear regression wikipedia , lookup

Transcript
Primer on Statistics
for Interventional
Cardiologists
Giuseppe Sangiorgi, MD
Pierfrancesco Agostoni, MD
Giuseppe Biondi-Zoccai, MD
What you will learn
•
•
•
•
•
•
•
•
•
•
•
•
Introduction
Basics
Descriptive statistics
Probability distributions
Inferential statistics
Finding differences in mean between two groups
Finding differences in mean between more than 2 groups
Linear regression and correlation for bivariate analysis
Analysis of categorical data (contingency tables)
Analysis of time-to-event data (survival analysis)
Advanced statistics at a glance
Conclusions and take home messages
Why a glance at advanced stats?
• Like in a live case, we want to expose you
to the next level of statistical analysis
• It is not highly complex in itself, but only
builds upon a solid knowledge of standard
statistics, ie what we have seen today
• If you are interested in more, let us
know…
What you will learn
• Advanced statistics at a glance:
–
–
–
–
–
–
–
–
Mulivariable analysis
Multiple linear regression
Logistic regression
Cox proportional hazards model
Generalized linear models
Propensity analysis
Resampling methods
Meta-analysis
A hierarchy of analysis levels
• Statistical analyses can be distinguished in
– Univariate (eg describing a variable such as
lesion length)
– Bivariate (eg comparing two different variables
such as CKMB release and subsequent LVEF)
– Multivariable (eg appraising the impact of two or
more independent/moderator variables on a
single dependent variable)
– Multivariate (eg appraising the impact of two or
more independent/moderator variables on two
or more dependent variables)
An example of multivariable analysis
Vlaar et al, Lancet 2008
An example of multivariable analysis
Vlaar et al, Lancet 2008
An example of multivariable analysis
Vlaar et al, Lancet 2008
What you will learn
• Advanced statistics at a glance:
–
–
–
–
–
–
–
–
Mulivariable analysis
Multiple linear regression
Logistic regression
Cox proportional hazards model
Generalized linear models
Propensity analysis
Resampling methods
Meta-analysis
Multiple linear regression
• How can I predict the impact of balloon dilation
pressure on angiographic late loss, taking
concomitantly into account diabetes status and
ACC/AHA lesion type?
Multiple linear regression
• How can I predict the impact of balloon dilation
pressure on angiographic late loss, taking
concomitantly into account diabetes status and
ACC/AHA lesion type?
In other words, how can I predict the impact of a
given variable (aka independent) on another
continuous variable (aka dependent), taking
concomitantly into account other variables?
Multiple linear regression
• Multiple linear regression is just an
extension of bivariate linear regression
• Thus we can move from bivariate
regression to multivariable regression
analysis
• A great deal of emphasis should be placed
on model building and validation, eg
checking residuals and avoiding overfitting
Identifying predictors of late loss
Sangiorgi et al, AHJ 2008
Identifying predictors of late loss
Sangiorgi et al, AHJ 2008
What you will learn
• Advanced statistics at a glance:
–
–
–
–
–
–
–
–
Mulivariable analysis
Multiple linear regression
Logistic regression
Cox proportional hazards model
Generalized linear models
Propensity analysis
Resampling methods
Meta-analysis
Logistic regression
• How can I predict the impact of left ventricular
ejection fraction (LVEF) on the 12-month risk of
ARC-defined stent thrombosis?
Logistic regression
• How can I predict the impact of left ventricular
ejection fraction (LVEF) on the 12-month risk of
ARC-defined stent thrombosis?
In other words, how can I predict the impact of a
given variable (aka independent) on another
dichotomous variable (aka dependent)
Logistic regression
• Logistic regression is based on the logit which
transforms a dichotomous dependent variable
into a continuous one
• Thus a number of assumptions should be valid
to trust its results
• Logistic regression can also be applied to
multivariable and polichotomous analyses
• Emphasis should again be placed on model
building and validation, eg checking residuals
and avoiding overfitting
Logistic regression
Logistic regression
Sangiorgi et al, AHJ 2008
What you will learn
• Advanced statistics at a glance:
–
–
–
–
–
–
–
–
Mulivariable analysis
Multiple linear regression
Logistic regression
Cox proportional hazards model
Generalized linear models
Propensity analysis
Resampling methods
Meta-analysis
Cox proportional hazard model
• How can I predict the impact of left ventricular
ejection fraction (LVEF) on the risk of ARCdefined stent thrombosis over a variable period
of follow-up?
In other words, how can I predict the impact of a
given variable (aka independent) on another
dichotomous variable (aka dependent), but with
variable follow-up and aiming to exploit as much
as possible all event data
Cox proportional hazard model
• Cox proportional hazard analysis is a
parametric analysis that can be used for
dichotomous dependent variables occurring
over time and to test its association with one
or more independent variables
• A number of assumptions should hold true
• In particular, hazards should be constant
over time → log−log survival function
Cox proportional hazard model
Agostoni et al, Am J Cardiol 2005
Cox proportional hazard model
Agostoni et al, Am J Cardiol 2005
Cox proportional hazard model
Marroquin et al, New Engl J Med 2008
What you will learn
• Advanced statistics at a glance:
–
–
–
–
–
–
–
–
Mulivariable analysis
Multiple linear regression
Logistic regression
Cox proportional hazards model
Generalized linear models
Propensity analysis
Resampling methods
Meta-analysis
First things first
• The generalized linear model (GLM) is a flexible
generalization of ordinary least squares
regression
• It relates the random distribution of the
measured variable of the experiment (the
distribution function) to the systematic (nonrandom) portion of the experiment (the linear
predictor) through a function called the link
function (g)
First things first
Earth
Oxford
Generalized linear model (GLM)
• Through differing link functions, GLM
corresponds to other well known models
Distribution
Name
Normal
Exponential
Identity
Inverse
Gamma
Inverse
Gaussian
Inverse
squared
Poisson
Log
Binomial
Logit
Link Function
Mean
Function
What you will learn
• Advanced statistics at a glance:
–
–
–
–
–
–
–
–
Mulivariable analysis
Multiple linear regression
Logistic regression
Cox proportional hazards model
Generalized linear models
Propensity analysis
Resampling methods
Meta-analysis
Propensity analysis
• How can appraise the impact of a specific stent
type on long-term risk of stent thrombosis
outside the context of a randomized controlled
trials?
In other words, how assess the independent
impact of a given independent/moderator
variable when my dataset is not based on
randomization and likelihood of underlying
confounding is very high
Propensity analysis
• Propensity analysis is a peculiar type of
multivariable analysis in which the propensity
to choose one treatment versus the other(s)
is explictly assessed and measured
• Building upon such propensity (usually by
means of non-parsimonious scores), the
dataset is matched and/or analysed
• Propensity are usually exploited in: a)
matched pairs, b) quintiles/deciles, or c) in
logistic/Cox regression models
Propensity score analysis
Seung et al, New Engl J Med 2008
Propensity score analysis
Seung et al, New Engl J Med 2008
Propensity score analysis
Seung et al, New Engl J Med 2008
Propensity score analysis
Seung et al, New Engl J Med 2008
What you will learn
• Advanced statistics at a glance:
–
–
–
–
–
–
–
–
Mulivariable analysis
Multiple linear regression
Logistic regression
Cox proportional hazards model
Generalized linear models
Propensity analysis
Resampling methods
Meta-analysis
Resampling methods
• A friendly question for our best pupil: how can I
build 95% confidence intervals for medians?
In other words, how can I make statistical
inference when the variable I am focusing onto
does not follow standard statistical distributions,
is highly skewed or sparse?
Resampling methods
• Resampling methods are powerful validation
and quantitation tools, as they can be much
more distribution-independent than all other
statistical methods
• The 2 most commonly used resampling
methods are:
- Jacknife
- bootstrap
Bootstrap
What you will learn
• Advanced statistics at a glance:
–
–
–
–
–
–
–
–
Mulivariable analysis
Multiple linear regression
Logistic regression
Cox proportional hazards model
Generalized linear models
Propensity analysis
Resampling methods
Meta-analysis
Meta-analysis
• How can I decide whether first generation DES
increase stent thrombosis given the plethora of
several small, conflicting, and/or inconclusive
pertinent clinical studies?
In other words, how can I combine data from
several small, conflicting, and/or inconclusive
primary studies?
Meta-analysis
• Meta-analysis is a group of statistical
methods used to combine quantitative
estimates from several primary research
studies
• Meta-analysis increase statistical power,
boost external validity, and can test/explore
secondary hypotheses (eg meta-regression)
• When conducted in the context of a
systematic literature review and focusing only
on randomized clinical trials, they are
considered the top level of clinical evidence
Study-level meta-analysis
De Luca et al, Eur Heart J 2008
Study-level meta-analysis
De Luca et al, Eur Heart J 2008
Study-level meta-analysis
De Luca et al, Eur Heart J 2008
Study-level meta-analysis
De Luca et al, Eur Heart J 2008
Study-level meta-analysis
De Luca et al, Eur Heart J 2008
Patient-level meta-analysis
Mauri et al, New Engl J Med 2007
Patient-level meta-analysis
Pooled patient-level analysis for protocol-defined stent thrombosis
Mauri et al, New Engl J Med 2007
Thank you for your attention
For any correspondence:
[email protected]
For further slides on these topics feel
free to visit the metcardio.org website:
http://www.metcardio.org/slides.html