Download Spatial correlation - Vanderbilt University School of Medicine

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Spatial Information in DW- and DCE-MRI
Parametric Maps in Breast Cancer Research
Hakmook Kang
Department of Biostatistics
Center for Quantitative Sciences
Vanderbilt University
Joint Work
•
•
•
•
Allison Hainline in Biostatistics
Xia (Lisa) Li Ph.D at VUIIS
Lori Arlinghaus, Ph.D at VUIIS
Tom Yankeelov, Ph.D at VUIIS
Table of Contents
•
•
•
•
•
•
Spatial & Temporal Correlation
Motivation
DW- & DCE-MRI
Spatial Information
Redundancy Analysis & Penalized Regression
Data Analysis
Spatial & Temporal Correlation
•
•
Temporal correlation: Any measure at a time point is
correlated with measures from neighboring time points,
e.g., longitudinal data
Spatial correlation: Any measure at a voxel is correlated
with measures from its neighbors, e.g., ADC, Ktrans....
Spatial Correlation
Radioactive Contamination
Elevation
Medical Imaging Data
•
•
•
•
Structural & functional MRI data, e.g., brain fMRI, breast
DW- & DCE-MRI
CT scans, etc
Imaging data consist of lots of measures at many
pixels/voxels
Not reasonable to assume independence
Motivation
•
•
•
•
Intrinsic spatial correlation in medical imaging data
Ignoring the underlying dependence
Oversimplifying the underlying dependence
Overly optimistic if positive spatial/temporal correlation
is ignored
Mathematics
•
•
•
•
Cov(X, Y) = 2, positively correlated
Var(X+Y) = Var(X) + Var(Y) + 2Cov(X,Y)
Var(X+Y) = Var(X) + Var(Y) if assume X⊥Y, always smaller
by 2Cov(X,Y)
Variance is smaller than what it should be if correlations
among voxels are ignored.
Motivation
•
•
•
•
DW- & DCE-MRI data from 33 patients with stage II/III
breast cancer
Typical ROI-level analysis: define one region of interest
(ROI) per patient and take the average of values (e.g.,
ADC) within ROI
Build models to predict who will response to NAC
Need a tool to fully use the given information to improve
prediction
MRI – Derived Parameters
DW- and DCE-MRI
•
•
DW-MRI: water motion
DCE-MRI: tumor-related physiological parameters
MRI-derived Parameters
•
•
•
•
•
ADC: apparent diffusion coefficient
Ktrans: tumor perfusion and permeability
kep: efflux rate constant
ve: extravascular extracellular volume fraction
vp: blood plasma volume fraction
MRI-derived Parameters
ADC
Ktrans
kep
ve
vp
Using Spatial Information
Radioactive Contamination
Kep & ADC
http://www.neimagazine.com/features/featuresoil-contamination-in-belarus-25-years-later/featuresoil-contamination-in-belarus-25-years-later-5.html
Spatial Information
•
•
•
Model change in mortality by looking at the average
contamination over time
Model Pr(pCR=1) using ROI-level Kep and/or ADC maps,
pCR = pathological complete response
Oversimplification
How to use the given spatial
information?
1. Variable selection + penalization
2. Ridge
3. LASSO (Least Absolute Shrinkage and Selection
Operator)
1. Elastic Net
Redundancy Analysis
•
•
•
•
A method to select variables which are most unlikely to be
predicted by other variables
X1, X2, ..., X21
Fit Xj ~ X(-j), if R2 is high, then remove Xj
We can also use backward elimination,
Y ~ X1 + ... + X21 + e
Redundancy Analysis
•
•
•
•
First, compute 0,5,...,100 percentiles of Kep and ADC for
each patient
X1= min, X2=5 percentile,..., X20 = 95 percentile, and X21
= max
Apply redundancy analysis: choose which percentiles
uniquely define the distribution of Kep (or ADC)
Apply backward elimination
vs. mean = 0.284
Penalized Regression
•LASSO: L penalty
1
•Ridge: L penalty
2
•Elastic Net: L + L penalty
1
2
Penalized Regression
•
•
•
The penalty terms control the amount of shrinkage
The larger the amount of shrinkage, the greater the
robustness to collinearity
10-fold CV to estimate the penalty terms (default in R)
Approaches
1) Var Selection + Penalization (ridge)
- Variable selection either by redundancy analysis or by
backward elimination
- Combined with ridge logistic regression
2) Ridge (No variable selection)
3) Lasso
4) Elastic Net
Models
Voxel-Level
Voxel-Level + ROI + Clinical
Conventional Method
•
ROI-level analysis
•
ROI + clinical variables (i.e., age and tumor grade)
Data Analysis
Description of Data
•
•
33 patients with grade II/III breast cancer
Three MRI examinations
MRI t2
MRI t1
1st NAC
NACs
Surgery
MRI t3
Objective: Using MRI data (Kep & ADC only) at t1 and t2, we want
to predict if a patient will response to the first cycle of NAC.
Responder
Non-Responder
Correction for Overfitting
•
•
Bootstrap based overfitting penalization
Overfitting-corrected AUC = AUC (apparent) – optimism
(using bootstrap)
Results
Results
•
•
•
•
Penalizing overly optimistic results
Redundancy + Ridge with clinical variables is better than the
others
AUC = 0.92, 5% improvement over ROI + clinical model
ACC = 0.84, 10% improvement over ROI + clinical model
Summary
•
•
•
Compared to ROI-level analysis (i.e., average ADC & Kep), we
are fully using available information (voxel-level information)
We partially take into account the underlying spatial correlation
Reliable & early prediction -> better treatment options before
surgery
Future Research:
Spatial Correlation
•
•
•
Modeling the underlying spatial correlation in imaging
data
Parametric function: 1) Exponential Cov function 2)
Matern’s family
Need to relax isotropic assumption