Download Analysis of Microarray Genomic Data of Breast Cancer Patients

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Metagenomics wikipedia , lookup

Gene therapy wikipedia , lookup

Cancer epigenetics wikipedia , lookup

Long non-coding RNA wikipedia , lookup

Public health genomics wikipedia , lookup

Gene nomenclature wikipedia , lookup

Nicotinic acid adenine dinucleotide phosphate wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Pathogenomics wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Genome evolution wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Minimal genome wikipedia , lookup

Gene wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Microevolution wikipedia , lookup

Genomic imprinting wikipedia , lookup

Gene expression programming wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Genome (book) wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Oncogenomics wikipedia , lookup

Ridge (biology) wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Designer baby wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

NEDD9 wikipedia , lookup

RNA-Seq wikipedia , lookup

Gene expression profiling wikipedia , lookup

Mir-92 microRNA precursor family wikipedia , lookup

Transcript
Analysis of Microarray Genomic
Data of Breast Cancer Patients
Hui Liu, MS candidate
Department of statistics
Prof. Eric Suess, faculty mentor
Department of statistics
Introduction
• Many biomedical tests assay only one or
two gene expression activities.
• Microarray (Gene Chip) assays thousands
of gene expression at the same time.
• Does microarray provide us a better
technique to understand clinical research?
Two-color
Two-color fluorescent
fluorescent hybridization
hybridization for
for the
analysis
gene
expression
microarray
assayingof
gene
expression
byby
microarray
mRNA from Sample 2
(Experimental Sample)
mRNA from Sample 1
(Reference Sample)
Reverse transcribe each sample
using a different fluoresce
nucleotide
(Cy3 or Cy5)
Mix the complex together
Hybridize overnight
Scan and
determine
fluorescence
intensities at
each spot
Research Project Goals
• Independently analyze the Stanford genome
database breast cancer microarray data.
• To learn CLUSTER and TREEVIEW
microarray analysis software programs
(Michael Eisen, 1998-1999).
• To confirm the previous study result (Sorlie et
al, PNAS: Sept 2001, Vol. 98, no. 19, 1086910874).
• To test if microarray analysis is a better
approach for breast cancer clinical research.
Stanford Microarray Database
• Clustering analysis:85 cDNA microarray
experiments: 78 cancers, 3 fibroadenomas, 4
normal breast tissues
• Survial analysis: 49 patients in a cohort study in
which advanced breast cancers without metastasis
were uniformly treated
Methods
• CLUSTER program hierarchical clustering was
applied and the results were displayed by using
TREEVIEW software.
• SAS procedures-PROC PHREG and PROC
LIFETEST-were used for the survival analysis.
Hierarchical Clustering Analysis
• Hierarchical Clustering Algorithm used by the
CLUSTER program is to compute a dendrogram
that assembles all items (genes or arrays) into a
single tree by repeated cycles of clustering process.
• The Pearson correlation coefficient is used to
measure similarity/distance between the expression
of two genes.
 X i  X  Yi  Y 
1
r


N

Sx


 S 
 Y 
• The clustering process groups together genes with
similar patterns of expression basing on the
similarity matrix.
Red: transcript level > median
Green: transcript level<median
Black: transcript level=median
Grey: inadequate or missing data
Hierarchical clustering of 456 intrinsic cDNA clones
ERBB2 amplicor cluster
Novel unknown cluster
Basal epithelial cell-enriched cluster
Normal breast-like cluster
Luminal epithelial gene cluster containing ER
Cluster dendrogram showing the five subtypes of tumors
Basal-like ERBB2+ Luminal
Subtype C
Luminal Subtype A +
B
Normal
Breast-like
Hierarchical clustering of 456 intrinsic cDNA clones
Basal Erbb2+ C
A
B Normal
ERBB2+: genes in the
ERBB2 amplicor cluster
ERBB2 amplicon:
ERBB2, GRB7, etc.
Luminal subtype C: Novel unknown cluster
a novel set of genes
Basal-like: Keratins 5
and 17, laminin, and
Basal epithelial cell-enriched cluster
fatty acid binding
protein 7
Normal breast like:
genes expressed in
Normal breast-like cluster
adipose and other
nonepithelial cell type
Luminal subtype
Luminal epithelial gene cluster containing ER
A+B: ER a gene,
GATA binding protein
3, X-box binding
protein 1
Cluster dendrogram showing the five subtypes of tumors
Basal-like ERBB2+ Luminal
Subtype C
Luminal Subtype A +
B
Normal
Breast-like
Coordinated function of genes cluster
Breast cancer prognosis
Survival analysis: breast CA patient Survival Time
or tumor Relapse Free Time
Hierarchical clustering of 456 intrinsic cDNA clones
Basal Erbb2+ C
A
B Normal
ERBB2+: genes in the
ERBB2 amplicor cluster
ERBB2 amplicon:
ERBB2, GRB7, etc.
Luminal subtype C: Novel unknown cluster
a novel set of genes
Basal-like: Keratins 5
and 17, laminin, and
Basal epithelial cell-enriched cluster
fatty acid binding
protein 7
Normal breast like:
genes expressed in
Normal breast-like cluster
adipose and other
nonepithelial cell type
Luminal subtype
Luminal epithelial gene cluster containing ER
A+B: ER a gene,
GATA binding protein
3, X-box binding
protein 1
Conclusion
• Confirmed the previous study results (Sorlie et al,
Sept. 2001)
* Clinical outcome of Luminal subtype A+B group
is statistically different from Luminal subtype C
group although they are both ER positive.
* There are no significant difference in clinical
outcome between Luminal subtype C group and
Basal-like group probably because they share the
expression of a set of novel genes.
• Learned modern advanced statistical technique for
microarray analysis: CLUSTER, TREEVIEW
Conclusion
Microarray
Gene expression
Hierarchical Cluster Analysis
Tumor classification
Survival analysis
Clinical outcome
Microarray analysis allows us to understand the
coordinated function of groups of genes in disease
prognosis, diagnosis and therapeutic resistance. It is
a valuable approach to clinical research.
Analysis of Microarray Genomic
Data of Breast Cancer Patients
Hui Liu, MS candidate
Department of statistics
Prof. Eric Suess, faculty mentor
Department of statistics
Overall survival analysis
Survival time (months)
Relapse Free Survival analysis
Proportion of patients survived
Relapse Free time (months)
Cluster dendrogram showing the five subtypes of tumors
Basal-like ERBB2+ Luminal
Subtype C
Luminal Subtype A +
(from Sorlie et al, PNAS, Septemer 2001)
B
Normal
Breast-like