Download Molecular Profiles Of Breast Cancer Progression

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Metagenomics wikipedia , lookup

Neuronal ceroid lipofuscinosis wikipedia , lookup

Ridge (biology) wikipedia , lookup

Gene nomenclature wikipedia , lookup

Fetal origins hypothesis wikipedia , lookup

Gene desert wikipedia , lookup

Gene therapy wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Microevolution wikipedia , lookup

Public health genomics wikipedia , lookup

Gene expression programming wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Genome (book) wikipedia , lookup

NEDD9 wikipedia , lookup

Oncogenomics wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Mir-92 microRNA precursor family wikipedia , lookup

Designer baby wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Gene expression profiling wikipedia , lookup

RNA-Seq wikipedia , lookup

Transcript
Molecular
Profiles
Of
Breast
Cancer
Progression
Gabriela Alexe
IBM Computational Biology Center, IBM T.J. Watson Research Center, Yorktown
Heights, NY 10598, USA. Email: [email protected]
We develop a new robust technique to analyze microarray data
which uses a combination of principal components analysis and consensus ensemble kclustering to find robust clusters and gene markers in the data.
We apply our method to a public microarray breast cancer dataset from Ma et al. (2003)
which has expression levels of genes in normal samples as well as in three pathological
stages of disease; namely, atypical ductal hyperplasia or ADH, ductal carcinoma in situ
or DCIS and invasive ductal carcinoma or IDC. Our method averages over clustering
techniques and data perturbation to find stable, robust clusters and gene markers. A major
result of our analysis is that different sets of patients seem to progress
to the same final phenotype along different functional pathways. Our findings are
validated on external gene expression microarray datasets.