* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Molecular Profiles Of Breast Cancer Progression
Metagenomics wikipedia , lookup
Neuronal ceroid lipofuscinosis wikipedia , lookup
Ridge (biology) wikipedia , lookup
Gene nomenclature wikipedia , lookup
Fetal origins hypothesis wikipedia , lookup
Gene desert wikipedia , lookup
Gene therapy wikipedia , lookup
Epigenetics of neurodegenerative diseases wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
Gene therapy of the human retina wikipedia , lookup
Epigenetics of diabetes Type 2 wikipedia , lookup
Therapeutic gene modulation wikipedia , lookup
Microevolution wikipedia , lookup
Public health genomics wikipedia , lookup
Gene expression programming wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Genome (book) wikipedia , lookup
Oncogenomics wikipedia , lookup
Nutriepigenomics wikipedia , lookup
Mir-92 microRNA precursor family wikipedia , lookup
Designer baby wikipedia , lookup
Biology and consumer behaviour wikipedia , lookup
Molecular Profiles Of Breast Cancer Progression Gabriela Alexe IBM Computational Biology Center, IBM T.J. Watson Research Center, Yorktown Heights, NY 10598, USA. Email: [email protected] We develop a new robust technique to analyze microarray data which uses a combination of principal components analysis and consensus ensemble kclustering to find robust clusters and gene markers in the data. We apply our method to a public microarray breast cancer dataset from Ma et al. (2003) which has expression levels of genes in normal samples as well as in three pathological stages of disease; namely, atypical ductal hyperplasia or ADH, ductal carcinoma in situ or DCIS and invasive ductal carcinoma or IDC. Our method averages over clustering techniques and data perturbation to find stable, robust clusters and gene markers. A major result of our analysis is that different sets of patients seem to progress to the same final phenotype along different functional pathways. Our findings are validated on external gene expression microarray datasets.