Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
國立雲林科技大學 National Yunlin University of Science and Technology Poisson-Based Self-Organizing Feature Maps and Hierarchical Clustering for Serial Analysis of Gene Expression Data Presenter : Shao-Wei Cheng Authors : Haiying Wang, Huiru Zheng, and Francisco Azuaje TCBB 2007 Intelligent Database Systems Lab Outline Motivation Objective Methodology Experiments and Results Conclusion Personal Comments N.Y.U.S.T. I. M. 2 Intelligent Database Systems Lab Motivation N.Y.U.S.T. I. M. Serial analysis of gene expression (SAGE) PoissonC is an adaptation the k-means. But PoissonC fails to provide users with a platform to explore complex relationships between clusters. SOM can be used for visualization. But SOM has shown poor performance on SAGE data analysis by euclidean distance. Intelligent Database Systems Lab Objectives To implement and evaluate an adaptation of the SOM algorithm (PoissonS) incorporates Poisson-statistics-based distance function into the SOM learning process to support SAGE data analysis. N.Y.U.S.T. I. M. To integrate Poisson-based distance into a hierarchical clustering system (PoissonHC) Be combined with PoissonS to further improve pattern discovery and visualization for large SAGE data sets. 4 Intelligent Database Systems Lab Methodology Poisson distribution PoissonS N.Y.U.S.T. I. M. Intelligent Database Systems Lab Methodology N.Y.U.S.T. I. M. PoissonHC Intelligent Database Systems Lab Experiments N.Y.U.S.T. I. M. Data sets Synthetic Data Mouse Retinal SAGE Data Human Cancer SAGE Data Compared algorithms: SOM with Euclidean distance SOM with Pearson Correlation Intelligent Database Systems Lab Experiments N.Y.U.S.T. I. M. PossionS Synthetic Data Euclidean distance Pearson Correlation Intelligent Database Systems Lab Experiments N.Y.U.S.T. I. M. PossionHC Mouse Retinal SAGE Data Euclidean distance Pearson Correlation Intelligent Database Systems Lab Experiments SAGE libraries N.Y.U.S.T. I. M. PossionHC Human Cancer SAGE tags Intelligent Database Systems Lab Experiments N.Y.U.S.T. I. M. PossionS + PossionHC Mouse Retinal SAGE Data Intelligent Database Systems Lab Conclusion N.Y.U.S.T. I. M. By incorporating a Poisson statistics-based distance into the SOM learning algorithm and hierarchical clustering techniques, significant improvements in pattern discovery and visualization for SAGE data are accomplished. 12 Intelligent Database Systems Lab Personal Comments Advantage Incorporates Poisson-statistics-based distance function into the SOM and hierarchical clustering. Drawback N.Y.U.S.T. I. M. Can’t determine the optimal number of clusters automatically. Application Clustering about SAGE data. . 13 Intelligent Database Systems Lab