Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
Data Analysis 2 A course introduction Jan Platoš January 31, 2017 Department of Computer Science Faculty of Electrical Engineering and Computer Science VŠB - Technical University of Ostrava Course Content • Pattern and Rule mining • Pattern and Rule assessment • Clustering algorithms and principles • • • • Representative based clustering Hierarchical clustering Density based clustering Clustering validation • Application of graphs for vector data analysis • Matrix algorithms for graph/network data processing • Community detection using top-down and bottom-up approaches. • Basic algorithm for graph visualization 1 Bibliography • Aggarwal C.C. (2015), Data Mining: The Textbook, Springer. • Bramer, M. (2013). Principles of data mining. Springer. • Leskovec, J., Rajaraman, A., Ullman, J. D. (2014). Mining of massive datasets. Cambridge University Press. • Newman, M. (2010). Networks: an introduction. Oxford University Press. • Witten, I. H., Frank, E. (2011). Data Mining: Practical machine learning tools and techniques [3rd Ed.]. Morgan Kaufmann. • Zaki, M. J., Meira Jr, W. (2014). Data Mining and Analysis: Fundamental Concepts and Algorithms. Cambridge University Press. 2 Course Evaluation - Full-time students • There is 30 points from exercises (10x3): • 1 point for presence for the whole exercise (a motivation). • 2 points for finishing task (small implementation/analytical tasks) at the exercise or at home (1-2 hours of work). • Small presentation for 25 points. • Presentation related to the data analysis or implementation task bellow. (1-2h) • Method implementation and results comparison for 25 points. • Evaluation of the implemented method on the standard dataset and comparison of the results with standard implementation (2-4h). • Dataset analysis for 25 points. • A report (PDF) about analysis of the selected dataset with standard algorithm (2-4h). 3 Course Evaluation - Combined students • Small presentation for 30 points. • Presentation related to the data analysis or implementation task bellow. • Method implementation and results comparison for 35 points. • Evaluation of the implemented method on the standard dataset and comparison of the results with standard implementation. • Dataset analysis for 35 points. • A report (PDF) about analysis of the selected dataset with standard algorithm. 4 Questions?