Download Data Mining

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

K-means clustering wikipedia , lookup

Nonlinear dimensionality reduction wikipedia , lookup

Cluster analysis wikipedia , lookup

Transcript
With effect from the academic year 2015-16
IT 6112
DATA MINING
Instruction
Duration of University Examination
University Examination
Sessional
3 Periods per Week
3 Hours
80 Marks
20 Marks
Course objectives:
At the end of the course, student should be able to
1. To introduce the basic concepts of Data Mining.
2.
Discover interesting patterns in corpus of data available using various
techniques.
3.
Analyze supervised and unsupervised models and estimate the accuracy of
the algorithms.
Course Outcomes:
Students who complete this course should be able to
1. Preprocess the raw data to make it suitable for various data mining algorithms.
2. Apply the techniques of clustering, classification, association finding, feature selection
and visualization to real world data.
UNIT-I
Introduction: Challenges, Origins of Data Mining and Data Mining Tasks.
Data: Types of Data Quality, Data Preprocessing, Measures of Similarity and Dissimilarity,
OLAP and Multidimensional Data Analysis.
UNIT-II
Classification: Preliminaries, General Approach to Solving a Classification Problem,
Decision
Tree Induction-Model Over fitting, Evaluating the Performance of a Classifier, Methods of
Comparing Classifiers, Rule-Based Classifier.
UNIT-III
Classification: Nearest-Neighbor classifiers, Bayesian Classifiers, Artificial Neutral
Networks,
Support Vector Machine, Ensemble Methods, Class Imbalance Problem, Multiclass Problem.
UNIT-IV
Association Analysis: Problem Definition, Frequent Item Set Generation, Rule Generation,
Compact Representation of Frequent Item Sets, Alternative Methods for Generating Frequent
Item Sets, FP-Growth Algorithm, Evaluation of Association Patterns, Effect of Skewed
Support
Distribution, Handling Categorical Attributes a Handling Continuous Attributes, Handling a
concept Hierarchy.
With effect from the academic year 2015-16
UNIT-V
Cluster Analysis: Overview, K-means, Agglomerative Hierarchical Clustering, DBSCAN
Cluster Evaluation on Characteristics of Data, Clusters and Clustering Algorithms.
Suggested Reading:
1) Pang-Ning Tan. Michael Steinbach, Vipin Kumar, "Introduction to Data Mining",
Pearson Education, 2008.
2) Han J &Kamber M, Data Mining: Concepts and Techniques, Third Edition, Elsiver, 2011
3) Arun K Pujari, "Data Mining Techniques ", University Press. 2ndEdn, 2009.
4) VikrarnPudi, P. Radha Krishna, "Data Mining", Oxford University Press, 1st edition,
2009.
Sumathi, S N Sivanandam, "Introduction to Data Mining and its Applications ", Springer.