Download Introduction to Data..

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Cluster analysis wikipedia , lookup

Transcript
Department of Computer Science
Elective Course
Introduction Data warehouse and Data mining
16B1NCI438
Course Coordinator: Avinash Pandey

Knowledge of Database management system
Introduction

This course provides an introduction to Data warehouse and data
mining. The course will begin with introduction of Datawarehouse and followed by integration of a data mining system
with data warehouse.

We shall also do experiments to implement various data mining
techniques such as classification, clustering, etc.

Introduction to data ware house
 Data warehousing components, data extraction, cleanup, and transformation tools –
metadata; business analysis - reporting and query tools and applications, online
analytical processing (OLAP) – need – multidimensional data model;

Data Mining
 data mining- introduction – data – types of data – data mining functionalities –
interestingness of patterns – integration of a data mining system with a data
warehouse – issues –data pre-processing;

Association rule mining and classification
 Mining Frequent Patterns, Associations and Correlations – Mining Methods – Mining
Various Kinds of Association Rules
 Classification and Prediction - Basic Concepts , Decision Tree Induction ,Bayesian
Classification, Support Vector Machines , Other Classification Methods

Cluster Analysis
 Types of Data in Cluster Analysis, A Categorization of Major Clustering Methods,
Partitioning Methods, Hierarchical Methods, Density-Based Methods, Grid-Based
Methods, Model-Based Clustering Methods, Clustering High-Dimensional Data

Applications and Trends in Data Mining
 Data Mining Applications: Social Network Analysis , Mining Sequence Patterns in
Biological Data, Text Mining

Display a comprehensive understanding of different data
mining tasks and the algorithms.

Apply the techniques of clustering, classification,
association finding, feature selection and visualisation on
real world data.

Determine whether a real world problem has a data
mining solution.

Conceptualise a data mining solution to a practical
problem.

Examination
75 Marks
 T-1
20 Marks
20 Marks
35 Marks
25 Marks
 T-2
 T-3

Internal assessment
 Attendance
 Project
▪ The class project is an important component of this course. Students will be able to put
their gained knowledge to practice and demonstrates their skills. The project topics should
be approved by the instructor.