Download CSIS-445SyllabusFall_2016SPoposki_

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Cluster analysis wikipedia , lookup

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
CSIS-445- Advanced Databases
(Topic Data Mining)
Fall 2016
Syllabus
Instructor:
Saso Poposki
Email:
[email protected]
Textbook & Materials:
1) Data Mining : Concepts and Techniques (3rd Edition).
Han, Jiawei, Kamber, Micheline, and Pei, Jian.
ISBN: 9780123814791
2) A Programmer's Guide to Data Mining (The Ancient Art of the Numerati)
Ron Zacharski.
- free online data mining textbook: http://guidetodatamining.com/
REFERENCE MATERIALS
3) Data Science in R: A Case Studies Approach to Computational Reasoning and
Problem Solving
by Deborah Nolan, Duncan Temple Lang
ISBN 9781482234817
Software: We will be using open- source software that you can install on your own
machines. Throughout the course we are extensively using Python and R (previous
knowledge of these specific languages is not required, however a successfully finished
programming course / experience in any high-level programming language is required).
1) Enthought Canopy: Easy Python Deployment Plus Integrated Analysis
Environment for Scientific Computing and Data Analysis:
https://store.enthought.com/downloads/#default
2) R for Statistics. Download at http://www.r-project.org/
3) Datasets from the UCI Machine Learning Repository at
http://archive.ics.uci.edu/ml/datasets.html
Course Information: Data mining is a broad area that integrates techniques from several
fields including machine learning, statistics, pattern recognition, artificial intelligence, and
database systems, for the analysis of large volumes of data. This course gives a wide
exposition of these techniques and their software tools. This course will illustrate data mining
process and how the technology works with sample live applications of data mining. The
following topics will be covered:
o
o
o
o
o
o
o
o
o
Basic concepts, applications and trends in data mining
Relationship between data mining, data warehouse, and query tools
Data preparation for the data mining process
Model building, algorithms and technology:
Supervised learning:
Classification and Prediction (Decision Trees & Bayesian Networks)
Unsupervised learning
Clustering (k-means and hierarchical clustering)
Evaluation of the data mining model; comparisons of different data mining
models
o Visualization using Data Mining
Data mining aims at finding useful regularities in large data sets. Interest in the field is
motivated by the growth of computerized data collections which are routinely kept by many
organizations and commercial enterprises, and by the high potential value of patterns
discovered in those collections.
Course Objectives:
Students who successfully complete this course will gain a strategic and analytical toolkit
essential to Data Mining. Students will be able to:
 Analyze large sets of data and uncover patterns within the data
 Use mathematical algorithms to uncover patterns in both web and regular data
 Predict data based on the patterns discovered previously
 Use tools and statistical analysis to analyze data
 Describe legal, ethical and public relations implications of data mining
Homework and Lab work
This course requires significant research. You will be completing the following tasks:
 Attendance & Preparation: Depending on the material, each week, you will be
completing a lab, a quiz, or answering a short question in addition to the weekly
discussion board post. Your attendance and participation will be measured by the
quality of your answers. Answers are due by 11:59 pm CST each Tuesday
evening of the week.
 Homework Assignments: There are several complex, work-intensive homework
assignments, which consist of theoretical/ mathematical or programming
exercises. Homework must be posted by 11:59 pm CST on Tuesday evening.
 Final Project: The final project requires pursues an interesting question about a
dataset which you have found (there are numerous sources on the internet). You
will draft the project in stages and apply everything that you learn in this course,
including preprocessing (with Excel, R, or Python), algorithm selection and
application, visualization, and interpretation. The full description for the Final
Project will be announced appropriately.
Tests and Exams: Tests, quizzes and exams must be taken on the announced dates and
times.
Grading System: Your grade will be based upon the following;
Attendance/Discussion
Project
Quizzes
Assignments
Exams
Total
10%
15%
20%
25%
30%
100%
The final grading scale will be no stricter than:
93% >= A, 83% >= B, 73% >= C, 60% >= D
ONU's Disability Policy
Disability Support Services
It is the policy of Olivet Nazarene University to accommodate students with disabilities
in accordance with federal and state laws. Undergraduate students with documented
disabilities should notify Dr. Sue Rattin, assessment and learning support services
director (Burke 117), to request course accommodations. The Learning Development
Center (Benner 015) will coordinate accommodations for students with disabilities who
are approved for services.
Academic Coaching Center (ACC)
For courses covered by the ACC:
Students who are having academic difficulty are encouraged to contact the professor first.
If additional academic help is needed, students may visit the Academic Coaching Center
(ACC), located in the lower level of Ludwig. Peer coaches are available to help students
one on one or in small groups. When visiting the ACC, students should bring their
textbooks, notes and any other materials needed to help understand the course
assignment. The ACC is open Sunday through Thursday from 4 p.m. to 10 p.m.
If you need help with study skills or writing, professional staff members are available to
assist you at the ACC.
For courses not covered by the ACC:
Students who are having academic difficulty are encouraged to contact the professor first.
If additional help is needed, please contact the Center for Student Success at 815-9295665 to schedule an appointment. The Center provides assistance with time management,
test taking skills, note taking help, goal setting and other skills necessary to help students
succeed academically.
If you need help with study skills or writing, professional staff members are available to
assist you at the ACC.