Download Course Outline - Pima Community College

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Principal component analysis wikipedia , lookup

Cluster analysis wikipedia , lookup

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
Course Content
PIMA COMMUNITY COLLEGE
Effective: 200709
ITF
126
Initiator:
Campus:
Date:
Teradata Warehouse Miner (TWM)
Marty Jansen, Nancy Russell
Greg Wilson, Marcia Wojsko
Community
10/25/2006
Credit Hours:
Lecture Periods:
1.50
1.50
Description:
An introduction to the Teradata Warehouse Miner (TWM) software to construct analytic models of
data (data mining). Includes basic data mining terminology and techniques, and works through a
series of exercises using the analytical and data manipulation functions of the Teradata Warehouse
Miner software.
Student Learning Outcomes:
Upon successful completion of this course, the student will be able to:
1. Discuss data mining terminology.
2. Describe data mining techniques.
3. Analyze and manipulate data using Teradata mining functions.
4. Discuss the use of Business Analytic Templates.
5. Use statistical tests to analyze data.
Course Outline:
I.
Introduction and Data Profiling
A. Teradata Warehouse Miner Overview
B. Terms and Concepts
II.
Descriptive Statistics
1.
Data Explorer
2.
Values
3.
Frequency
4.
Statistical Analysis
5.
Scatter Plot
6.
Correlations
7.
Histogram
8.
Adaptive Histogram
9.
Overlap
III.
Data Reorganization and Manipulation
A. Analytic Data Set Generation
1.
SQL Assistant
2.
Variable Creation
3.
SQL Elements
4.
Dimensioning
5.
Variable Transformation
a.
Retain
b.
Design Coding
c.
Bin Coding
B.
C.
IV.
d.
Derivative
e.
Math Transformations
Data Reorganization
1.
Join
2.
Sample
3.
Denorm
4.
Partition
Matrix Functions
1.
Correlation
2.
Covariance
Analytic Algorithms and Advanced Usage
A. Analytic Algorithms
1.
Factor Analysis
2.
Linear Regression
3.
Decision Trees
4.
Clustering
5.
Association
6.
Scoring
B. Statistical Tests
1.
Factor Analysis
2.
Linear Regression
C. Model Manager
1.
Publish
2.
Manage