Download Data mining

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Cluster analysis wikipedia , lookup

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
Show Me Potential Customers
Data Mining Approach
Leila Etaati
Leila Etaati
• 10 years experience in SQL server
• PhD students in Information System Department,
Business School University of Auckland
• Lecturer and Tutor of BI and database
• System design in University of Auckland
@Leila_Etaati
[email protected]
www.rad.pasfu.com
www.linkedin.com/in/leilaetaa
ti
Agenda
Introduction to Data Mining
Introduction to Microsoft Data Mining Solution
Exploring Data Mining Algorithms
Solving Real-World Challenges with MS Data Mining (DEMO)
Descriptive Analysis
Predictive Analysis
Enhance .NET Application with Data Mining
Introduction to Data Mining
Introduction to
Data Mining
What is Data Mining (DM)?
• What is the aim of DM
Find pattern and trend in data
Prediction of likely outcomes
• How DM do that?
By employing mathematical analysis
• Which area mostly uses DM
Forecasting
Risk and probability analysis
Recommendation
Grouping
Finding Sequence
Data mining : Marketing Department
DM Types of Analysis?
Data Mining Life Cycle
Introduction to Microsoft Data Mining
Solution
Introduction to Microsoft
Data Mining Solutions
Mining Structure
Train and Test
Exploring Data Mining Algorithms
Exploring Data Mining
Algorithms
Microsoft Decision Tree Algorithm
•
•
•
A descriptive and predictive
algorithm
Accept both discrete and continues
attributes, needs a key column,
input column
It employs feature selection
technique to guide the selection
Number of Child at
home
Age
Age>40
Have 0 child
Have 1-2
Childs
Have 3 or
more Childs
Who is going
to buy the bike
Have 0 Childs
Age<=40
Have 1-2
Childs
Have 3 or
more Childs
Clustering Algorithm
Categorize items in groups with similar
attribute values
 employs K-means algorithm and Expectation
Maximization (EM)
Mostly descriptive but can be predictable
Microsoft Naïve Bayes Algorithm
•
A classification
Algorithm Uses
Bayesian technique
for categorization.
•
It is useful for
finding attributes
that effects on
generating a result,
such as finding
prospective buyers of
a
product.(descriptive)
Association Rule
Identifies
association between
attributes. One of
the most common
usage of this is to
do a market basket
analysis with this
algorithm
Time Series
For time based analysis. Such as predicting sales for next
couple of months.
Solving Real-World Challenges
with MS Data Mining (DEMO)
Solving Real-World
Challenges with
MS Data Mining (DEMO)
Buying the Bike
Bike Buyers
Number of Car at
Home
Demo: Microsoft Decision Tree, Clustering and Naïve
Bayes
Leila Etaati
Content Type
Discrete
Continuous
Cyclical
Ordered
Discretized
• data values are separate such as colour values: Red, Yellow, and Blue
• data values are continues; such as Age, or salary.
• data values are in a cyclic order, such as days of week.
• data values are in a sequential order; such as days of month.
• data values are continues, but bucketed into categories and as a result behave
as discrete.
Accuracy Charts
Lift Chart
Profit Chart
Classification
Matrix
Cross
Validation
Demo: Finding the Best Algorithm
Leila Etaati
Demo: Prediction with DMX
Leila Etaati
DEMO: Enhance .NET Application with Data Mining
CREATE MINING MODEL TravelBudgetPrediction
(
traveller_ID long KEY,
Year TEXT DISCRETE,
Quarter TEXT DISCRETE,
mode TEXT DISCRETE,
country TEXT DISCRETE,
purpose TEXT DISCRETE,
package TEXT DISCRETE,
Age TEXT DISCRETE,
Sex TEXT DISCRETE,
Duration TEXT DISCRETE,
Visits long DISCRETE,
Nights long DISCRETE,
Spend long DISCRETE
PREDICT)
USING MICROSOFT_DECISIONTREE;
DMX Code with .Net (predict the travel Budget)
Demo: Microsoft Association Rule
Leila Etaati
Summary
Introduction to Data Mining
Introduction to Microsoft Data Mining Solutions
Exploring Data Mining Algorithms
Solving Real-World Challenges with MS Data Mining (DEMO)
Descriptive Analysis
Predictive Analysis
Enhance .NET Application with Data Mining
References to Study More
 Data Mining Tutorials in MSDN:
http://technet.microsoft.com/en-us/library/bb677206.aspx
 Data Mining Algorithms in MSDN:
http://technet.microsoft.com/en-us/library/ms175595.aspx
 Data Mining with SQL Server 2008 Book:
http://www.amazon.com/Data-Mining-Microsoft-Server-2008/dp/0470277742
@Leila_Etaati
[email protected]
www.rad.pasfu.com
www.linkedin.com/in/leilaetaati
Explore Everything PASS Has to Offer
Free SQL Server and BI Web Events
Free 1-day Training Events
Regional Event
This is Community
Business Analytics Training
Local User Groups Around
the World
Session Recordings
PASS Newsletter
Free Online Technical Training