Download Case Studies

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Cluster analysis wikipedia , lookup

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
Data Mining
Techniques for CRM
Paul J.C. Chang
Eneida Lau
Ximena Salazar
Lester Arellano
José-Pablo González
Edith Quispe
Data Mining in CRM ...
“ ...through data mining – the extraction of hidden
predictive information from large databases –
organizations can identify valuable customers, predict
future behaviors, and enable firms to make proactive,
knowledge-driven decisions.”
Agenda
 Introduction, Definition: Paul
 The Evolution & Apps. of Data Mining: Eneida
 Internal Considerations & Data mining techniques: Ximena
 Data mining and CRM – relationship & customer privacy: Lester
 Case Studies (Neural Networks, CHAID): JPG
 CHAID vs neural nets; Conclusions: Edith
Introduction
 Product-oriented view VS. Customer-oriented view
 Design-build-sell VS. sell-build-redesign
 One-on-one marketing VS. mass marketing
 Goal of revolution: Establish a long term
relationship with each customer
 The advent of the Internet and technological tools
accelerate modern CRM revolution
 CRM is important for B2C or C2B, and even more
crucial in B2B environments
Why Data Mining?
 Between businesses and customers…
 Collecting customer demographics and behavior data
makes precision targeting possible
 Helps to devise an effective promotion plan when
new products developed
 Creates and solidifies close customer relationships
Between businesses…
 Helps to smooth transactions, communications and
collaboration
 Simplifies and improves logistics and procurement
process
What is Data Mining?
 “…a sophisticated data search capability that uses
statistical algorithms to discover patterns and correlations
in data.”
 “…another way to find meaning in data.”
 Data mining is part of a larger process called knowledge
discovery
What Data Mining is ~NOT~
• Data mining software does not eliminate the
need to know the business, understand the
data, or be aware of general statistical
methods.
• DM does not find patterns or knowledge
without verification
• DM helps to generate hypotheses, but it does
not validate the hypotheses
Evolutionary Stages of Data Mining
Data
Collection
(1960’s)
Data
Access
(1980’s)
Data
Navigation
(1990’s)
Data
Mining
(2000’s)
•Retrospective,
static data delivery
•Retrospective,
dynamic data delivery
at record level
•Retrospective,
dynamic data delivery
at multiple level
•Retrospective,
Proactive information
delivery
•Summations or
averages
•Branch sales at
specific period of time
•Global view or drill
down
•Online analytic tools,
feedback and
information exchange
•RDBMS, SQL, ODBC
•OLAP,
multidimensional
databases, data
warehouses
•Computers, tapes,
disks
•IBM, CDC
•Oracle, Sybase,
Informix, IBM,
Microsoft
•Pilot, IRI, Arbor,
Redbrick
•Adv. Algorithms,
multiprocessor,
computers, massive
databases
•Lockheed, IBM, SGI
Breakdown of Data Mining from a Process
Orientation
Data Mining
Discovery
•Conditional
Logic
•Affinities and
Associations
•Trends and
Variations
Predictive
Modeling
Forensic
Analysis
•Outcome
Prediction
•Deviation
Detection
•Forecasting
•Link Analysis
Applications of Data Mining
Retail
1. Performing
basket analysis
2. Sales forecasting
3. Database
marketing
4. Merchandise
planning and
allocation
Banking
1. Card marketing
2. Cardholder
pricing and
profitability
3. Fraud detection
4. Predictive lifecycle
management
Telecommunications
1. Call detail record
analysis
2. Customer loyalty
OTHER APPLICATIONS
Customer
Segmentation
Discrete
segments by
adding variables
Manufacturing
Customize
Products.
Predict features
Warranties
No. clients who
will ask for claims
Frequent flier
incentives
Identify groups
who can receive
incentives
INTERNAL CONSIDERATIONS
Data mining
Decision-making process
Skillsets and technologies must be available to integrate them
Knowledge
gained
through DM
•
•
•
•
Sell to and service customers
Manage inventory
Supervise employees
Work to correct and prevent loss
-An algorithm for scoring
-A score for particular customer,
employee
-An action associated with a customer,
employee or transaction
DATA MINING TECHNIQUES
Nearest
Neighbor
Data Retained
DM
Approaches
Data distilled
Case-Based
Reasoning
Logical
Numeric and
Non-numeric
Cross
Tabulational
Non-numeric
Data
Equational
They are applied to tasks of predictive
modeling
and forensic analysis
They extract patterns and then use for various purposes
Numeric
Data
CUSTOMER RELATION MANAGEMENT
Definition
•
•
•
•
Know
Target
Sell
Service
1
2 Stage
Concept
2
CRM: Development
of the offer
3 Which’s
- From product to customer orientation
- Market Strategy from outside-in
-Push the development of customer orientation
-Innovating value proposition
Components of CRM
Customer
Information
Data
Warehouse
Analyze the
Data
Campaign
Execution &
Tracking
Internal
Customer
Data
Customer
Outside
Data
Source Data
Historical
Data
•Billing
Records
•Surveys
•Web logs,
•External
Credit Card
data
sources
records
Current
Address, Web
page viewing
profiles.
Data Mining Techniques +
Customer Oriented
Interactions between
MKT, information,
Tech and sales
channels
Data Mining & CRM
• The Relationship
– Customer Life
Cycle
• Prospects
• Respondents
• Active Customers
• Former Customers
Data
Mining
Inputs
What
information is
available
Output
What is likely
to be
interested
Data Mining & CRM
• Inputs
– Prospects Data Warehouse in other industries
– Click Stream Information
• Market Data Intelligence
– DM can predict behavior of customer (CLC) and match it with
any market event (a,i. I pod nano)
• Data Mining and Customer Privacy
– Privacy Bill of Rights, Independent verification of security
policies.
– Create an anonymous architecture for handling customer info.
Case Studies
Neural Networks vs. CHAID
Case #1
Neural Networks
Neural Networks
• The exact way in which
the brain enables
thought is one of the
great mysteries of
science
Neurons
NeoVistas Solutions’ Decision Series
• For retail, insurance, telecommunications, and
healthcare.
• Includes discovery tools based on neural networks,
clustering, genetic algorithms, and association rules
The problem
•
•
•
•
Large retailer
Over $1 billion in sales
Overstocked on slow-moving products
Under-stocked on most popular items at
critical selling periods.
Solution
• With Clustering and and NN:
– Review point-of-sale history and equate store
groupings to sales patterns.
– Forecast stocking requirements on a store-bystore basis.
Results
• Management is able to forecast seasonal
trends at the store-item level.
• The Decision Series tools showed that
clustering similar items into actionable groups
streamlined the ordering process.
• Revenues increased by 11.6%
Case #2
CHAID
Applied Metrix
• Uses a combination of CHAID segmentation
and logistic regression response probability
modeling to establish predictive models that
are deployed over a proprietary Internet
system
The problem
• Home equity marketer that extended home
equity lines of credit at the national level.
• The client’s goal was to increase the
efficiency of targeting current mortgage
customers who might be interested in the
client’s service.
The Solution
• CHAID identified 16
distinct market
segments.
• In particular, one
particular segment
accounted for 65% of
responses to the
mailing.
Results
• The highest-rated group from the predictive model
had by far the highest response rate to the equity
line of credit campaign—85% above average for the
direct mailing,
• The goal of the program was a 10% increase in
response rate, but the actual response rate
increased 30%.
• The firm was able to increase profits by over one
million dollars in the first year after implementation.
CHAID vs. Neural Networks
Clarity and explicability
- CHAID model is understandable as a set of rules
- Neural Network is obscure
Implementation/integration
- The CHAID model is much easier to be implemented
that a Neural Network.
- The risk of missing code by an IT department is slim
for a CHAID model and higher for a Neural Network.
Data Requirements
- The data for both techniques requires some preprocessing.
- Neural Network require the data be transformed into
binary format.
Accuracy of model
- Neural Networks provide more accurate models,
especially for complex problems.
Construction of model
- CHAID is easier and quicker to construct.
- Neural Networks have many parameters that must
be set and require more skilled manipulation.
Cost
- Building a Neural Network is more costly then
building a CHAID model.
Aplications
- CHAID and Neural Networks can create predictive
models.
- Neural Networks can handle both categorical and
continuous independent variables, but these have to
be transformed to 0/1 input variables.
- When all or most of the independent variables are
continuous, neural networks should perform better
than CHAID.
Aplications
- The Neural Networks and CHAID can be used to
solve sequence prediction problems.
- Neural Networks can be used to solve estimation
problems.
- CHAID provides good solutions to classification
problems, can be used for exploratory analysis and
can provide descriptive rules.
- An interesting development is the combination of
these two techniques to create “neural trees”.
CONCLUSIONS
- The choice among different options
is not as the choice to use data
mining technologies in a CRM
initiative.
- Data Mining represents the link
from the data stored over many
years through various interactions
with customers in diverse
situations, and the knowledge
necessary to be successful in
relationship marketing concepts.
CONCLUSIONS
- Through the full implementation of a CRM program,
which must include data mining, organizations foster
improved loyalty, increase the value of their
customers, and attract the right customers.
- As customers and businesses interact more
frequently, businesses will have to leverage on CRM
and related technologies to capture and analyze
massive amounts of customer information.
CONCLUSIONS
- CRM solutions focus
primarily on analyzing
consumer information for
economic benefits, and
very little touches on
ensuring privacy.