Download SAS Enterprise Miner templates for database marketing applications

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
Enterprise Miner® templates for database marketing applications
Authors:
Federico Ambrogi, Guido Cuzzocrea, Massimo Saputo
-
NUNATAC
Abstract
Treasure island, an enormous amount of data at your disposal and the best software environment to exploit it. Data
Mining is an opportunity to gain a competitive advantage and should supply the marketing decision-makers with
operative tools for the evaluation of client potential, the planning of an integrated offer and the selection of campaign
targets.
Nevertheless data and technology are necessary but not sufficient. To successfully apply Data Mining solutions to
Database Marketing activity you need to perform a process flow that starts from goal identification and then passes
through making data available, and applying the SEMMA methodology. But this is still not enough: you need to
implement the campaign in production, considering not only the new targeting rules, but traditional criteria, crosssections and control tests.
All this comparing budgets with expected break-even per segment and averaging market penetration goals with profit
maximisation ones.
Do you feel like Indiana Jones? What if you had a detailed treasure map to get you out of the jungle alive and smiling?
Introduction
Database Marketing means searching for
information wealth within the appropriately
organised data sources of companies, and
devising efficient and measurable rules to
enable you to make the best marketing
decisions.
In other words, Database Marketing means
moving the data analysis procedure to a
context which is centred on business
problems and which is directed towards
achieving competitive commercial strategies.
An important characteristic of Database
Marketing is the role played within the
company, which is now centred on the new
listen-and-answer approach rather than the
produce-and-sell one, in relation to their
clients and the market.
This role has the following prerequisites:
• the will of management to adopt this
approach and the necessary accompanying
cultural and organisational shift of the
company;
• the availability of a dedicated data structure
(a Marketing oriented Data Warehouse);
• the possibility of operating in an analytical
environment which favours the production of
decision-making rules on an industrial level
(a Data Mining environment).
In this culturally and technologically
innovative context, what was once the
estimate of a Scoring Model, now becomes
an opportunity to implement decision-making
processes for the optimisation not of a
campaign, but the ten or hundred campaigns
expected to come out of an ambitious
marketing operation.
1. Who we are
NUNATAC was formed at the beginning of
1994 in response to a specific need in the
market. A critical success factor in numerous
business sectors now means having an
efficient marketing strategy with a
particularly strong client focus: the ability to
develop a close relationship with your clients,
based on reciprocal knowledge and the
development
of
personalised
products/services, is an essential condition for
commercial development and consolidation.
The defining characteristic of NUNATAC is
the combination of specific abilities and a
well developed know-how in the field of
Database Marketing. Our approach to project
responsibility is centred on proposing
1
solutions to needs identified by consultation
with management, systems development and
training of our clients in the methods we have
utilised.
NUNATAC is composed of a group of
professional consultants with statistical,
computing and marketing skills: since 1997,
NUNATAC is a European Quality Partner of
the SAS Institute.
2. Database Marketing Activity
2.1 Goals of the project
What are the guidelines that permit you to
usefully perform Database Marketing?
What are the necessary abilities required to
extract information wealth from the vast and
complex company databases?
To what degree can a sound methodology,
supported by an appropriate “tool box”
operating in a dedicated and easily accessible
data structure, give us consistent economic
advantages?
Finally, if the objective is to derive decisionmaking processes on an industrial level, is it
then possible to translate the successive
phases of this process into logical entities,
which are clearly identifiable and repeatable?
Nunatac’s opinion is that not only is this
approach possible, but it’s desired. Our
current activity of Database Marketing
consultants and Quality Partners of the SAS
Institute, is centred on the production of
“templates” for the realisation of Behavioural
Segmentation projects and the construction of
a Scoring System, in the Enterprise Miner™
environment.
Enterprise Miner templates
for Database Marketing applications
TM
2.2. Database Marketing cycle
Database Marketing activities are centred
around a purpose built database and follow a
precise, logical sequence of actions.
Database Marketing
Business
question
Analysis
goals
Evaluation
of results
Marketing
Database
Test and
Implementation
Data selection
and organisation
Analysis
In relation to this sequence, the Database
Marketing operation begins with the
identification of the business issue to which
an appropriate solution is required.
The business objective must therefore be
translated into analytical terms and into the
availability of the data needed to provide a
non-trivial answer to the problem.
When we possess a significant and
appropriately organised information base, we
then proceed to the actual mining phase.
The results of the analysis, after having been
validated, must subsequently be implemented
within the company’s productive process and
must be compared with traditional decisionmaking criteria.
The final and decisive phase involves the
measurement of the results obtained on the
market following the action just undertaken:
it is only through this procedure that we can
capitalise on the experience and feed the
information wealth of the database.
An example of the productive cycle
associated with the Database Marketing
operation is the construction of a Scoring
System to optimise the results of a campaign.
In this case, the objective is strictly geared
towards a specific marketing operation and
2
the result we wish to obtain is redemption
maximisation rather than that of profit or
penetration of the campaign.
In the case of Behavioural Segmentation, the
desired business objective is definitely more
ambitious: we don’t limit ourselves to
wanting to optimise results associated with a
specific marketing action, but rather to
provide the elements for a marketing strategy
and commercial planning.
The fundamental aspect we want to underline
here, is the perfect matching, in relation to the
Database Marketing cycle discussed above, in
terms of the logical sequence of the actions to
be carried out.
In the context of the activities that marketing
must carry out and of the solutions that it
should provide to the company, Behavioural
Segmentation pursues strategic objectives. To
give a few significant examples, you might
consider: the planning of differentiated
marketing policies in relation to the identified
targets, the budget assignment to the sale
network in relation to cross-selling potentials,
the delimitation of the prospect universe to
which apply a Scoring System for a specific
campaign.
In this sense, the objective of constructing a
Scoring System is to support tactical
decisions, within a particular strategic plan.
From a conceptual point of view, a Scoring
System is the fine tuning in the search of a
radio
frequency
achieved
through
Segmentation.
If, however, we have to embark on the
Database Marketing process within a
company used to working traditionally, the
Behavioural Segmentation might be a too
expensive and resource intensive project to
start with. In this case, the construction of a
Scoring System becomes a project with
which we can more rapidly measure the
comparative advantage of the new approach.
2.3 Database marketing and data Mining:
what is necessary?
The prerequisites to carry out an efficient
Database Marketing activity are:
• Availability - or training - of dedicated and
competent human resources
• Technology to access and manage the data
sources
• Dedicated data structures and an analysis
environment
• An analytical methodology
• Implementation of processes
The following diagram depicts the logic
architecture
of
the
dedicated
data
environment, of the analytical processes to
implement and the operative aspects to face,
in order to carry out the Database Marketing
activity.
Logical structure
Internal
data
Ambiente DW
External
data
Personal data
manager
Customer Analysis
Data Mart
Segmentation
Scoring System
Product Analysis
Data Mart
Market Analysis
Data Mart
Basket Analysis
P.O.S. potential evaluation
Data Mining Processes
Commercial Proposal:
target selection and product mix.
R.O.I.
Campaign
manager
In a departmental Data Warehousing view,
the dedicated data structure provides for the
loading and the integration of internal and
external data, in addition to the traditional
decision-making rules that constitute the
business knowledge resources of the
company.
The Marketing Data Warehouse, or the
Marketing Database, must contain data at
various levels of detail in order to satisfy the
different analytical needs (OLAP and Data
Mining) of different reference units for the
business (Client, Products, Points of Sale).
The Analysis Data Marts, organised on the
basis of the desired objectives that need to be
followed up by the actual analysis, depend on
this structure.
The Enterprise Miner tool has to access to
these Data Marts, and the deliverables of the
3
Data Mining step include the significant
reports necessary to make the best decisions
and the coded rules for the target selection.
2.4. Data Mining Process flow chart
Going back to the hypothesis of translating
into the Enterprise Miner repeatable process
flows for specific Database Marketing
projects, the general reference plan is the
Data Mining process subdivided into phases,
one of which is devoted to the analytical task
using the SEMMA methodology (Sample,
Explore, Modify, Model, Assess).
Data Mining process and
SEMMA methodology
Define business
problem
Evaluate
environment
Make Data
available
Mine data in cycles
Review
Explore
Sample
Implement in
production
Assess
3.1 The Behavioural Segmentation Project
A Behavioural Segmentation project involves
the following phases:
• Preparation of the elementary database for
the segmentation, considering an industry
oriented data model (e.g.: banking, insurance,
retail).
• The selection of a significant sample on
which to carry out the analysis and validate
the classification criteria.
• The analysis process to identify the
homogenous segments and, on this basis,
estimate a general classification rule to be
applied to all the clients.
• Finally, the estimated rule must be
implemented in a computing process. Such a
rule allows us to transform the chosen and
interpreted segments in a marketing vein, in a
classification
that
is
homogeneously
applicable for new clients and those acquired
in future.
Modify
Model
Developing analysis in the Enterprise Miner
environment means plotting a process flow
for the realisation of the successive phases of
a Data Mining project.
To render the Enterprise Miner Templates
usable means creating macro flows that act as
guidelines for the realisation of specific
Database Marketing projects: in other words,
the treasure map!
3. Enterprise Miner Templates
On the basis of previous logic plans, the idea
under discussion is that of providing support
for the carrying out of Database Marketing
projects on an industrial scale. Nunatac, in
accordance with the SAS Institute, has
produced, in the Enterprise Miner
environment, the first templates for
Behavioural
Segmentation
and
the
construction of a Scoring System.
3.2 The Design of a Scoring System
Even the construction of a Scoring System,
albeit with different applications and a greater
emphasis on the estimate and validation of a
model for the assignment of a score, can be
easily identified by clearly defined logical
phases.
3.2.1 Scoring System Template
Let’s look at one of our first template ideas:
an Enterprise Miner flow-chart for the
development of a Scoring System.
The template has been organised trying to
highlight the different SEMMA phases and
several points of the general data mining
methodology (Fig. 1).
Fig. 1: Scoring Template
4
Coming back to the different steps of a Data
Mining project, an Enterprise Miner
workspace can incorporate what is shown in
grey in Fig.2.
It is possible to highlight the relationship
between each phase of the Methodology and
the template:
1. The business problem definition can have
an impact on the identification of the
target variable, but it is not the phase in
which the computation is effectively done
(this is more in the modify step).
2. The environment evaluation step is not
included, but might be simplified by the
use of the template: In fact the structure
will give a clear understanding of what
the project needs in order to be
implemented and which kind of data is
appropriate.
Fig. 2: Data Mining Methodology and Template
M ake Data Available - Step 1
Define business
problem
Evaluate
environment
M ake Data Available - Step 2
Review
M ine data in cycles
Explore
Implement in Production Step 2
Sample
Implement in Production - Step 1
3. Making data available requires a massive
use of the data step, especially when a
data warehouse does not exist. Generally
this activity is done outside of the
Enterprise Miner. Once the data is in
SAS format and a structure is created for
analysis purposes (one record per
statistical unit) it is possible to start
working with the E/M Template. At this
point the analyst must define the variables
to use, create some useful new variables
or transform the existing ones, evaluate
the probability distributions, and clean
missing values.
4. The mining in cycle step can be split into
two moments. The first concentrates on
sampling and exploring the data, the
M odify
Assess
M odel
second on building and assessing the
model.
5. The template’s last sub-diagram applies
the previously generated scoring code to a
prospect file, reports on, and explores the
results.
The operative step will be demanded to the
ERP software, and the review step will be
carried out when the results for evaluation are
available.
Conclusions
The use of data Mining techniques is difficult
for non-statistical experts.
Generally
analysing business data is not easy even for
an expert in statistics without a specific
experience.
5
The SEMMA Methodology is a general
guideline for solving a Data Mining problem.
More, the data mining methodology is a
general help in setting up a complete project,
from the specification of the business
problem to the implementation in production.
The great need of standardisation, and the
integration of data mining processes in the
production cycles requires to have something
like a "data mining product" to have this
solution more easily accepted in the company
environment.
Furthermore to demonstrate that something
works is of vital importance for the diffusion
of data mining techniques.
From this the idea to analyse some real world
projects and to try to put them in an
Enterprise Miner template. The goal is to
build something in the middle between the
Data Mining and SEMMA Methodology on
one side and the real application on the other.
A template is not a ready-to-use solution for a
particular business problem. A template is a
more specific solution to a business problem
than SEMMA is.
Summarising, a template is the specification
of the Data Mining and SEMMA
Methodology for a typical business problem.*
*
NUNATAC
Via Crocefisso 5, 20122 Milano – (Italy)
Tel.+39 02 86996848 Fax. +39 02 89012074
E-mail: [email protected]; [email protected]
6