Download Knowledge-Driven Business Intelligence Systems: Part II

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Fuzzy concept wikipedia , lookup

Knowledge representation and reasoning wikipedia , lookup

Time series wikipedia , lookup

Philosophy of artificial intelligence wikipedia , lookup

Fuzzy logic wikipedia , lookup

Ethics of artificial intelligence wikipedia , lookup

Expert system wikipedia , lookup

AI winter wikipedia , lookup

Intelligence explosion wikipedia , lookup

Incomplete Nature wikipedia , lookup

History of artificial intelligence wikipedia , lookup

Existential risk from artificial general intelligence wikipedia , lookup

Structural equation modeling wikipedia , lookup

Transcript
Knowledge-Driven Business
Intelligence Systems: Part II
Week 11
Dr. Jocelyn San Pedro
School of Information Management &
Systems
Monash University
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
Lecture Outline
ƒ Data Mining Technologies
ƒ Neural Networks
ƒ Genetic Algorithms
ƒ Fuzzy Logic
ƒ Decision Trees
ƒ Data Visualisation
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
2
Learning Objectives
At the end of this lecture, the students will
ƒ Gain some understanding of data mining
technologies (decision trees, neural networks,
genetic algorithms, and fuzzy logic) that are
commonly used in data mining techniques
ƒ Preview some visualisation tools and gain an
understanding of how they support business
decision making
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
3
1
Data Mining Technologies
1960s – classical statistical analysis
ƒ Correlation, regression, chi-square, cross-tabulation
1980s – classical statistical analysis augmented by more
powerful set of soft computing techniques
ƒ neural networks, genetic algorithms, fuzzy logic, decision
trees
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
4
Soft Computing
Emerging discipline that combines
computational methods for dealing with
inexact, approximate reasoning approaches
ƒ simulating the brain-way of solving problems neural networks
ƒ evolving solutions - genetic algorithms
ƒ dealing with logical ambiguity - fuzzy logic
ƒ representing effect of each event, or decision, on
successive events – decision trees
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
5
Neural Networks
ƒ Attempt to mirror the way human brain works in recognizing
patterns by developing mathematical structures with the
ability to learn (Marakas, 2002)
ƒ Attempt to “learn” patterns from data directly, by sifting data
repeatedly, searching for relationships, automatically
building models, and correcting over and over again the
model’s own mistakes – (Dhar and Stein, 1997)
ƒ Good at modelling poorly understood problems for which
sufficient data can be collected
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
6
2
Artificial Neural Nets (ANNs)
simple computer programs that build models from
data by trial and error
“Learning from Experience”
ƒ Present a piece of data to a neural network
ƒ The net predicts an output
ƒ The net compares is guess to the actual correct value (also
presented to the network)
ƒ If ANN guess is right, the net does nothing
ƒ If ANN guess is wrong, net figures out how to adjust some
internal parameters so that it can make better prediction if it
sees similar data again in future
ƒ Over time, the ANN begins to converge on a fairly accurate
model of the process
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
7
Artificial Neural Nets (ANNs)
Network Topology- The number of layers and units in each layer
and a way in which the units are connected together.
3 basic layers:
The input layer receives the data
1. The internal or hidden layer processes the data.
2. The output layer relays the final result of the net.
Output Layer
Guesses
Hidden Layer
Processing
Input Layer
Data Input
Marakas,
G.M.
(2002) Decision
support
systems –inSEM
the 21st
Century. 2nd Ed, Prentice Hall 8
IMS3001 –
BUSINESS
INTELLIGENCE
SYSTEMS
1 , 2004
Artificial Neural Nets (ANNs)
Make initial guess based on
current weight settings and
inputs
Calculate error
with associated
output
Determine the amount
and direction of individual
weight adjustment
Training the ANN adjusting neural network
weights. During training the
network analyses the data
you have provided and
changes weights between
network units to reflect
dependencies found in your
data.
Adjust individual
weights according
to calculations
Calculate error/adjust
weights for each node in
hidden layer
Marakas,
G.M.
(2002) Decision
support
systems –inSEM
the 21st
Century. 2nd Ed, Prentice Hall 9
IMS3001 –
BUSINESS
INTELLIGENCE
SYSTEMS
1 , 2004
3
Artificial Neural Nets (ANNs)
Testing is a process of estimating quality of the trained neural
network. During this process a part of data that wasn't used
during training is presented to the trained network case by
case. Then forecasting error is measured on each case and used
as the estimation of network quality.
Preparing the ANN in Alyuda Forecaster – www.alyuda.com
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
10
Artificial Neural Nets (ANNs)
ƒ Effective in problems of image recognition
ƒ Not suited well for, say, financial or serious medical
applications.
ƒ highly intricate systems - include dozens of neurons with
a couple hundred connections between them
ƒ non-transparency of forecasting models represented by a
trained neural network
ƒ knowledge reflected in terms of weights of a couple
hundred intraneural connections cannot be analysed and
interpreted by a human.
ƒ Despite of these difficulties neural networks are actively
used (with varying success) in different financial
applications in the majority of developed countries.
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
11
ANN Applications – Alyuda
Forecaster
ƒ Credit Approval - determine risk of granting a loan to an
applicant
¾ Classify applicant as either LOW risk, HIGH risk
¾ Guide decision in granting or denying new loans
ƒ Employee retention- identify potential employees who are
likely to stay with the organization during the next year
based on previous year data
¾ Classify employee’s retention probability as LOW or
HIGH probability
¾ Identify employees who intend to leave and take the
appropriate measures to retain them.
www.alyuda.com
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
12
4
ANN Applications – Alyuda
Forecaster
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
13
ANN Applications – Alyuda
Forecaster
ƒ Gas consumption - forecast gas consumption by a power
plant.
ƒ Sales forecasting - forecast weekly sales of a small restaurant
chain using the historical data over 109 weeks period
ƒ Stock prediction - forecast the percentage of the Close price
change for Chevron Corp 4 days in advance
www.alyuda.com
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
14
Data Mining Technologies
Genetic Algorithms
ƒ Recognise a good solution, spreads some of that solution’s
features into a population of competing solutions, and
“breeds” good solutions
ƒ Powerful technique for solving various combinatorial or
optimisation problems
ƒ Sample Genetic algorithm online demos
http://math.hws.edu/xJava/GA/
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
15
5
Genetic Algorithm
ƒ First a population of possible solutions to a problem are
developed.
ƒ Next, the better solutions are recombined with each other to
form some new solutions.
ƒ Finally the new solutions are used to replace the poorer of
the original solutions and the process is repeated.
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
16
Genetic Algorithm - Example
Selecting a fixed number of market parameters influencing the
market performance the most
ƒ names of these parameters comprise a descriptive set or a set
of chromosomes determining qualities of an "organism" - a
solution of the problem
ƒ Values of parameters determining a solution correspond to
genes
ƒ A search for the optimal solution is similar then to the
process of evolution of a population of organisms, where
each organism is represented by a set of its chromosomes.
http://www.megaputer.com/dm/systems.php3#stat_package
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
17
Genetic Algorithm - Example
The process of evolution of population of organisms is driven
by three mechanisms:
ƒ selection of the strongest – or survival of the fittest those sets
of chromosomes that characterise the most optimal solutions
ƒ cross-breeding - production of new organisms by mixing sets
of chromosomes of parent sets of chromosome
ƒ mutations - accidental changes of genes in some organisms
of the population.
ƒ After a number of new generations built with the help of the
described mechanisms one obtains a solution that cannot be
improved any further. This solution is taken as a final one.
http://www.megaputer.com/dm/systems.php3#stat_package
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
18
6
Genetic Algorithms- Weak Points
ƒ The very way of formulating the problem deprives one of
any opportunity to estimate statistical significance of the
obtained solution.
ƒ Second, only a specialist can develop a criterion for the
chromosome selection and formulate the problem
effectively.
ƒ Thus genetic algorithms should be considered at present
more as an instrument for scientific research rather than as a
tool for generic practical data analysis, for instance, in
finance.
http://www.megaputer.com/dm/systems.php3#stat_package
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
19
Fuzzy Logic
ƒ Our language is full of vague and imprecise concepts, and
allows for conveyance of meaning through semantic
approximations
ƒ These approximations are useful to humans, but do not
readily lend themselves to the rule-based reasoning done on
computers.
ƒ Use of fuzzy logic is how computers handle this ambiguity
ƒ Allows for partial or “fuzzy” description of rules
Marakas,
G.M.
(2002) Decision
support
systems –inSEM
the 21st
Century. 2nd Ed, Prentice Hall 20
IMS3001 –
BUSINESS
INTELLIGENCE
SYSTEMS
1 , 2004
The Basics of Fuzzy Logic
ƒ In a “crisp” rule, the result is either false (0) or true (1) and
can be stored in a binary fashion.
ƒ In a “fuzzy” rule, the result ranges from 0 (absolutely false)
to 1 (absolutely true), with stops in between.
ƒ absolutely false, slightly false, slightly true, absolutely
true
ƒ slightly similar, similar, very similar
ƒ These operations utilise functions that assign a degree of
“membership” in a set.
ƒ Degree of similarity of current data to historical data is
0.75
Marakas,
G.M.
(2002) Decision
support
systems –inSEM
the 21st
Century. 2nd Ed, Prentice Hall 21
IMS3001 –
BUSINESS
INTELLIGENCE
SYSTEMS
1 , 2004
7
Membership Function Example
1.00
Degree of
0.50
Tallness
0.00
0
1
2
3
4
5
6
7
8
9
10
Height in Feet
The “Tallness” function takes a person’s height and converts it
to a numerical scale from 0 to 1.
Here the statement “He is Tall” is absolutely false for heights
below 5 feet and absolutely true for heights above 7 feet
Marakas,
G.M.
(2002) Decision
support
systems –inSEM
the 21st
Century. 2nd Ed, Prentice Hall 22
IMS3001 –
BUSINESS
INTELLIGENCE
SYSTEMS
1 , 2004
Inferencing using Fuzzy Rules
Example
“Well if you’ve got a high margin, price sensitive
product, promoting that product via ads, displays,
etc. is likely to have a high impact on sales volume.
If the volume impact is high, it’s a good candidate
for allocation of promotion dollars.
But you also want to promote products more
heavily when they’re relatively new in order to
increase market awareness and to establish market
share…”
Dhar, V. and Stein, R. (1997)
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
23
Inferencing using Fuzzy Rules
One fuzzy rule: If product is new, then a client should
spend more money promoting it
new-product-rule
Product is NEW
THEN
Promotion should be HIGH
Dhar, V. and Stein, R. (1997)
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
24
8
Inferencing using Fuzzy Rules
µ - Degree of
Membership in the fuzzy
set NEW
µ
1
0.3
0
0
235
365
Days since product was introduced
Dhar, V. and Stein, R. (1997)
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
25
Inferencing using Fuzzy Rules
Promotion expense that is 2%
of sales is absolutely LOW
The
degree of
“Lowness”
of
Promotion
expense
that is
2.9% of
sales is
0.75.
PROMOTION
1
0.75
Low
Medium
High
0
0
3
5
8
Expense as a percentage of sales
15
Dhar, V. and Stein, R. (1997)
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
26
Inferencing using Fuzzy Rules
Price Sensitivity
1
0.4
(ratio of % change in volume per change in price)Price
sensitivity
is 0.4 LOW
or 0.1
Medium
Low
Medium
High
0.1
0
0
Input
1
2
3
4
Dhar, V. and Stein, R. (1997)
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
5
Take Max
value or
Fuzzy Set
Union:
Price
sensitivity
is 0.4 LOW
27
9
Inferencing using Fuzzy Rules
Other fuzzy rules:
ƒ If product is NEW, then a client should spend MORE money
promoting it
ƒ If the price sensitivity of product is LOW, then promotion
should be LOW
ƒ If the price sensitivity of product is MEDIUM, then
promotion should be MEDIUM
ƒ If the price sensitivity of product is HIGH, then promotion
should be HIGH
Dhar, V. and Stein, R. (1997)
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
28
Fuzzy Systems
Some Advantages
ƒ Great in dealing with qualitative data, as well as object
attribute
ƒ Offers an attractive trade-off between accuracy and
compactness – express relationships in terms of simple rules
ƒ Not computationally expensive – compared to “crisp” rulebased systems
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
29
Fuzzy Systems
Some Disadvantages
ƒ Saturation of fuzzy sets – fuzzy sets get so full of inferences
that the consequent fuzzy regions are overloaded > system
loses the information provided by the fuzzy rules
ƒ Needs domain expertise to setup fuzzy sets
ƒ Only provides approximation to human reasoning
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
30
10
Notes on Decision Trees
CART – Classification and Regression Trees
ƒ Most common decision tree, statistical analysis data mining
tool
ƒ automatically searches for and finds high performance
classification and prediction
ƒ key elements are a set of rules for:
ƒ splitting each node in a tree;
ƒ deciding when a tree is complete; and
ƒ assigning each terminal node to a class outcome (or
predicted value for regression)
ƒ More info and software demo on http://www.salfordsystems.com/
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
31
Data Visualisation
For any kind of high dimensional data set, displaying predictive
relationships is a challenge.
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
32
http://www.sapdesignguild.org/editions/edition2/info_zoom.asp
Human Visual Perception and
Data Visualisation
ƒ Data visualisation is so powerful because the human visual
cortex converts objects into information so quickly.
ƒ The next three slides show (1) usage of global private
networks, (2) flow through natural gas pipelines, and (3) a
risk analysis report that permits the user to draw an
interactive yield curve.
ƒ All three use height or shading to add additional dimensions
to the figure.
Marakas,
G.M.
(2002) Decision
support
systems –inSEM
the 21st
Century. 2nd Ed, Prentice Hall 33
IMS3001 –
BUSINESS
INTELLIGENCE
SYSTEMS
1 , 2004
11
Global Private Network Activity
High Activity
Low Activity
Marakas,
G.M.
(2002) Decision
support
systems –inSEM
the 21st
Century. 2nd Ed, Prentice Hall 34
IMS3001 –
BUSINESS
INTELLIGENCE
SYSTEMS
1 , 2004
Natural Gas Pipeline Analysis
Note: Height shows total flow through compressor stations.
Marakas,
G.M.
(2002) Decision
support
systems –inSEM
the 21st
Century. 2nd Ed, Prentice Hall 35
IMS3001 –
BUSINESS
INTELLIGENCE
SYSTEMS
1 , 2004
An “Enlivened”
Enlivened” Risk Analysis
Report
Marakas,
G.M.
(2002) Decision
support
systems –inSEM
the 21st
Century. 2nd Ed, Prentice Hall 36
IMS3001 –
BUSINESS
INTELLIGENCE
SYSTEMS
1 , 2004
12
Telephone Polling Results
Note: On the “live” map, clicking on an area allows the user
to drill down and see results for smaller areas.
Marakas,
G.M.
(2002) Decision
support
systems –inSEM
the 21st
Century. 2nd Ed, Prentice Hall 37
IMS3001 –
BUSINESS
INTELLIGENCE
SYSTEMS
1 , 2004
References
Dhar, V. and Stein, R. (1997) Intelligent decision Support
Methods: the Science of Knowledge Work, Prentice Hall.
Dhar, V. and Stein, R. (1997) Seven methods for transforming
corporate data into business intelligence.
Marakas, G.M. (2002) Decision support systems in the 21st
Century. 2nd Ed, Prentice Hall (or other editions)
Power, D. (2002) Decision Support Systems: Concepts and
Resources for Managers, Quorum Books.
***********
Good Online resource on fuzzy sets and operations
http://www.doc.ic.ac.uk/~nd/surprise_96/journal/vol4/sbaa/r
eport.fuzzysets.html
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
38
Questions?
[email protected]
School of Information Management and Systems, Monash
University
T1.28, T Block, Caulfield Campus
9903 2735
IMS3001 – BUSINESS INTELLIGENCE SYSTEMS – SEM 1 , 2004
39
13