Download Document

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Visualization and
Data Mining
Daniel A. Keim
Professor and Head of
Data Mining and Information Visualization
University of Constance
78457 Konstanz, Germany
[email protected]
Comments
• Tight Integration of Data Mining and Visualization
 Automatic Data Mining for Data Preprocessing
 Using Visualization to Steer Automatic Data Analysis
 Automatic Analysis Techniques for selecting the visualization
• Important Challenges
- Business Analytics: CRM, Marketing, Finance, …
- Network Analytics: Monitoring, Security, …
 many involve GIS
 differences to NVAC ?
May 2, 2005
My Name, title and AffiliationDaniel
Keim, University of Constance
2
Tight Integration
Data
Data
Visualization of
DM-Algorithm
the
result
DM-Algorithm
step 1
DM-Algorithm
step n
DM-Algorithm
Result
Visualization of
the result
Result
Result
Knowledge
Knowledge
Knowledge
Preceding
Visualization
Subsequent
Visualization
May 2, 2005
Visualization + Interaction
Visualization of
the data
Data
Tightly integrated
Visualization
3
Tightly Integrated Visualization
Visualization of algorithmic decisions
Data
DM-Algorithm
step n
Result
Knowledge
May 2, 2005
Visualization + Interaction
DM-Algorithm
step 1
 Data and patterns are better understood
 User can make decisions based on
perception
 User can make decisions based on
domain knowledge
 Visualization of result enables user
specified feedback for next algorithmic run
4
Visual Classification [AEK 00]
attr.1 attr.2
...
class
attr. 1
class
attr. 2 class
0.2
Y
0.5
N
2.4
2.0
...
N
0.3
Y
1.3
Y
0.3
Y
2.0
N
0.5
N
2.5
Y
1.1
Y
5.1
N
...
Y
...
...
...
23.3
...
0.3
...
...
...
...
...
 Each attribute is sorted and visualized separately
 Each attribute value is mapped onto a unique pixel
 The color of a pixel is determined by the class label of the object
 The order is reflected by the arrangement of the pixels
May 2, 2005
5
Visual Classification
 A New Visualization of a Decision Tree
age < 35
G
Salary
< 40
[40,80]
G
May 2, 2005
V
> 80
P
6
Example of Tight Integration:
Visual Classification
Level 1
Level 2
...
leaf
split point
inherited split point
Level 18
May 2, 2005
7
Geo-related information: Learning from History
Computer generated Cartograms
Presidential
Election 2000
Results
Bush – Gore
May 2, 2005
9
Rectangular Cartograms:
2004 Election Results
May 2, 2005
10
Related documents