Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Visualization and Data Mining Daniel A. Keim Professor and Head of Data Mining and Information Visualization University of Constance 78457 Konstanz, Germany [email protected] Comments • Tight Integration of Data Mining and Visualization Automatic Data Mining for Data Preprocessing Using Visualization to Steer Automatic Data Analysis Automatic Analysis Techniques for selecting the visualization • Important Challenges - Business Analytics: CRM, Marketing, Finance, … - Network Analytics: Monitoring, Security, … many involve GIS differences to NVAC ? May 2, 2005 My Name, title and AffiliationDaniel Keim, University of Constance 2 Tight Integration Data Data Visualization of DM-Algorithm the result DM-Algorithm step 1 DM-Algorithm step n DM-Algorithm Result Visualization of the result Result Result Knowledge Knowledge Knowledge Preceding Visualization Subsequent Visualization May 2, 2005 Visualization + Interaction Visualization of the data Data Tightly integrated Visualization 3 Tightly Integrated Visualization Visualization of algorithmic decisions Data DM-Algorithm step n Result Knowledge May 2, 2005 Visualization + Interaction DM-Algorithm step 1 Data and patterns are better understood User can make decisions based on perception User can make decisions based on domain knowledge Visualization of result enables user specified feedback for next algorithmic run 4 Visual Classification [AEK 00] attr.1 attr.2 ... class attr. 1 class attr. 2 class 0.2 Y 0.5 N 2.4 2.0 ... N 0.3 Y 1.3 Y 0.3 Y 2.0 N 0.5 N 2.5 Y 1.1 Y 5.1 N ... Y ... ... ... 23.3 ... 0.3 ... ... ... ... ... Each attribute is sorted and visualized separately Each attribute value is mapped onto a unique pixel The color of a pixel is determined by the class label of the object The order is reflected by the arrangement of the pixels May 2, 2005 5 Visual Classification A New Visualization of a Decision Tree age < 35 G Salary < 40 [40,80] G May 2, 2005 V > 80 P 6 Example of Tight Integration: Visual Classification Level 1 Level 2 ... leaf split point inherited split point Level 18 May 2, 2005 7 Geo-related information: Learning from History Computer generated Cartograms Presidential Election 2000 Results Bush – Gore May 2, 2005 9 Rectangular Cartograms: 2004 Election Results May 2, 2005 10