Download Information Surveillance Management Systems

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Outlier Detection
Sanjay Chawla
University of Sydney
Data Mining
• Last few years - mainly focused on
different aspects of outlier detection.
• I work with a slightly broader definition of
outlier detection:
– Conventional unsupervised (e.g., distance
based outliers etc.)
– Imbalanced Classification Problem
– Rare Patterns (e.g., High Confidence & Low
Support rules)
Context
• Lot of my work is in the context of my interaction
with the insurance industry in Australia.
• However can be abstracted to a more general
framework and then applied to
– “network security”;
– fraud detection;
– “homeland security” applications
– Information surveillance management systems
Information Surveillance
Missing:
Privacy
Piece???
Correlation Mining &
Domain Knowledge
Felligi Sunter;
Uncertainty
DB;
Blocking
Imbalanced Classification
(e.g., Logistic Regression)
What is an outlier?
Different definitions will result in different outliers
Interesting Abstract Problem:
Manifold Outliers
non-outlier
non-outlier
outlier
outlier
PCA
Kernel PCA ??
Related documents