Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Outlier Detection Sanjay Chawla University of Sydney Data Mining • Last few years - mainly focused on different aspects of outlier detection. • I work with a slightly broader definition of outlier detection: – Conventional unsupervised (e.g., distance based outliers etc.) – Imbalanced Classification Problem – Rare Patterns (e.g., High Confidence & Low Support rules) Context • Lot of my work is in the context of my interaction with the insurance industry in Australia. • However can be abstracted to a more general framework and then applied to – “network security”; – fraud detection; – “homeland security” applications – Information surveillance management systems Information Surveillance Missing: Privacy Piece??? Correlation Mining & Domain Knowledge Felligi Sunter; Uncertainty DB; Blocking Imbalanced Classification (e.g., Logistic Regression) What is an outlier? Different definitions will result in different outliers Interesting Abstract Problem: Manifold Outliers non-outlier non-outlier outlier outlier PCA Kernel PCA ??