Difference between revisions of "...find outliers"

Revision as of 09:25, 7 January 2019

Anomaly detection. Sometimes the goal is to identify data points that are simply unusual. In fraud detection, for example, any highly unusual credit card spending patterns are suspect. The possible variations are so numerous and the training examples so few, that it's not feasible to learn what fraudulent activity looks like. The approach that anomaly detection takes is to simply learn what normal activity looks like (using a history non-fraudulent transactions) and identify anything that is significantly different. _______________________________________________.

Do you have > 100 features?

Yes, then try One-class Support Vector Machine (SVM)
No, need fast training, then try Principal Component Analysis (PCA)-based Anomaly Detection

Also consider...

- K-Means Clustering
- Hierarchical Clustering; Agglomerative (HAC) & Divisive (HDC)

@@ Line 17: / Line 17: @@
 Also consider...
 ** [[K-Means]] Clustering
-** [[Hierarchical Cluster Analysis (HCA)]]
+** [[Hierarchical Clustering; Agglomerative (HAC) & Divisive (HDC)]]

Difference between revisions of "...find outliers"

Revision as of 09:25, 7 January 2019

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools