A comprehensive survey of numeric and symbolic outlier mining techniques

被引:87
作者
Agyemang, Malik [1 ]
Barker, Ken [1 ]
Alhajj, Rada [1 ]
机构
[1] Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada
关键词
symbolic; rule-based; distance-based; depth-based; distribution-based; outliers; interestingness; unexpectedness; taxonomy; web-based; exception patterns;
D O I
10.3233/IDA-2006-10604
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data that appear to have different characteristics than the rest of the population are called outliers. Identifying outliers from huge data repositories is a very complex task called outlier mining. Outlier mining has been akin to finding needles in a haystack. However, outlier mining has a number of practical applications in areas such as fraud detection, network intrusion detection, and identification of competitor and emerging business trends in e-commerce. This survey discuses practical applications of outlier mining, and provides a taxonomy for categorizing related mining techniques. A comprehensive review of these techniques with their advantages and disadvantages along with some current research issues are provided.
引用
收藏
页码:521 / 538
页数:18
相关论文
共 79 条
[1]   ROR: Rejection of outliers by rotations [J].
Adam, A ;
Rivlin, E ;
Shimshoni, I .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (01) :78-84
[2]  
Aggarwal C. C., 2001, SIGMOD Record, V30, P37, DOI 10.1145/376284.375668
[3]   An effective and efficient algorithm for high-dimensional outlier detection [J].
Aggarwal, CC ;
Yu, PS .
VLDB JOURNAL, 2005, 14 (02) :211-221
[4]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[5]   DATABASE MINING - A PERFORMANCE PERSPECTIVE [J].
AGRAWAL, R ;
IMIELINSKI, T ;
SWAMI, A .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (06) :914-925
[6]  
Agyemang M, 2005, LECT NOTES COMPUT SC, V3589, P285, DOI 10.1007/11546849_28
[7]  
Agyemang M., 2005, PROC ACM S APPL COMP, P482
[8]  
Agyemang M., 2004, PROC ACM S APPL COMP, P590
[9]  
AGYEMANG M, 2004, LSC MINE ALGORITHM M, V1, P5
[10]  
AMING A, 1996, P 2 INT C KNOWL DISC, P164