Neighborhood outlier detection

被引:120
作者
Chen, Yumin [1 ]
Miao, Duoqian
Zhang, Hongyun
机构
[1] Xiamen Univ Technol, Dept Comp Sci & Technol, Xiamen 361024, Peoples R China
基金
中国国家自然科学基金;
关键词
Outlier detection; Rough sets; Neighborhoods; Data mining; ROUGH; REDUCTION;
D O I
10.1016/j.eswa.2010.06.040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
KNN (k nearest neighbor) is widely discussed and applied in pattern recognition and data mining, however, as a similar outlier detection method using local information for mining a new outlier, neighborhood outlier detection, few literatures are reported on. In this paper, we introduce neighborhood model as a uniform framework to understand and implement neighborhood outlier detection. Furthermore, a neighborhood-based outlier detection algorithm is also given. This algorithm integrates rough set granular technique with outlier detecting. We propose a neighborhood-based metric on outlier detection, and compare neighborhood outlier detection with DIS, KNN and RNN. The experimental results show that neighborhood-based metric is able to measure the local information for outlier detection. The detected accuracies based on neighborhood outlier detection are superior to DIS, KNN for mixed dataset, and a litter better than RNN for discrete dataset. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:8745 / 8749
页数:5
相关论文
共 26 条
[1]  
Aggarwal C. C., 2001, SIGMOD Record, V30, P37, DOI 10.1145/376284.375668
[2]  
[Anonymous], P ACM SIDMOD INT C M
[3]  
[Anonymous], P PKDD02
[4]  
Bay S.D., 1999, UCI KDD REPOSITORY
[5]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[6]  
HARKINS S, 2002, P 4 INT C DAT WAR KN, P170
[7]  
He ZY, 2005, LECT NOTES COMPUT SC, V3644, P400
[8]   Fuzzy probabilistic approximation spaces and their information measures [J].
Hu, QH ;
Yu, DR ;
Xie, ZX ;
Liu, JF .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2006, 14 (02) :191-201
[9]   Information-preserving hybrid data reduction based on fuzzy-rough techniques [J].
Hu, QH ;
Yu, DR ;
Xie, ZX .
PATTERN RECOGNITION LETTERS, 2006, 27 (05) :414-423
[10]   Data clustering: A review [J].
Jain, AK ;
Murty, MN ;
Flynn, PJ .
ACM COMPUTING SURVEYS, 1999, 31 (03) :264-323