Robust support vector data description for outlier detection with noise or uncertain data

被引:70
作者
Chen, Guijun [1 ]
Zhang, Xueying [1 ]
Wang, Zizhong John [1 ,2 ]
Li, Fenglian [1 ]
机构
[1] Taiyuan Univ Technol, Coll Informat Engn, Taiyuan, Shanxi, Peoples R China
[2] Virginia Wesleyan Coll, Dept Math & Comp Sci, Norfolk, VA USA
基金
中国国家自然科学基金;
关键词
Outlier detection; Support vector data description; Local density; epsilon-insensitive loss; ONE-CLASS CLASSIFICATION;
D O I
10.1016/j.knosys.2015.09.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an example of one-class classification methods, support vector data description (SVDD) offers an opportunity to improve the performance of outlier detection and reduce the loss caused by outlier occurrence in many real-world applications. However, due to limited outliers, the SVDD model is built only by using the normal data. In this situation, SVDD may easily lead to over fitting when the normal data contain noise or uncertainty. This paper presents two types of new SVDD methods, named R-SVDD and epsilon NR-SVDD, which are constructed by introducing cutoff distance-based local density of each data sample and the epsilon-insensitive loss function with negative samples. We have demonstrated that the proposed methods can improve the robustness of SVDD for data with noise or uncertainty by extensive experiments on ten UCI datasets. The experimental results have shown that the proposed epsilon NR-SVDD is superior to other existing outlier detection methods in terms of the detection rate and the false alarm rate. Meanwhile, the proposed R-SVDD can also achieve a better outlier detection performance with only normal data. Finally, the proposed methods are successfully used to detect the image-based conveyor belt fault. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:129 / 137
页数:9
相关论文
共 31 条
[1]  
Blake C. L., 1998, Uci repository of machine learning databases
[2]   One-Class Classification based on searching for the problem features limits [J].
Cabral, George G. ;
Oliveira, Adriano L. I. .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (16) :7182-7199
[3]   Density weighted support vector data description [J].
Cha, Myungraee ;
Kim, Jun Seok ;
Baek, Jun-Geol .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (07) :3343-3350
[4]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[5]   Failure analysis of belt conveyor damage caused by the falling material. Part II: Application of computer metrotomography [J].
Fedorko, Gabriel ;
Molnar, Vieroslav ;
Marasova, Daniela ;
Grincova, Anna ;
Dovica, Miroslav ;
Zivcak, Jozef ;
Toth, Teodor ;
Husakova, Nikoleta .
ENGINEERING FAILURE ANALYSIS, 2013, 34 :431-442
[6]   Privacy preserving and fast decision for novelty detection using support vector data description [J].
Hu, Wenjun ;
Wang, Shitong ;
Chung, Fu-lai ;
Liu, Yong ;
Ying, Wenhao .
SOFT COMPUTING, 2015, 19 (05) :1171-1186
[7]   Asymptotic behaviors of support vector machines with Gaussian kernel [J].
Keerthi, SS ;
Lin, CJ .
NEURAL COMPUTATION, 2003, 15 (07) :1667-1689
[8]   One-class classification with Gaussian processes [J].
Kemmler, Michael ;
Rodner, Erik ;
Wacker, Esther-Sabrina ;
Denzler, Joachim .
PATTERN RECOGNITION, 2013, 46 (12) :3507-3518
[9]   One-class classification: taxonomy of study and review of techniques [J].
Khan, Shehroz S. ;
Madden, Michael G. .
KNOWLEDGE ENGINEERING REVIEW, 2014, 29 (03) :345-374
[10]   Density-based clustering [J].
Kriegel, Hans-Peter ;
Kroeger, Peer ;
Sander, Joerg ;
Zimek, Arthur .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (03) :231-240