Semi-supervised outlier detection based on fuzzy rough C-means clustering

被引:70
作者
Xue, Zhenxia [1 ]
Shang, Youlin [1 ]
Feng, Aifen [1 ]
机构
[1] Henan Univ Sci & Technol, Sch Math & Stat, Luoyang, Peoples R China
关键词
Pattern recognition; Outlier detection; Semi-supervised learning; Rough sets; Fuzzy sets; C-means clustering;
D O I
10.1016/j.matcom.2010.02.007
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents a fuzzy rough semi-supervised outlier detection (FRSSOD) approach with the help of some labeled samples and fuzzy rough C-means clustering. This method introduces an objective function, which minimizes the sum squared error of clustering results and the deviation from known labeled examples as well as the number of outliers. Each cluster is represented by a center, a crisp lower approximation and a fuzzy boundary by using fuzzy rough C-means clustering and only those points located in boundary can be further discussed the possibility to be reassigned as outliers. As a result, this method can obtain better clustering results for normal points and better accuracy for outlier detection. Experiment results show that the proposed method, on average, keep, or improve the detection precision and reduce false alarm rate as well as reduce the number of candidate outliers to be discussed. (C) 2010 IMACS. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:1911 / 1921
页数:11
相关论文
共 21 条
[21]  
Zhang Daoqiang., 2002, P 2002 INT C CONTROL, P123, DOI DOI 10.1109/ICCA.4132002.1229535