Active Semi-supervised Affinity Propagation Clustering Algorithm based on Local Outlier Factor

被引:0
作者
Qi, Lei [1 ]
Ting, Li [1 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
来源
2018 37TH CHINESE CONTROL CONFERENCE (CCC) | 2018年
关键词
Local outlier factor; active learning; pair-wise constraint; affinity propagation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering algorithm can reveal the inherent properties and laws of data through the learning of unlabeled data. However, interference data exists in some fields of different data forms, and the clustering will reduce the credibility of clustering results without processing data. This paper puts forward a semi-supervised clustering algorithm based on outlier pruning (LOF-SAP). For the outlier in the data, the local outlier factor algorithm (LOF) is used to look for them and reduce the influence of the outliers in data structure. Then, semi-supervised clustering algorithms can help find better partitions of data in the presence of side information. And then side information that is pair-wise constraint obtained with active learning is embedded in the data similarity matrix. At the last through affinity propagation clustering algorithm, clustering results obtains. This method is compared with the traditional affinity propagation (AP) clustering algorithm and the AP clustering algorithm with pair-wise constraint, and the experiment is done by using UCI database. And the proposed method can achieve better clustering performance.
引用
收藏
页码:9368 / 9373
页数:6
相关论文
共 18 条
[1]  
AGGARWAL CC, 2008, P SIAM INT C DAT MIN
[2]  
[Anonymous], 2002, FROM INSTANCE LEVEL
[3]  
[Anonymous], 2010, INT C EXH COMP GEOSP
[4]  
[Anonymous], 2009, P WORLD C ENG
[5]  
Arora P, 2017, PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), P82, DOI 10.1109/CONFLUENCE.2017.7943128
[6]  
Bin Wang, 2009, Proceedings of the 2009 Ninth IEEE International Conference on Computer and Information Technology. CIT 2009, P293, DOI 10.1109/CIT.2009.107
[7]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[8]   Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[9]  
Gal Y., COMPUTER VISION PATT
[10]   Self-training-based face recognition using semi-supervised linear discriminant analysis and affinity propagation [J].
Gan, Haitao ;
Sang, Nong ;
Huang, Rui .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2014, 31 (01) :1-6