SDenPeak: Semi-Supervised Nonlinear Clustering based on Density and Distance

被引:6
作者
Fan, Wen-Qi [1 ]
Wang, Chang-Dong [1 ]
Lai, Jian-Huang [2 ]
机构
[1] Sun Yat Sen Univ, Sch Mobile Informat Engn, Zhuhai 519082, Peoples R China
[2] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou 510006, Guangdong, Peoples R China
来源
PROCEEDINGS 2016 IEEE SECOND INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2016) | 2016年
关键词
Semi-supervised clustering; constrained clustering; density-based clustering; distance-based clustering;
D O I
10.1109/BigDataService.2016.43
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering by fast search and find of Density Peaks termed DenPeak is the latest and the most popular development of unsupervised clustering that combines both density and distance. However, it suffers from significantly inaccurate performance when there is large diversity of density in different clusters in completely unsupervised. Despite a highly improved performance in semi-supervised clustering, there has been no works to incorporate supervision into DenPeak by using only a few pairwise must-link and cannot-link constraints. To address this problem, we propose a semi-supervised framework for DenPeak, namely SDenPeak, by integrating pairwise constraints to guide the clustering procedure. Experimental results confirm that our algorithm is simple but quite effective in generating satisfactory results on targeting real datasets.
引用
收藏
页码:269 / 275
页数:7
相关论文
共 27 条
[1]   Semi-Supervised Kernel Mean Shift Clustering [J].
Anand, Saket ;
Mittal, Sushil ;
Tuzel, Oncel ;
Meer, Peter .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (06) :1201-1215
[2]  
[Anonymous], 2004, ICML
[3]  
[Anonymous], SEMI SUPERVISED CLUS
[4]  
[Anonymous], 2005, P INT MACH LEARN C, DOI DOI 10.1145/1102351.1102424
[5]  
[Anonymous], 2004, KERNEL METHODS PATTE
[6]  
[Anonymous], P 26 INT C MACH LEAR
[7]  
[Anonymous], 2001, ICML
[8]  
Asuncion A., 2007, Uci machine learning repository
[9]   Support vector clustering [J].
Ben-Hur, A ;
Horn, D ;
Siegelmann, HT ;
Vapnik, V .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) :125-137
[10]   A Detection Method for the Resource Misuses in Information Systems [J].
Wang, Chao ;
Zhang, Gaoyu ;
Liu, Lan .
2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL II, 2010, :531-534