Clustering and Projected Clustering with Adaptive Neighbors

被引:860
作者
Nie, Feiping [1 ]
Wang, Xiaoqian [1 ]
Huang, Heng [1 ]
机构
[1] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
来源
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14) | 2014年
关键词
Clustering; Block diagonal similarity matrix; Adaptive neighbors; Clustering with dimensionality reduction;
D O I
10.1145/2623330.2623726
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many clustering methods partition the data groups based on the input data similarity matrix. Thus, the clustering results highly depend on the data similarity learning. Because the similarity measurement and data clustering are often conducted in two separated steps, the learned data similarity may not be the optimal one for data clustering and lead to the suboptimal results. In this paper, we propose a novel clustering model to learn the data similarity matrix and clustering structure simultaneously. Our new model learns the data similarity matrix by assigning the adaptive and optimal neighbors for each data point based on the local distances. Meanwhile, the new rank constraint is imposed to the Laplacian matrix of the data similarity matrix, such that the connected components in the resulted similarity matrix are exactly equal to the cluster number. We derive an efficient algorithm to optimize the proposed challenging problem, and show the theoretical analysis on the connections between our method and the K-means clustering, and spectral clustering. We also further extend the new clustering model for the projected clustering to handle the high-dimensional data. Extensive empirical results on both synthetic data and real-world benchmark data sets show that our new clustering methods consistently outperforms the related clustering approaches.
引用
收藏
页码:977 / 986
页数:10
相关论文
共 23 条
[1]  
[Anonymous], 2004, NIPS
[2]  
[Anonymous], 1997, C BOARD MATH SCI
[3]  
[Anonymous], 2003, NIPS
[4]  
[Anonymous], 2013, INT C MACH LEARN
[5]   Support vector clustering [J].
Ben-Hur, A ;
Horn, D ;
Siegelmann, HT ;
Vapnik, V .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) :125-137
[6]  
Boyd S., 2004, CONVEX OPTIMIZATION, VFirst, DOI DOI 10.1017/CBO9780511804441
[7]  
Cai X., 2013, IJCAI 13, P2598
[9]   NEW SPECTRAL METHODS FOR RATIO CUT PARTITIONING AND CLUSTERING [J].
HAGEN, L ;
KAHNG, AB .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1992, 11 (09) :1074-1085
[10]  
Huang J., 2013, AAAI