Robust Similarity Measure for Spectral Clustering Based on Shared Neighbors

被引:23
作者
Ye, Xiucai [1 ]
Sakurai, Tetsuya [1 ]
机构
[1] Univ Tsukuba, Dept Comp Sci, Ibaraki, Japan
关键词
Spectral clustering; data analysis; similarity measure; shared nearest neighbors; directed kNN graph; ALGORITHM;
D O I
10.4218/etrij.16.0115.0517
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Spectral clustering is a powerful tool for exploratory data analysis. Many existing spectral clustering algorithms typically measure the similarity by using a Gaussian kernel function or an undirected k-nearest neighbor (kNN) graph, which cannot reveal the real clusters when the data are not well separated. In this paper, to improve the spectral clustering, we consider a robust similarity measure based on the shared nearest neighbors in a directed kNN graph. We propose two novel algorithms for spectral clustering: one based on the number of shared nearest neighbors, and one based on their closeness. The proposed algorithms are able to explore the underlying similarity relationships between data points, and are robust to datasets that are not well separated. Moreover, the proposed algorithms have only one parameter, k. We evaluated the proposed algorithms using synthetic and real-world datasets. The experimental results demonstrate that the proposed algorithms not only achieve a good level of performance, they also outperform the traditional spectral clustering algorithms.
引用
收藏
页码:540 / 550
页数:11
相关论文
共 26 条
[1]   A density-based similarity matrix construction for spectral clustering [J].
Beauchemin, Mario .
NEUROCOMPUTING, 2015, 151 :835-844
[2]   A Max-Flow-Based Similarity Measure for Spectral Clustering [J].
Cao, Jiangzhong ;
Chen, Pei ;
Zheng, Yun ;
Dai, Qingyun .
ETRI JOURNAL, 2013, 35 (02) :311-320
[3]   Robust path-based spectral clustering [J].
Chang, Hong ;
Yeung, Dit-Yan .
PATTERN RECOGNITION, 2008, 41 (01) :191-203
[4]   Parallel Spectral Clustering in Distributed Systems [J].
Chen, Wen-Yen ;
Song, Yangqiu ;
Bai, Hongjie ;
Lin, Chih-Jen ;
Chang, Edward Y. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :568-586
[5]  
Ertöz L, 2003, SIAM PROC S, P47
[6]  
Fanti C, 2004, ADV NEUR IN, V16, P1603
[7]   Spectral grouping using the Nystrom method [J].
Fowlkes, C ;
Belongie, S ;
Chung, F ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (02) :214-225
[8]   An Adaptive Spectral Clustering Algorithm Based on the Importance of Shared Nearest Neighbors [J].
He, Xiaoqi ;
Zhang, Sheng ;
Liu, Yangguang .
ALGORITHMS, 2015, 8 (02) :177-189
[9]   AN IMPROVED SPECTRAL GRAPH PARTITIONING ALGORITHM FOR MAPPING PARALLEL COMPUTATIONS [J].
HENDRICKSON, B ;
LELAND, R .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1995, 16 (02) :452-469
[10]  
Houle ME, 2010, LECT NOTES COMPUT SC, V6187, P482, DOI 10.1007/978-3-642-13818-8_34