A novel clustering algorithm based on PageRank and minimax similarity

被引:0
作者
Qidong Liu
Ruisheng Zhang
Xin Liu
Yunyun Liu
Zhili Zhao
Rongjing Hu
机构
[1] Lanzhou University,School of Information Science and Engineering
[2] Lanzhou University,School of Higher Education
来源
Neural Computing and Applications | 2019年 / 31卷
关键词
Cluster analysis; Density-based clustering; Minimax similarity; PageRank;
D O I
暂无
中图分类号
学科分类号
摘要
Clustering by fast search and find of density peaks (herein called FDPC), as a recently proposed density-based clustering algorithm, has attracted the attention of many researchers since it can recognize arbitrary-shaped clusters. In addition, FDPC needs only one parameter dc\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d_c$$\end{document} and identifies the number of clusters by decision graph. Nevertheless, it is not clear how to find a proper dc\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d_c$$\end{document} for a given data set and such a perfect parameter may not exist in practice for the multi-scale data set. In this paper, we proposed a modified PageRank algorithm to compute the local density for each data point which is more robust than Gaussian kernel and cutoff method. Besides, FDPC yields poor results on the random distribution data sets since there may be several maxima for one cluster. To solve this problem, we proposed an improved minimax similarity method. Comparing our proposed approach with FDPC on some artificial and real-life data sets, the experimental results indicate that our proposed approach outperforms FDPC in terms of accuracy.
引用
收藏
页码:7769 / 7780
页数:11
相关论文
共 113 条
[31]  
Müller KR(2008)Robust path-based spectral clustering Pattern Recogn 2 251-1928
[32]  
Dhillon IS(2010)Theory of minimum spanning trees. I. Mean-field theory and strongly disordered spin-glass model Phys Rev E 1 4-3060
[33]  
Guan Y(2005)Fast PageRank computation via a sparse linear system Internet Math 30 1624-undefined
[34]  
Kulis B(2007)Clustering aggregation ACM Trans Knowl Discov Data (TKDD) 29 553-undefined
[35]  
Tu E(2018)Robust MST-based clustering algorithm Neural Comput 24 1917-undefined
[36]  
Cao L(2018)Fuzzy least squares twin support vector clustering Neural Comput Appl 28 3045-undefined
[37]  
Yang J(2014)Adaptive k-means clustering algorithm for MR breast image segmentation Neural Comput Appl undefined undefined-undefined
[38]  
Kasabov N(2018)Tree2Vector: learning a vectorial representation for tree-structured data IEEE Trans Neural Netw Learn Syst undefined undefined-undefined
[39]  
Chang D(undefined)undefined undefined undefined undefined-undefined
[40]  
Zhao Y(undefined)undefined undefined undefined undefined-undefined