Density peaks clustering based on geodetic distance and dynamic neighbourhood

被引:7
作者
Lv, Li [1 ]
Wang, Jiayuan [1 ]
Wu, Runxiu [1 ]
Wang, Hui [1 ]
Lee, Ivan [2 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330099, Jiangxi, Peoples R China
[2] Univ South Australia, UniSA S, Adelaide, SA 5000, Australia
基金
中国国家自然科学基金;
关键词
density peaks; clustering; geodetic distance; dynamic neighbourhood; ALGORITHM;
D O I
10.1504/IJBIC.2021.113363
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Density peaks clustering algorithm uses Euclidean distance as a measure of similarity between the samples and it can achieve a good clustering effect when processing the manifold datasets. Utilising this feature, we propose a density peaks clustering algorithm based on geodetic distance and dynamic neighbourhood. This new algorithm measures the similarity between the samples by using geodetic distance, and the number of neighbours K is dynamically adjusted according to the spatial distribution of samples for geodetic distance computation. By choosing geodetic distance as the similarity measure, the problems of manifold dataset clustering can be easily solved, and the clustering is made more effective when the sparse clusters and dense clusters co-exist. The new algorithm was then compared against the other five clustering algorithms on six synthetic datasets and ten real-world datasets. The experiments showed that the proposed algorithm not only outperformed the other conventional algorithms on manifold datasets, but also achieved a very good clustering effect on multi-scale, cluttered and intertwined datasets.
引用
收藏
页码:24 / 33
页数:10
相关论文
共 22 条
[1]   A hybrid recommendation system with many-objective evolutionary algorithm [J].
Cai, Xingjuan ;
Hu, Zhaoming ;
Zhao, Peng ;
Zhang, WenSheng ;
Chen, Jinjun .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 159
[2]   Hybrid many-objective particle swarm optimization algorithm for green coal production problem [J].
Cui, Zhihua ;
Zhang, Jiangjiang ;
Wu, Di ;
Cai, Xingjuan ;
Wang, Hui ;
Zhang, Wensheng ;
Chen, Jinjun .
INFORMATION SCIENCES, 2020, 518 :256-271
[3]   Detection of Malicious Code Variants Based on Deep Learning [J].
Cui, Zhihua ;
Xue, Fei ;
Cai, Xingjuan ;
Cao, Yang ;
Wang, Gai-ge ;
Chen, Jinjun .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) :3187-3196
[4]   A robust density peaks clustering algorithm using fuzzy neighborhood [J].
Du, Mingjing ;
Ding, Shifei ;
Xue, Yu .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (07) :1131-1140
[5]   Density peaks clustering using geodesic distances [J].
Du, Mingjing ;
Ding, Shifei ;
Xu, Xiao ;
Xue, Yu .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (08) :1335-1349
[6]  
Ester M., 1996, PROC 2 INT C KNOWLED, P226, DOI DOI 10.5555/3001460.3001507
[7]   A METHOD FOR COMPARING 2 HIERARCHICAL CLUSTERINGS [J].
FOWLKES, EB ;
MALLOWS, CL .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1983, 78 (383) :553-569
[8]   Fast agglomerative clustering using a k-nearest neighbor graph [J].
Franti, Pasi ;
Virmajoki, Olli ;
Hautamaki, Ville .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (11) :1875-1881
[9]   Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[10]  
Lichman M., 2013, UCI machine learning repository