A fast density peaks clustering algorithm with sparse search

被引:56
|
作者
Xu, Xiao [1 ]
Ding, Shifei [1 ,2 ]
Wang, Yanru [1 ]
Wang, Lijuan [1 ]
Jia, Weikuan [3 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
[2] Minstry Educ Peoples Republ China, Mine Digitizat Engn Res Ctr, Xuzhou 221116, Jiangsu, Peoples R China
[3] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
关键词
DPC algorithm; Computational complexity; Sparse search strategy; Fewer distance calculations; Similarity matrix; FIND; SHAPES; NUMBER;
D O I
10.1016/j.ins.2020.11.050
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a large unlabeled set of complex data, how to efficiently and effectively group them into clusters remains a challenging problem. Density peaks clustering (DPC) algorithm is an emerging algorithm, which identifies cluster centers based on a decision graph. Without setting the number of cluster centers, DPC can effectively recognize the clusters. However, the similarity between every two data points must be calculated to construct a decision graph, which results in high computational complexity. To overcome this issue, we propose a fast sparse search density peaks clustering (FSDPC) algorithm to enhance the DPC, which constructs a decision graph with fewer similarity calculations to identify cluster centers quickly. In FSDPC, we design a novel sparse search strategy to measure the similarity between the nearest neighbors of each data points. Therefore, FSDPC can enhance the efficiency of the DPC while maintaining satisfactory results. We also propose a novel random third-party data point method to search the nearest neighbors, which introduces no additional parameters or high computational complexity. The experimental results on synthetic datasets and real-world datasets indicate that the proposed algorithm consistently outperforms the DPC and other state-of-the-art algorithms. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:61 / 83
页数:23
相关论文
共 50 条
  • [21] An improved density peaks clustering based on sparrow search algorithm
    Chen, Yaru
    Zhou, Jie
    He, Xingshi
    Luo, Xinglong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (08): : 11017 - 11037
  • [22] Fast density peaks clustering algorithm in polar coordinate system
    Chao Li
    Shifei Ding
    Xiao Xu
    Shuying Du
    Tianhao Shi
    Applied Intelligence, 2022, 52 : 14478 - 14490
  • [23] Fast density peaks clustering algorithm in polar coordinate system
    Li, Chao
    Ding, Shifei
    Xu, Xiao
    Du, Shuying
    Shi, Tianhao
    APPLIED INTELLIGENCE, 2022, 52 (12) : 14478 - 14490
  • [24] Adaptive cutoff distance: Clustering by fast search and find of density peaks
    Mehmood, Rashid
    Bie, Rongfang
    Jiao, Libin
    Dawood, Hussain
    Sun, Yunchun
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (05) : 2619 - 2628
  • [25] Constraint-based clustering by fast search and find of density peaks
    Liu, Ruhui
    Huang, Weiping
    Fei, Zhengshun
    Wang, Kai
    Liang, Jun
    NEUROCOMPUTING, 2019, 330 : 223 - 237
  • [26] Clustering by fast search and find of density peaks via heat diffusion
    Mehmood, Rashid
    Zhang, Guangzhi
    Bie, Rongfang
    Dawood, Hassan
    Ahmad, Haseeb
    NEUROCOMPUTING, 2016, 208 : 210 - 217
  • [27] Intelligent fault diagnosis of rolling bearings based on clustering algorithm of fast search and find of density peaks
    Wu, Jun
    Lin, Manxi
    Lv, Yaqiong
    Cheng, Yiwei
    QUALITY ENGINEERING, 2023, 35 (03) : 399 - 412
  • [28] An improved density peaks clustering algorithm with fast finding cluster centers
    Xu, Xiao
    Ding, Shifei
    Shi, Zhongzhi
    KNOWLEDGE-BASED SYSTEMS, 2018, 158 : 65 - 74
  • [29] A Fast Density Peaks Clustering Algorithm Based on Pre-screening
    Xu, Xiao
    Ding, Shifei
    Sun, Tongfeng
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 513 - 516
  • [30] Fast and general density peaks clustering
    Sieranoja, Sami
    Franti, Pasi
    PATTERN RECOGNITION LETTERS, 2019, 128 : 551 - 558