Reverse-Nearest-Neighbor-Based Clustering by Fast Search and Find of Density Peaks

被引:1
|
作者
Zhang, Chunhao [1 ]
Xie, Bin [1 ,2 ,3 ]
Zhang, Yiran [1 ]
机构
[1] Hebei Normal Univ, Coll Comp & Cyber Secur, Shijiazhuang 050024, Peoples R China
[2] Hebei Normal Univ, Hebei Prov Engn Res Ctr Supply Chain Big Data Ana, Shijiazhuang 050024, Peoples R China
[3] Hebei Normal Univ, Hebei Prov Key Lab Network & Informat Secur, Shijiazhuang 050024, Peoples R China
基金
中国国家自然科学基金;
关键词
Density peaks; Reverse nearest neighbor; Clustering; Cluster fusion; ALGORITHM;
D O I
10.23919/cje.2022.00.165
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Clustering by fast search and find of density peaks (CFSFDP) has the advantages of a novel idea, easy implementation, and efficient clustering. It has been widely recognized in various fields since it was proposed in Science in 2014. The CFSFDP algorithm also has certain limitations, such as non-unified sample density metrics defined by cutoff distance, the domino effect for the assignment of remaining samples triggered by unstable assignment strategy, and the phenomenon of picking wrong density peaks as cluster centers. We propose reverse-nearest-neighbor-based clustering by fast search and find of density peaks (RNN-CFSFDP) to avoid these shortcomings. We redesign and unify the sample density metric by introducing reverse nearest neighbor. The newly defined local density metric and the K-nearest neighbors of each sample are combined to make the assignment process more robust and alleviate the domino effect. A cluster fusion algorithm is proposed, which further alleviates the domino effect and effectively avoids the phenomenon of picking wrong density peaks as cluster centers. Experimental results on publicly available synthetic data sets and real-world data sets show that in most cases, the proposed algorithm is superior to or at least equivalent to the comparative methods in clustering performance. The proposed algorithm works better on manifold data sets and uneven density data sets.
引用
收藏
页码:1341 / 1354
页数:14
相关论文
共 50 条
  • [21] Partial Discharge Pulse Segmentation Based on Clustering by Fast Search and Find of Density Peaks
    Zhu Y.
    Jiang W.
    Liu G.
    Zhu, Yongli (yonglipw@163.com), 1600, China Machine Press (35): : 1377 - 1386
  • [22] Clustering by fast search and find of density peaks via heat diffusion
    Mehmood, Rashid
    Zhang, Guangzhi
    Bie, Rongfang
    Dawood, Hassan
    Ahmad, Haseeb
    NEUROCOMPUTING, 2016, 208 : 210 - 217
  • [23] Density Peaks Clustering Based on Weighted Local Density Sequence and Nearest Neighbor Assignment
    Yu, Donghua
    Liu, Guojun
    Guo, Maozu
    Liu, Xiaoyan
    Yao, Shuang
    IEEE ACCESS, 2019, 7 : 34301 - 34317
  • [24] Fast Instance Search Based on Approximate Bichromatic Reverse Nearest Neighbor Search
    Iwamura, Masakazu
    Matozaki, Nobuaki
    Kise, Koichi
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 1121 - 1124
  • [25] Paralleled fast search and find of density peaks clustering algorithm on GPUs with CUDA
    Li M.
    Huang J.
    Wang J.
    International Journal of Networked and Distributed Computing, 2016, 4 (3) : 173 - 181
  • [26] Paralleled Fast Search and Find of Density Peaks Clustering Algorithm on GPUs with CUDA
    Li, Mi
    Huang, Jie
    Wang, Jingpeng
    2016 17TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2016, : 313 - 318
  • [27] A fuzzy mixed data clustering algorithm by fast search and find of density peaks
    Li, Ye
    Chen, Yiyan
    Li, Qun
    INTELLIGENT DATA ANALYSIS, 2019, 23 : S199 - S224
  • [28] Crime Data Analysis Using Clustering by Fast Search and find of Density Peaks
    Alghamdi, Ahmed
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (11): : 174 - 178
  • [29] Intelligent fault diagnosis of rolling bearings based on clustering algorithm of fast search and find of density peaks
    Wu, Jun
    Lin, Manxi
    Lv, Yaqiong
    Cheng, Yiwei
    QUALITY ENGINEERING, 2023, 35 (03) : 399 - 412
  • [30] Cleaning of Transient Fault Data in Distribution Network Based on Clustering by Fast Search and Find of Density Peaks
    Duan, Xiaoli
    Liu, Sanwei
    Huang, Fuyong
    Zhang, Daoyuan
    Zhao, Yan
    Duan, Jianjia
    Zeng, Zeyu
    Yu, Ting
    Zhong, Lipeng
    Dai, Bin
    ENGINEERING LETTERS, 2023, 31 (04) : 1348 - 1358