Clustering based on local density peaks and graph cut

被引:25
|
作者
Long, Zhiguo [1 ]
Gao, Yang [1 ]
Meng, Hua [2 ]
Yao, Yuqin [2 ]
Li, Tianrui [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
[2] Southwest Jiaotong Univ, Sch Math, Chengdu 611756, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Density peaks; Spectral clustering; Local density; Similarity between trees;
D O I
10.1016/j.ins.2022.03.091
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering by fast search and find of density peaks (DPC) is a widely used and studied clustering algorithm. In this article, we notice that DPC can achieve highly accurate clustering results when restricted to local neighborhoods. Therefore, by investigating density information in local neighborhoods, we propose to capture latent structures in data with family trees, which can reflect density dominations among nearest neighbors of data. A data set will then be partitioned into multiple family trees. In order to obtain the final clustering result, instead of exploiting the error-prone allocation strategy of DPC, we first elaborately design a novel similarity measure for family trees, characterizing not only the distance between data points, but also the structure of trees. Then, we adapt graph cut for the corresponding connection graph to also take global structural information into account. Extensive experiments on both real-world and synthetic data sets show that the proposed algorithm can outperform several prominent clustering algorithms for most of the cases, including the DPC and spectral clustering algorithms and some of their latest variants. We also analyze the robustness of the proposed algorithm w.r.t. hyper-parameters and its time complexity, as well as the necessity of its components through ablation study. (c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:263 / 286
页数:24
相关论文
共 50 条
  • [31] Density peaks clustering based on density backbone and fuzzy neighborhood
    Lotfi, Abdulrahman
    Moradi, Parham
    Beigy, Hamid
    PATTERN RECOGNITION, 2020, 107 (107)
  • [32] A three-way clustering method based on improved density peaks algorithm and boundary detection graph
    Sun, Chen
    Du, Mingjing
    Sun, Jiarui
    Li, Kangkang
    Dong, Yongquan
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 153 : 239 - 257
  • [33] Connection density based clustering: A graph-based density clustering method
    Xu, Feng
    Cai, Mingjie
    Li, Qingguo
    Zhou, Jie
    Fujita, Hamido
    APPLIED SOFT COMPUTING, 2024, 161
  • [34] An Adaptive Clustering Algorithm Based on Local-Density Peaks for Imbalanced Data Without Parameters
    Tong, Wuning
    Wang, Yuping
    Liu, Delong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 3419 - 3432
  • [35] Density peaks clustering based on mutual neighbor degree
    Zhao J.
    Yao Z.-F.
    Lyu L.
    Fan T.-H.
    Kongzhi yu Juece/Control and Decision, 2021, 36 (03): : 543 - 552
  • [36] Coflow scheduling algorithm based density peaks clustering
    Li, Chenghao
    Zhang, Huyin
    Zhou, Tianying
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 805 - 813
  • [37] Cosine kernel based density peaks clustering algorithm
    Wang, Jiayuan
    Lv, Li
    Wu, Runxiu
    Fan, Tanghuai
    Lee, Ivan
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2020, 12 (01) : 1 - 20
  • [38] Fast Clustering by Affinity Propagation Based on Density Peaks
    Li, Yang
    Guo, Chonghui
    Sun, Leilei
    IEEE ACCESS, 2020, 8 : 138884 - 138897
  • [39] Improved density peaks clustering based on firefly algorithm
    Zhao J.
    Tang J.
    Shi A.
    Fan T.
    Xu L.
    Xu, Lizhong (lxu0530@126.com), 1600, Inderscience Enterprises Ltd. (15): : 24 - 42
  • [40] A text clustering algorithm based on find of density peaks
    Liu, Peiyu
    Liu, Yingying
    Hou, Xiuyan
    Li, Qingqing
    Zhu, Zhenfang
    2015 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME), 2015, : 348 - 352