Flexible density peak clustering for real-world data

被引:2
|
作者
Hou, Jian [1 ]
Lin, Houshen [1 ]
Yuan, Huaqiang [1 ]
Pelillo, Marcello [2 ,3 ]
机构
[1] Dongguan Univ Technol, Sch Comp Sci & Technol, Dongguan 523808, Peoples R China
[2] Ca Foscari Univ, DAIS, I-30172 Venice, Italy
[3] Ca Foscari Univ, European Ctr Living Technol, I-30123 Venice, Italy
基金
中国国家自然科学基金;
关键词
Clustering; Density peak; Real-world data; Number of clusters; FAST SEARCH; K-MEANS; FIND;
D O I
10.1016/j.patcog.2024.110772
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In density based clustering, the density peak algorithm has attracted much attention due to its effectiveness and simplicity, and a vast amount of clustering approaches have been proposed based on this algorithm. Some of these works require manual selection of cluster centers with a decision graph, where human involvement leads to uncertainty in clustering results. In order to avoid human involvement, some other algorithms depend on user-specified number of clusters to determine cluster centers automatically. However, it is well known that accurate estimation of number of clusters is a long-standing difficulty in data clustering. In this paper we present a sequential density peak clustering algorithm to extract clusters one by one, thereby determining the number of clusters automatically and avoiding manual selection of cluster centers in the meanwhile. Starting from a density peak, our algorithm generates an initial cluster surrounding the density peak in the first step, and then obtains the final cluster by expanding the initial cluster based on the relative density relationship among neighboring data points. With a peeling-off strategy, we obtain all the clusters sequentially. Our algorithm works well with clusters of Gaussian distribution and is therefore potential for clustering of real-world data. Experiments with a large number of synthetic and real datasets and comparisons with existing algorithms demonstrate the effectiveness of the proposed algorithm.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Streamline Density Peak Clustering for Practical Adoptions
    Yang, Shuai
    Shen, Xipeng
    Chi, Min
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 49 - 58
  • [42] An Enhanced Density Peak Based Clustering Algorithm
    Hou, Jian
    Liu, Weixue
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 411 - 416
  • [43] Real-World Evidence: Integrating Machine Learning with Real-World Big Data for Predictive Analytics in Healthcare
    Vecchio, Nicolas
    CARDIOLOGY, 2024,
  • [44] A Novel Density Peak Clustering Algorithm based on Squared Residual Error
    Parmar, Milan
    Wang, Di
    Tan, Ah-Hwee
    Miao, Chunyan
    Jiang, Jianhua
    Zhou, You
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 43 - 48
  • [45] Quantum density peak clustering
    Duarte Magano
    Lorenzo Buffoni
    Yasser Omar
    Quantum Machine Intelligence, 2023, 5
  • [46] Quantum density peak clustering
    Magano, Duarte
    Buffoni, Lorenzo
    Omar, Yasser
    QUANTUM MACHINE INTELLIGENCE, 2023, 5 (01)
  • [47] Fast density peak clustering for large scale data based on kNN
    Chen, Yewang
    Hu, Xiaoliang
    Fan, Wentao
    Shen, Lianlian
    Zhang, Zheng
    Liu, Xin
    Du, Jixiang
    Li, Haibo
    Chen, Yi
    Li, Hailin
    KNOWLEDGE-BASED SYSTEMS, 2020, 187
  • [48] Multivariate functional data clustering using adaptive density peak detection
    Ren, Rui
    Fang, Kuangnan
    Zhang, Qingzhao
    Wang, Xiaofeng
    STATISTICS IN MEDICINE, 2023, 42 (10) : 1565 - 1582
  • [49] Density peak clustering using global and local consistency adjustable manifold distance
    Tao, Xinmin
    Guo, Wenjie
    Ren, Chao
    Li, Qing
    He, Qing
    Liu, Rui
    Zou, Junrong
    INFORMATION SCIENCES, 2021, 577 : 769 - 804
  • [50] Enhancing Cluster Center Identification in Density Peak Clustering
    Hou, Jian
    Zhang, Aihua
    Lv, Chengcong
    Xu, E.
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2018), PT I, 2018, 11061 : 268 - 275