CA-CSM: a novel clustering algorithm based on cluster center selection model

被引:2
作者
Zhang, Ruilin [1 ]
Song, Xinghao [1 ]
Ying, Surong [1 ]
Ren, Huilin [1 ]
Zhang, Boyu [1 ]
Wang, Hongpeng [1 ]
机构
[1] Harbin Inst Technol, Shenzhen, Peoples R China
关键词
Clustering; Cluster center selection; Parameter-free local density; Boundary degree; Core object; DENSITY PEAKS; FAST SEARCH; FIND; SKEWNESS; NUMBER;
D O I
10.1007/s00500-021-05835-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering by fast search and find of density peaks (DPC) is a well-known algorithm due to the simple structure and high extensibility. It requires neither iteration nor additional parameters. However, DPC and most of its improvements still encounter some challenges such as parameter dependence, unreasonable metric and cluster center determination difficulty. Aiming at these issues, we propose a novel clustering algorithm based on cluster center selection model (CA-CSM). Firstly, we calculate the density parameter automatically according to the local information of the object to reduce the parameter dependence. Subsequently, we propose the concept of boundary degree to discriminate core objects from non-core objects. With the local density metric, we establish a model (CSM) with high expansibility to automatically detect the cluster centers from core objects. We test CA-CSM on 21 datasets using five benchmarks and compare it to 7 state-of-the-art algorithms. Extensive experiments and analysis show that our algorithm is feasible and effective.
引用
收藏
页码:8015 / 8033
页数:19
相关论文
共 43 条
  • [1] Performance evaluation of density-based clustering methods
    Aliguliyev, Ramiz M.
    [J]. INFORMATION SCIENCES, 2009, 179 (20) : 3583 - 3602
  • [2] Border-Peeling Clustering
    Averbuch-Elor, Hadar
    Bar, Nadav
    Cohen-Or, Daniel
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (07) : 1791 - 1797
  • [3] FCM - THE FUZZY C-MEANS CLUSTERING-ALGORITHM
    BEZDEK, JC
    EHRLICH, R
    FULL, W
    [J]. COMPUTERS & GEOSCIENCES, 1984, 10 (2-3) : 191 - 203
  • [4] Multidimensional Balance-Based Cluster Boundary Detection for High-Dimensional Data
    Cao, Xiaofeng
    Qiu, Baozhi
    Li, Xiangli
    Shi, Zenglin
    Xu, Guandong
    Xu, Jianliang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (06) : 1867 - 1880
  • [5] Chen HP, 2017, NEUROCOMPUTING, V236, P104, DOI 10.1016/j.neucom.2016.09.103
  • [6] A fast density-based data stream clustering algorithm with cluster centers self-determined for mixed data
    Chen, Jin-Yin
    He, Hui-Hao
    [J]. INFORMATION SCIENCES, 2016, 345 : 271 - 293
  • [7] BLOCK-DBSCAN: Fast clustering for large scale data
    Chen, Yewang
    Zhou, Lida
    Bouguila, Nizar
    Wang, Cheng
    Chen, Yi
    Du, Jixiang
    [J]. PATTERN RECOGNITION, 2021, 109
  • [8] A novel clustering algorithm based on the natural reverse nearest neighbor structure
    Dai, Qi-Zhu
    Xiong, Zhong-Yang
    Xie, Jiang
    Wang, Xiao-Xia
    Zhang, Yu-Fang
    Shang, Jia-Xing
    [J]. INFORMATION SYSTEMS, 2019, 84 : 1 - 16
  • [9] An entropy-based density peaks clustering algorithm for mixed type data employing fuzzy neighborhood
    Ding, Shifei
    Du, Mingjing
    Sun, Tongfeng
    Xu, Xiao
    Xue, Yu
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 133 : 294 - 313
  • [10] Residual Excitation Skewness for Automatic Speech Polarity Detection
    Drugman, Thomas
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (04) : 387 - 390