DenMune: Density peak based clustering using mutual nearest neighbors

被引:45
作者
Abbas, Mohamed [1 ]
El-Zoghabi, Adel [1 ]
Shoukry, Amin [2 ,3 ]
机构
[1] Inst Grad Studies & Res, Informat Technol, Alexandria, Egypt
[2] Egypt Japan Univ Sci & Technol, Comp Sci & Engn, Alexandria, Egypt
[3] Fac Engn, Comp & Syst Engn Dept, Alexandria, Egypt
关键词
Clustering; Mutual neighbors; Dimensionality reduction; Arbitrary shapes; Pattern recognition; Nearest neighbors; Density peak; ALGORITHM;
D O I
10.1016/j.patcog.2020.107589
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many clustering algorithms fail when clusters are of arbitrary shapes, of varying densities, or the data classes are unbalanced and close to each other, even in two dimensions. A novel clustering algorithm "DenMune" is presented to meet this challenge. It is based on identifying dense regions using mutual nearest neighborhoods of size K , where K is the only parameter required from the user, besides obeying the mutual nearest neighbor consistency principle. The algorithm is stable for a wide range of values of K . Moreover, it is able to automatically detect and remove noise from the clustering process as well as detecting the target clusters. It produces robust results on various low and high dimensional datasets relative to several known state of the art clustering algorithms. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:18
相关论文
共 27 条
  • [1] Abbas M. A., 2012, 2012 11th International Conference on Information Sciences, Signal Processing and their Applications (ISSPA), P1192, DOI 10.1109/ISSPA.2012.6310472
  • [2] Abbas M. A., 2012, INT J DATA ENG, V3, P28
  • [3] To cluster, or not to cluster: An analysis of clusterability methods
    Adolfsson, Andreas
    Ackerman, Margareta
    Brownstein, Naomi C.
    [J]. PATTERN RECOGNITION, 2019, 88 : 13 - 26
  • [4] [Anonymous], 2018, PATTERN RECOGN, DOI DOI 10.1016/j.patcog.2018.05.014
  • [5] Arthur D., 2007, P 18 ANN ACM SIAM S, P877
  • [6] Combining unsupervised and supervised learning in credit card fraud detection
    Carcillo, Fabrizio
    Le Borgne, Yann-Ael
    Caelen, Olivier
    Kessaci, Yacine
    Oble, Frederic
    Bontempi, Gianluca
    [J]. INFORMATION SCIENCES, 2021, 557 : 317 - 331
  • [7] Fast and Accurate Hierarchical Clustering Based on Growing Multilayer Topology Training
    Cheung, Yiu-ming
    Zhang, Yiqun
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (03) : 876 - 890
  • [8] Fast and effective cluster-based information retrieval using frequent closed itemsets
    Djenouri, Youcef
    Belhadi, Asma
    Fournier-Viger, Philippe
    Lin, Jerry Chun-Wei
    [J]. INFORMATION SCIENCES, 2018, 453 : 154 - 167
  • [9] Ertöz L, 2003, SIAM PROC S, P47
  • [10] How much can k-means be improved by using better initialization and repeats?
    Franti, Pasi
    Sieranoja, Sami
    [J]. PATTERN RECOGNITION, 2019, 93 : 95 - 112