Automatic centroid initialization in k-means using artificial hummingbird algorithm

被引:0
作者
Kusum Preeti [1 ]
undefined Deep [2 ]
机构
[1] Department of Mathematics, Indian Institute of Technology Roorkee, Uttarakhand, Roorkee
[2] The University of Tennessee Health Science Centre, Memphis, 38163, TN
关键词
Clustering analysis; Data clustering; K-means; Nature inspired algorithm;
D O I
10.1007/s00521-024-10764-4
中图分类号
学科分类号
摘要
K-means is a widely used technique that heavily relies on the initial cluster centroid location. Poorly chosen centroids can cause the algorithm to get trapped in suboptimal solutions. Additionally, determining the optimal number of clusters for large datasets is computationally expensive. To address these challenges, a recently developed Artificial Hummingbird Algorithm (AHA) is used to initialize cluster centroid locations and automatically determine the best estimate for the number of clusters. AHA simulates the specialized flight skills and intelligent foraging strategies of hummingbirds, striking a fine balance between exploration and exploitation during the search process. Unlike other data clustering approaches that use a fixed threshold in heuristic methods, we propose a dynamic threshold based on the variance of the data with respect to its centroids for activating cluster centroids in AHA. The data are automatically partitioned into k cluster centroids such that cohesion, measured by cluster diameters, and separation, measured by nearest neighbor distance, are optimized. The algorithm is tested on various datasets, including real-world data, fundamental clustering benchmarks, synthetic data, and high-dimensional data. To evaluate performance, metrics such as fitness value, inter-cluster distance, and intra-cluster distance were used. Results indicate that the proposed method ranked first and achieved superior clustering performance compared to state-of-the-art algorithms. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:3373 / 3398
页数:25
相关论文
共 50 条
  • [1] Greedy centroid initialization for federated K-means
    Yang, Kun
    Amiri, Mohammad Mohammadi
    Kulkarni, Sanjeev R.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (06) : 3393 - 3425
  • [2] Density K-means : A New Algorithm for Centers Initialization for K-means
    Lan, Xv
    Li, Qian
    Zheng, Yi
    PROCEEDINGS OF 2015 6TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE, 2015, : 958 - 961
  • [3] A Near-Optimal Centroids Initialization in K-means Algorithm Using Bees Algorithm
    Mahmuddin, M.
    Yusof, Y.
    COMPUTING & INFORMATICS, 2009, : 172 - 175
  • [4] DETERMINISTIC INITIALIZATION OF THE K-MEANS ALGORITHM USING HIERARCHICAL CLUSTERING
    Celebi, M. Emre
    Kingravi, Hassan A.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
  • [5] Adaptive Initialization Method for K-Means Algorithm
    Yang, Jie
    Wang, Yu-Kai
    Yao, Xin
    Lin, Chin-Teng
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2021, 4
  • [6] AN INTELLIGENT INITIALIZATION METHOD FOR THE K-MEANS CLUSTERING ALGORITHM
    Sheu, Jyh-Jian
    Chen, Wei-Ming
    Tsai, Wen-Bin
    Chu, Ko-Tsung
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (06): : 2551 - 2566
  • [7] HOW THE INITIALIZATION AFFECTS THE STABILITY OF THE k-MEANS ALGORITHM
    Bubeck, Sebastien
    Meila, Marina
    von Luxburg, Ulrike
    ESAIM-PROBABILITY AND STATISTICS, 2012, 16 : 436 - 452
  • [8] Initialization methods for remote sensing image clustering using K-means algorithm
    Zhong Y.-F.
    Zhang L.-P.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2010, 32 (09): : 2009 - 2014
  • [9] Initialization for K-means clustering using Voronoi diagram
    Reddy, Damodar
    Jana, Prasanta K.
    2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND INFORMATION TECHNOLOGY (C3IT-2012), 2012, 4 : 395 - 400
  • [10] An Improved Heuristic K-Means Clustering Method Using Genetic Algorithm Based Initialization
    Mustafi, D.
    Sahoo, G.
    Mustafi, A.
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, 2017, 509 : 123 - 132