Optimized gravitational-based data clustering algorithm

被引:15
|
作者
Alswaitti, Mohammed [1 ]
Ishak, Mohamad Khairi [1 ]
Isa, Nor Ashidi Mat [1 ]
机构
[1] Univ Sains Malaysia, Sch Elect & Elect Engn, Engn Campus, Nibong Tebal 14300, Penang, Malaysia
关键词
Gravitational clustering; Centroid initialization; Nature-inspired algorithms; Exploitation and exploration balance; Clustering analysis; SYSTEMS;
D O I
10.1016/j.engappai.2018.05.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gravitational clustering is a nature-inspired and heuristic-based technique. The performance of nature-inspired algorithms relies on the balance achieved between exploitation and exploration. A modification over a data clustering algorithm based on the universal gravity rule is proposed in this paper. Although gravitational clustering algorithm has a high exploration ability, it lacks a proper exploitation mechanism because of the impulsive velocity of agents that search the solution space, which leads to the huge step size of agent positions through iterations. This study proposes the following solutions to impose a balance between exploitation and exploration: (i) the dependence of the agent on velocity history is removed to avoid high velocity caused by accumulating previous velocities, and (ii) an initialization step of centroid positions is added using the variance and median initialization method with a predefined number of clusters. The initialization step eliminates the effects of random initialization and subrogates the exploration process. Experiments are conducted using 13 benchmark datasets from the UCI machine learning repository. In addition, the proposed algorithm is tested on two case studies using the electrical hotspots and cervical cell datasets. The performance of the proposed clustering algorithm is compared qualitatively and quantitatively with several state-of-the-art clustering algorithms. The obtained results indicate that the proposed clustering algorithm outperforms conventional techniques. Furthermore, the clusters obtained using the proposed algorithm are more homogeneous than those obtained using conventional techniques. The proposed algorithm quantitatively achieves better results than the other techniques in 9 out of 15 datasets in terms of accuracy, F-score, and purity.
引用
收藏
页码:126 / 148
页数:23
相关论文
共 50 条
  • [21] Exemplar-Based Clustering Analysis Optimized by Genetic Algorithm
    Yang Zhen
    Wang Laitao
    Fan Kefeng
    Lai Yingxu
    CHINESE JOURNAL OF ELECTRONICS, 2013, 22 (04): : 735 - 740
  • [22] Optimized Density Peaks Clustering Algorithm Based on Dissimilarity Measure
    Ding S.-F.
    Xu X.
    Wang Y.-R.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (11): : 3321 - 3333
  • [23] A K-means Optimized Clustering Algorithm Based on Improved Genetic Algorithm
    Pu, Qiu-Mei
    Wu, Qiong
    Li, Qian
    Lecture Notes in Electrical Engineering, 2022, 801 LNEE : 133 - 140
  • [24] Log analysis audit model based on optimized clustering algorithm
    Yu Hui
    Shi Xingjian
    2007 IFIP INTERNATIONAL CONFERENCE ON NETWORK AND PARALLEL COMPUTING WORKSHOPS, PROCEEDINGS, 2007, : 841 - 846
  • [25] HBO Based Clustering and Energy Optimized Routing Algorithm for WSN
    Selvi, M.
    Nandhini, C.
    Thangaramya, K.
    Kulothungan, K.
    Kannan, A.
    2016 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2017, : 89 - 92
  • [26] Wafer map preprocessing based on optimized DBSCAN clustering algorithm
    Chen S.-H.
    Yi M.-L.
    Zhang Y.-X.
    Shang Y.-L.
    Yang P.
    Yang, Ping (yangping1964@163.com), 1600, Northeast University (36): : 2713 - 2721
  • [27] Bayesian optimized indoor positioning algorithm based on dual clustering
    Chen, Min
    Pu, Qiaolin
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [28] Hybridization of the Gravitational Search Algorithm and Big Bang-Big Crunch Algorithm for Data Clustering
    Hatamlou, Abdolreza
    Hatamlou, Masoumeh
    FUNDAMENTA INFORMATICAE, 2013, 126 (04) : 319 - 333
  • [29] Remora optimization algorithm-based optimized node clustering technique for reliable data delivery in VANETs
    Konduru S.
    Sathya M.
    International Journal of Intelligent Networks, 2022, 3 : 74 - 79
  • [30] A DATA STREAMS CLUSTERING ALGORITHM BASED ON INTERVAL DATA
    Li, Yan
    Ye, Ming
    Wang, Huiwen
    Liu, Dan
    Che, Yin
    PROCEEDINGS OF THE 38TH INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2008, : 2775 - 2778