Optimized gravitational-based data clustering algorithm

被引:15
|
作者
Alswaitti, Mohammed [1 ]
Ishak, Mohamad Khairi [1 ]
Isa, Nor Ashidi Mat [1 ]
机构
[1] Univ Sains Malaysia, Sch Elect & Elect Engn, Engn Campus, Nibong Tebal 14300, Penang, Malaysia
关键词
Gravitational clustering; Centroid initialization; Nature-inspired algorithms; Exploitation and exploration balance; Clustering analysis; SYSTEMS;
D O I
10.1016/j.engappai.2018.05.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gravitational clustering is a nature-inspired and heuristic-based technique. The performance of nature-inspired algorithms relies on the balance achieved between exploitation and exploration. A modification over a data clustering algorithm based on the universal gravity rule is proposed in this paper. Although gravitational clustering algorithm has a high exploration ability, it lacks a proper exploitation mechanism because of the impulsive velocity of agents that search the solution space, which leads to the huge step size of agent positions through iterations. This study proposes the following solutions to impose a balance between exploitation and exploration: (i) the dependence of the agent on velocity history is removed to avoid high velocity caused by accumulating previous velocities, and (ii) an initialization step of centroid positions is added using the variance and median initialization method with a predefined number of clusters. The initialization step eliminates the effects of random initialization and subrogates the exploration process. Experiments are conducted using 13 benchmark datasets from the UCI machine learning repository. In addition, the proposed algorithm is tested on two case studies using the electrical hotspots and cervical cell datasets. The performance of the proposed clustering algorithm is compared qualitatively and quantitatively with several state-of-the-art clustering algorithms. The obtained results indicate that the proposed clustering algorithm outperforms conventional techniques. Furthermore, the clusters obtained using the proposed algorithm are more homogeneous than those obtained using conventional techniques. The proposed algorithm quantitatively achieves better results than the other techniques in 9 out of 15 datasets in terms of accuracy, F-score, and purity.
引用
收藏
页码:126 / 148
页数:23
相关论文
共 50 条
  • [41] A new algorithm based on metaheuristics for data clustering
    Tsutomu SHOHDOHJI
    Fumihiko YANO
    Yoshiaki TOYODA
    Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2010, 11 (12) : 921 - 926
  • [42] Data Clustering Based on Approach of Genetic Algorithm
    Wang, Hai-hui
    Zhao, Wen-jie
    2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 2753 - 2757
  • [43] A hybrid data clustering algorithm based on improved krill herd algorithm and KHM clustering
    Wang Q.-P.
    Ding C.
    Wang X.-F.
    Kongzhi yu Juece/Control and Decision, 2020, 35 (10): : 2449 - 2458
  • [44] An optimized Weight-Based Clustering Algorithm in Wireless Sensor Networks
    Belabed, Fatma
    Bouallegue, Ridha
    2016 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2016, : 757 - 762
  • [45] Optimized Clustering Routing Algorithm Based on Deviation Maximization Method and the BFS
    Gou, Pingzhang
    Sun, Xianchao
    Zhang, Fen
    Tian, Ran
    2019 4TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2019), 2019, : 975 - 978
  • [46] An Optimized Image Retrieval Method based on Hierarchal Clustering and Genetic Algorithm
    Huang Min
    Sun Bo
    Xi Jianqing
    2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 747 - +
  • [47] Radar Cross Section Reduction Based on Gravitational Search Algorithm optimized Metasurface
    Song, Yi-Chuan
    Ding, Jun
    Guo, Chen-Jiang
    Ren, Yu-Hui
    Zhang, Jia-Kai
    9TH INTERNATIONAL CONFERENCE ON MICROWAVE AND MILLIMETER WAVE TECHNOLOGY PROCEEDINGS, VOL. 1, (ICMMT 2016), 2016, : 312 - 314
  • [48] Clustering Algorithm Based on Time Series Similarity to Web Data Clustering
    Yang Yan
    Yao Hua-Xiong
    Li Rong
    PROCEEDINGS OF THE 2015 4TH NATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING ( NCEECE 2015), 2016, 47 : 1373 - 1377
  • [49] Introducing clustering based population in Binary Gravitational Search Algorithm for Feature Selection
    Guha, Ritam
    Ghosh, Manosij
    Chakrabarti, Akash
    Sarkar, Ram
    Mirjalili, Seyedali
    APPLIED SOFT COMPUTING, 2020, 93
  • [50] An Energy Consumption Optimized Clustering Algorithm for Radar Sensor Networks Based on an Ant Colony Algorithm
    Jiang, Ting
    Zang, Wei
    Zhao, Chenglin
    Shi, Jiong
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2010,