IBBA: an improved binary bat algorithm for solving low and high-dimensional feature selection problems

被引:0
|
作者
Wang, Tao [1 ,2 ]
Xie, Minzhu [1 ,2 ,3 ]
机构
[1] Hunan Normal Univ, Coll Math & Stat, Changsha 410081, Peoples R China
[2] Minist Educ, Key Lab Comp & Stochast Math, Changsha 410081, Peoples R China
[3] Hunan Normal Univ, Coll Informat Sci & Engn, Changsha 410081, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Metaheuristics algorithms; Bat algorithm; Mutual information; Low and high-dimensional datasets; OPTIMIZATION ALGORITHM;
D O I
10.1007/s13042-025-02588-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Technological advancements have resulted in the accumulation of vast amounts of data across various industries, often containing redundant or irrelevant features. As a result, the development of efficient feature selection methods has become increasingly critical. This paper proposes an Improved Binary Bat Algorithm (IBBA) to overcome the limitations of the original Bat Algorithm (BA), particularly its weak exploration ability and tendency to become trapped in local optima. IBBA enhances both exploration and exploitation through a novel Fitness-based Exploitation Strategy (FES) and an improved Harris Hawks Optimization (HHO). Additionally, random perturbations are introduced during iterations to adjust positions that deviate from the search space, thus preventing ineffective searches. Since the original BA is primarily designed for continuous optimization problems, this study also investigates the effect of four V-shaped transfer functions on the algorithm's performance. Experimental results on 28 datasets with varying dimensionalities (ranging from nine to 12,600 features) demonstrate that IBBA outperforms 12 state-of-the-art metaheuristic algorithms in terms of fitness, accuracy, feature selection ratio, and runtime. Moreover, an analysis of exploration and exploitation shows that IBBA effectively balances these two processes, addressing BA's exploration shortcomings. The Wilcoxon signed-rank test, conducted at a significance level of 0.05, validates the algorithm's effectiveness, revealing that IBBA demonstrates significant advantages in 87.5% of the tests. Finally, comparisons with 14 recently proposed feature selection methods highlight IBBA's competitive classification accuracy. Therefore, this study is expected to make a valuable contribution to solving feature selection problems across datasets with diverse dimensionalities.
引用
收藏
页数:31
相关论文
共 50 条
  • [1] Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems
    Akinola, Olatunji A.
    Agushaka, Jeffrey O.
    Ezugwu, Absalom E.
    PLOS ONE, 2022, 17 (10):
  • [2] Fractional-order binary bat algorithm for feature selection on high-dimensional microarray data
    Esfandiari A.
    Farivar F.
    Khaloozadeh H.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (06) : 7453 - 7467
  • [3] A new binary object-oriented programming optimization algorithm for solving high-dimensional feature selection problem
    Khalid, Asmaa M.
    Said, Wael
    Elmezain, Mahmoud
    Hosny, Khalid M.
    ALEXANDRIA ENGINEERING JOURNAL, 2023, 85 : 72 - 85
  • [4] Bi-objective feature selection in high-dimensional datasets using improved binary chimp optimization algorithm
    Al-qudah, Nour Elhuda A.
    Abed-alguni, Bilal H.
    Barhoush, Malek
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (12) : 6107 - 6148
  • [5] Improved PSO for feature selection on high-dimensional datasets
    Tran, Binh (binh.tran@ecs.vuw.ac.nz), 1600, Springer Verlag (8886):
  • [6] Improved PSO for Feature Selection on High-Dimensional Datasets
    Tran, Binh
    Xue, Bing
    Zhang, Mengjie
    SIMULATED EVOLUTION AND LEARNING (SEAL 2014), 2014, 8886 : 503 - 515
  • [7] Preconditioning for feature selection and regression in high-dimensional problems'
    Paul, Debashis
    Bair, Eric
    Hastie, Trevor
    Tibshirani, Robert
    ANNALS OF STATISTICS, 2008, 36 (04): : 1595 - 1618
  • [8] A Novel Feature Selection Method for High-Dimensional Biomedical Data Based on an Improved Binary Clonal Flower Pollination Algorithm
    Yan, Chaokun
    Ma, Jingjing
    Luo, Huimin
    Zhang, Ge
    Luo, Junwei
    HUMAN HEREDITY, 2019, 84 (01) : 34 - 46
  • [9] High-Dimensional Feature Selection Based on Improved Binary Ant Colony Optimization Combined with Hybrid Rice Optimization Algorithm
    Ye, A. Zhiwei
    Li, B. Ruihan
    Zhou, C. Wen
    Wang, D. Mingwei
    Mei, E. Mengqing
    Shu, F. Zhe
    Shen, G. Jun
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
  • [10] ITCSO algorithm for solving high-dimensional optimization problems
    Zhang W.
    Wei W.-F.
    Huang W.-M.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (02): : 449 - 457