A comprehensive learning based swarm optimization approach for feature selection in gene expression data

被引:2
|
作者
Easwaran, Subha [1 ]
Venugopal, Jothi Prakash [2 ]
Subramanian, Arul Antran Vijay [3 ]
Sundaram, Gopikrishnan [4 ]
Naseeba, Beebi [4 ]
机构
[1] Karpagam Coll Engn, Dept Sci & Humanities, Coimbatore 641032, Tamil Nadu, India
[2] Karpagam Coll Engn, Dept Informat Technol, Coimbatore 641032, Tamil Nadu, India
[3] Karpagam Coll Engn, Dept Comp Sci & Engn, Coimbatore 641032, Tamil Nadu, India
[4] VIT AP Univ, Sch Comp Sci & Engn, Amaravathi 522241, Andhra Pradesh, India
关键词
Comprehensive learning; Feature selection; Gene expression; Gene selection; Swarm intelligence; Cancer classification; MICROARRAY; CLASSIFICATION;
D O I
10.1016/j.heliyon.2024.e37165
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene expression data analysis is challenging due to the high dimensionality and complexity of the data. Feature selection, which identifies relevant genes, is a common preprocessing step. We propose a Comprehensive Learning-Based Swarm Optimization (CLBSO) approach for feature selection in gene expression data. CLBSO leverages the strengths of ants and grasshoppers to efficiently explore the high-dimensional search space. Ants perform local search and leave pheromone trails to guide the swarm, while grasshoppers use their ability to jump long distances to explore new regions and avoid local optima. The proposed approach was evaluated on several publicly available gene expression datasets and compared with state-of-the-art feature selection methods. CLBSO achieved an average accuracy improvement of 15% over the original high-dimensional data and outperformed other feature selection methods by up to 10%. For instance, in the Pancreatic cancer dataset, CLBSO achieved 97.2% accuracy, significantly higher than XGBoost-MOGA's 84.0%. Convergence analysis showed CLBSO required fewer iterations to reach optimal solutions. Statistical analysis confirmed significant performance improvements, and stability analysis demonstrated consistent gene subset selection across different runs. These findings highlight the robustness and efficacy of CLBSO in handling complex gene expression datasets, making it a valuable tool for enhancing classification tasks in bioinformatics.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Joint classifier and feature optimization for comprehensive cancer diagnosis using gene expression data
    Krishnapuram, B
    Carin, L
    Hartemink, AJ
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (2-3) : 227 - 242
  • [32] Null space based feature selection method for gene expression data
    Sharma, Alok
    Imoto, Seiya
    Miyano, Satoru
    Sharma, Vandana
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2012, 3 (04) : 269 - 276
  • [33] Stable Feature Selection for Gene Expression using Enhanced Binary Particle Swarm Optimization
    Dhrif, Hassen
    Wuchty, Stefan
    ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2020, : 437 - 444
  • [34] QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature Selection
    Sadeg, Souhila
    Hamdad, Leila
    Remache, Amine Riad
    Karech, Mehdi Nedjmeddine
    Benatchba, Karima
    Habbas, Zineb
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2019, PT II, 2019, 11507 : 785 - 796
  • [35] An intelligent feature selection approach based on moth flame optimization for medical diagnosis
    Abu Khurmaa, Ruba
    Aljarah, Ibrahim
    Sharieh, Ahmad
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (12) : 7165 - 7204
  • [36] Feature (gene) selection in gene expression-based tumor classification
    Xiong, MM
    Li, WJ
    Zhao, JY
    Jin, L
    Boerwinkle, E
    MOLECULAR GENETICS AND METABOLISM, 2001, 73 (03) : 239 - 247
  • [37] A Survey on Hybrid Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification
    Almugren, Nada
    Alshamlan, Hala
    IEEE ACCESS, 2019, 7 : 78533 - 78548
  • [38] Discrete Biogeography Based Optimization for Feature Selection in Molecular Signatures
    Liu, Bo
    Tian, Meihong
    Zhang, Chunhua
    Li, Xiangtao
    MOLECULAR INFORMATICS, 2015, 34 (04) : 197 - 215
  • [39] Feature selection and gene clustering from gene expression data
    Mitra, P
    Majumder, DD
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 343 - 346
  • [40] Critical Gene Selection by a Modified Particle Swarm Optimization Approach
    Jana, Biswajit
    Acharyaa, Sriyankar
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT II, 2019, 11942 : 165 - 175