A comprehensive learning based swarm optimization approach for feature selection in gene expression data

被引:2
|
作者
Easwaran, Subha [1 ]
Venugopal, Jothi Prakash [2 ]
Subramanian, Arul Antran Vijay [3 ]
Sundaram, Gopikrishnan [4 ]
Naseeba, Beebi [4 ]
机构
[1] Karpagam Coll Engn, Dept Sci & Humanities, Coimbatore 641032, Tamil Nadu, India
[2] Karpagam Coll Engn, Dept Informat Technol, Coimbatore 641032, Tamil Nadu, India
[3] Karpagam Coll Engn, Dept Comp Sci & Engn, Coimbatore 641032, Tamil Nadu, India
[4] VIT AP Univ, Sch Comp Sci & Engn, Amaravathi 522241, Andhra Pradesh, India
关键词
Comprehensive learning; Feature selection; Gene expression; Gene selection; Swarm intelligence; Cancer classification; MICROARRAY; CLASSIFICATION;
D O I
10.1016/j.heliyon.2024.e37165
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene expression data analysis is challenging due to the high dimensionality and complexity of the data. Feature selection, which identifies relevant genes, is a common preprocessing step. We propose a Comprehensive Learning-Based Swarm Optimization (CLBSO) approach for feature selection in gene expression data. CLBSO leverages the strengths of ants and grasshoppers to efficiently explore the high-dimensional search space. Ants perform local search and leave pheromone trails to guide the swarm, while grasshoppers use their ability to jump long distances to explore new regions and avoid local optima. The proposed approach was evaluated on several publicly available gene expression datasets and compared with state-of-the-art feature selection methods. CLBSO achieved an average accuracy improvement of 15% over the original high-dimensional data and outperformed other feature selection methods by up to 10%. For instance, in the Pancreatic cancer dataset, CLBSO achieved 97.2% accuracy, significantly higher than XGBoost-MOGA's 84.0%. Convergence analysis showed CLBSO required fewer iterations to reach optimal solutions. Statistical analysis confirmed significant performance improvements, and stability analysis demonstrated consistent gene subset selection across different runs. These findings highlight the robustness and efficacy of CLBSO in handling complex gene expression datasets, making it a valuable tool for enhancing classification tasks in bioinformatics.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A Comprehensive Survey of Recent Hybrid Feature Selection Methods in Cancer Microarray Gene Expression Data
    Almazrua, Halah
    Alshamlan, Hala
    IEEE ACCESS, 2022, 10 : 71427 - 71449
  • [2] A Particle Swarm Optimization based Feature Selection Approach to Transfer Learning in Classification
    Nguyen, Bach Hoai
    Xue, Bing
    Andreae, Peter
    GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 37 - 44
  • [3] An efficient statistical feature selection approach for classification of gene expression data
    Chandra, B.
    Gupta, Manish
    JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (04) : 529 - 535
  • [4] Microarray Gene Expression Dataset Feature Selection and Classification with Swarm Optimization to Diagnosis Diseases
    Krishna, Peddarapu Rama
    Rajarajeswari, Pothuraju
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 536 - 546
  • [5] Feature Selection for Alzheimer's Gene Expression Data Using Modified Binary Particle Swarm Optimization
    Ramaswamy, Ramya
    Kandhasamy, Premalatha
    Palaniswamy, Swathypriyadharsini
    IETE JOURNAL OF RESEARCH, 2023, 69 (01) : 9 - 20
  • [6] Multiobjective Binary Biogeography Based Optimization for Feature Selection Using Gene Expression Data
    Li, Xiangtao
    Yin, Minghao
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2013, 12 (04) : 343 - 353
  • [7] Unsupervised Feature Selection for Microarray Gene Expression Data Based on Discriminative Structure Learning
    Ye, Xiucai
    Sakurai, Tetsuya
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2018, 24 (06) : 725 - 741
  • [8] A particle swarm optimization-based feature selection for unsupervised transfer learning
    Sanodiya, Rakesh Kumar
    Tiwari, Mrinalini
    Mathew, Jimson
    Saha, Sriparna
    Saha, Subhajyoti
    SOFT COMPUTING, 2020, 24 (24) : 18713 - 18731
  • [9] A particle swarm optimization-based feature selection for unsupervised transfer learning
    Rakesh Kumar Sanodiya
    Mrinalini Tiwari
    Jimson Mathew
    Sriparna Saha
    Subhajyoti Saha
    Soft Computing, 2020, 24 : 18713 - 18731
  • [10] An innovative approach for feature selection based on chicken swarm optimization
    Hafez, Ahmed Ibrahem
    Zawbaa, Hossam M.
    Emary, E.
    Mahmoud, Hamdi A.
    Hassanien, Aboul Ella
    PROCEEDINGS OF THE 2015 SEVENTH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2015), 2015, : 19 - 24