A comprehensive learning based swarm optimization approach for feature selection in gene expression data

Cited by: 2
Authors
Easwaran, Subha [1]
Venugopal, Jothi Prakash [2]
Subramanian, Arul Antran Vijay [3]
Sundaram, Gopikrishnan [4]
Naseeba, Beebi [4]
Affiliations
[1] Karpagam Coll Engn, Dept Sci & Humanities, Coimbatore 641032, Tamil Nadu, India
[2] Karpagam Coll Engn, Dept Informat Technol, Coimbatore 641032, Tamil Nadu, India
[3] Karpagam Coll Engn, Dept Comp Sci & Engn, Coimbatore 641032, Tamil Nadu, India
[4] VIT AP Univ, Sch Comp Sci & Engn, Amaravathi 522241, Andhra Pradesh, India
Keywords
Comprehensive learning; Feature selection; Gene expression; Gene selection; Swarm intelligence; Cancer classification; MICROARRAY; CLASSIFICATION
DOI
10.1016/j.heliyon.2024.e37165
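Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract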
Gene expression data analysis is challenging due to the high dimensionality and complexity of the data. Feature selection, which identifies relevant genes, is a common preprocessing step. We propose a Comprehensive Learning-Based Swarm Optimization (CLBSO) approach for feature selection in gene expression data. CLBSO leverages the strengths of ants and grasshoppers to efficiently explore the high-dimensional search space. Ants perform local search and leave pheromone trails to guide the swarm, while grasshoppers use their ability to jump long distances to explore new regions and avoid local optima. The proposed approach was evaluated on several publicly available gene expression datasets and compared with state-of-the-art feature selection methods. CLBSO achieved an average accuracy improvement of 15% over the original high-dimensional data and outperformed other feature selection methods by up to 10%. For instance, in the Pancreatic cancer dataset, CLBSO achieved 97.2% accuracy, significantly higher than XGBoost-MOGA's 84.0%. Convergence analysis showed CLBSO required fewer iterations to reach optimal solutions. Statistical analysis confirmed significant performance improvements, and stability analysis demonstrated consistent gene subset selection across different runs. These findings highlight the robustness and efficacy of CLBSO in handling complex gene expression datasets, making it a valuable tool for enhancing classification tasks in bioinformatics.
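The abstract's division of labour (pheromone-guided local moves plus occasional long jumps) can be illustrated with a minimal sketch. This is not the authors' CLBSO implementation: the toy objective, the update rule, and every name and parameter (`fitness`, `ant_move`, `grasshopper_jump`, `hybrid_search`, `tau`, `rho`, `jump_prob`) are assumptions made only for illustration, and the objective simply rewards a hypothetical set of "relevant" feature indices while penalising subset size.

```python
import random

def fitness(mask, relevant, penalty=0.01):
    """Toy objective (assumption): fraction of relevant features selected,
    minus a small penalty per selected feature."""
    hits = sum(mask[i] for i in relevant)
    return hits / len(relevant) - penalty * sum(mask)

def ant_move(mask, tau, rng):
    """Local, ant-like step: resample one feature bit using its pheromone level."""
    cand = mask[:]
    i = rng.randrange(len(mask))
    cand[i] = 1 if rng.random() < tau[i] else 0
    return cand

def grasshopper_jump(mask, rng, rate=0.3):
    """Long, grasshopper-like step: flip each bit independently with probability `rate`."""
    return [1 - b if rng.random() < rate else b for b in mask]

def hybrid_search(n_features, relevant, iters=2000, jump_prob=0.1, rho=0.1, seed=0):
    rng = random.Random(seed)
    tau = [0.5] * n_features                      # pheromone per feature
    best = [rng.randint(0, 1) for _ in range(n_features)]
    best_fit = fitness(best, relevant)
    history = [best_fit]                          # best fitness after each iteration
    for _ in range(iters):
        if rng.random() < jump_prob:
            cand = grasshopper_jump(best, rng)    # explore a distant region
        else:
            cand = ant_move(best, tau, rng)       # refine locally
        cand_fit = fitness(cand, relevant)
        if cand_fit > best_fit:                   # greedy acceptance
            best, best_fit = cand, cand_fit
            # Evaporate old pheromone and deposit along the new best subset,
            # clamped so that no feature becomes impossible to toggle.
            tau = [min(0.8, max(0.2, (1 - rho) * t + rho * b))
                   for t, b in zip(tau, best)]
        history.append(best_fit)
    return best, best_fit, history
```

Calling `hybrid_search(30, [2, 7, 11, 19])` returns the best binary mask found, its fitness, and the per-iteration best fitness, which gives a rough convergence curve. In a real gene selection setting the toy `fitness` would be replaced by something like cross-validated classifier accuracy on the selected genes.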
Pages: 20
Related papers
50 in total
  • [21] Particle swarm optimization with a modified sigmoid function for gene selection from gene expression data
    Mohamad M.S.
    Omatu S.
    Deris S.
    Yoshioka M.
    Artificial Life and Robotics, 2010, 15 (01) : 21 - 24
  • [22] Intelligent Facial Expression Recognition Using Particle Swarm Optimization Based Feature Selection
    Robson, Adam
    Zhang, Li
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 305 - 311
  • [23] Swarm Intelligence Approach for Feature Selection Problem
    Tuba, Eva
    Alihodzic, Adis
    Tuba, Una
    Hrosik, Romana Capor
    Tuba, Milan
    2022 10TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS), 2022,
  • [24] A comprehensive survey on computational learning methods for analysis of gene expression data
    Bhandari, Nikita
    Walambe, Rahee
    Kotecha, Ketan
    Khare, Satyajeet P.
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2022, 9
  • [25] Feature Selection in Microarray Gene Expression Data Using Fisher Discriminant Ratio
    Sarbazi-Azad, Saeed
    Abadeh, Mohammad Saniee
    Abadi, Mehdi Irannejad Najaf
    2018 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2018, : 225 - 230
  • [26] Review on Feature Selection Methods for Gene Expression Data Classification
    Almutiri, Talal
    Saeed, Faisal
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 24 - 34
  • [27] Feature Selection of Gene Expression Data for Cancer Classification: A Review
    Singh, Rabindra Kumar
    Sivabalakrishnan, M.
    BIG DATA, CLOUD AND COMPUTING CHALLENGES, 2015, 50 : 52 - 57
  • [28] Particle swarm optimization algorithm based on comprehensive scoring framework for high-dimensional feature selection
    Wei, Bo
    Yang, Shanshan
    Zha, Wentao
    Deng, Li
    Huang, Jiangyi
    Su, Xiaohui
    Wang, Feng
    SWARM AND EVOLUTIONARY COMPUTATION, 2025, 95
  • [29] Feature Selection for Classification Using Particle Swarm Optimization
    Brezocnik, Lucija
    17TH IEEE INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES - IEEE EUROCON 2017 CONFERENCE PROCEEDINGS, 2017, : 966 - 971
  • [30] Correlation feature selection based improved-Binary Particle Swarm Optimization for gene selection and cancer classification
    Jain, Indu
    Jain, Vinod Kumar
    Jain, Renu
    APPLIED SOFT COMPUTING, 2018, 62 : 203 - 215