Stable Feature Selection using Improved Whale Optimization Algorithm for Microarray Datasets

被引:0
作者
Theng, Dipti [1 ]
Bhoyar, Kishor K. [2 ]
机构
[1] YCCE, Comp Technol Dept, Nagpur, Maharashtra, India
[2] YCCE, Comp Sci & Engn Dept, Nagpur, Maharashtra, India
来源
ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL | 2023年 / 12卷 / 01期
关键词
feature selection; stability of feature selection; whale optimization algorithm; marine predator algorithm; grey wolf optimization; microarray datasets; high dimensional datasets;
D O I
10.14201/adcaij.31187
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A microarray is a collection of DNA sequences that reflect an organism's whole gene set and are organized in a grid pattern for use in genetic testing. Microarray datasets are extremely high-dimensional and have a very small sample size, posing the challenges of insufficient data and high computational complexity. Identification of true biomarkers that are the most significant features (a very small subset of the complete feature set) is desired to solve these issues. This reduces over-fitting, and time complexity, and improves model generalization. Various feature selection algorithms are used for this biomarker identification. This research proposed a modification to the whale optimization algorithm (WOAm) for biomarker discovery, in which the fitness of each search agent is evaluated using the hinge loss function during the hunting for prey phase to determine the optimal search agent. Also compared the results of the proposed modified algorithm with the original whale optimization algorithm and also with contemporary algorithms like the marine predator algorithm and grey wolf optimization. All these algorithms are evaluated on six different high-dimensional microarray datasets. It has been observed that the proposed modification for the whale optimization algorithm has significantly improved the results of feature selection across all the datasets. Domain experts trust the resultant biomarker/ associated genes by the stability of the results obtained. The chosen feature set's stability was also evaluated during the research work. According to the findings, our proposed WOAm has superior stability compared to other algorithms for the CNS, colon, Leukemia, and OSCC. datasets.
引用
收藏
页数:16
相关论文
共 50 条
[21]   A Spark-based Distributed Whale Optimization Algorithm for Feature Selection [J].
Chen, Hongwei ;
Hu, Zhou ;
Han, Lin ;
Hou, Qiao ;
Ye, Zhiwei ;
Yuan, Jiansen ;
Zeng, Jun .
PROCEEDINGS OF THE 2019 10TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS - TECHNOLOGY AND APPLICATIONS (IDAACS), VOL. 1, 2019, :70-74
[22]   Multi-objective feature selection algorithm using Beluga Whale Optimization [J].
Esfahani, Kiana Kouhpah ;
Zade, Behnam Mohammad Hasani ;
Mansouri, Najme .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2025, 257
[23]   Hybrid feature selection based on SLI and genetic algorithm for microarray datasets [J].
Sedighe Abasabadi ;
Hossein Nematzadeh ;
Homayun Motameni ;
Ebrahim Akbari .
The Journal of Supercomputing, 2022, 78 :19725-19753
[24]   Hybrid feature selection based on SLI and genetic algorithm for microarray datasets [J].
Abasabadi, Sedighe ;
Nematzadeh, Hossein ;
Motameni, Homayun ;
Akbari, Ebrahim .
JOURNAL OF SUPERCOMPUTING, 2022, 78 (18) :19725-19753
[25]   A Novel Hybrid Algorithm for Feature Selection Based on Whale Optimization Algorithm [J].
Zheng, Yuefeng ;
Li, Ying ;
Wang, Gang ;
Chen, Yupeng ;
Xu, Qian ;
Fan, Jiahao ;
Cui, Xueting .
IEEE ACCESS, 2019, 7 :14908-14923
[26]   A Feature Selection Method for Software Defect Prediction Based on Improved Beluga Whale Optimization Algorithm [J].
Qiu, Shaoming ;
He, Jingjie ;
Wang, Yan ;
E, Bicong .
CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 83 (03) :4879-4898
[27]   Multiobjective whale optimization algorithm-based feature selection for intelligent systems [J].
Riyahi, Milad ;
Rafsanjani, Marjan K. ;
Gupta, Brij B. ;
Alhalabi, Wadee .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) :9037-9054
[28]   A binary Sine Cosine-Modified Whale Optimization Algorithm for Feature Selection [J].
Eid, Marwa M. ;
El-kenawy, El-Sayed M. ;
Ibrahim, Abdelhameed .
2021 IEEE NATIONAL COMPUTING COLLEGES CONFERENCE (NCCC 2021), 2021, :1133-+
[29]   Whale optimization algorithm-based email spam feature selection method using rotation forest algorithm for classification [J].
Shuaib, Maryam ;
Abdulhamid, Shafi'i Muhammad ;
Adebayo, Olawale Surajudeen ;
Osho, Oluwafemi ;
Idris, Ismaila ;
Alhassan, John K. ;
Rana, Nadim .
SN APPLIED SCIENCES, 2019, 1 (05)
[30]   Whale optimization algorithm-based email spam feature selection method using rotation forest algorithm for classification [J].
Maryam Shuaib ;
Shafi’i Muhammad Abdulhamid ;
Olawale Surajudeen Adebayo ;
Oluwafemi Osho ;
Ismaila Idris ;
John K. Alhassan ;
Nadim Rana .
SN Applied Sciences, 2019, 1