AltWOA: Altruistic Whale Optimization Algorithm for feature selection on microarray datasets

被引:63
作者
Kundu, Rohit [1 ]
Chattopadhyay, Soham [1 ]
Cuevas, Erik [2 ]
Sarkar, Ram [3 ]
机构
[1] Jadavpur Univ, Dept Elect Engn, Kolkata 700032, India
[2] Univ Guadalajara, Dept Elect, CUCEI, Av Revolut 1500, Guadalajara, Jal, Mexico
[3] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata 700032, India
关键词
Feature selection; Evolutionary meta-heuristic; Altruism; Cancer detection; Gene expression; Microarray data; GRAVITATIONAL SEARCH ALGORITHM; PARAMETER-ESTIMATION; MOLECULAR CLASSIFICATION; GENETIC ALGORITHM; PREDICTION; CANCER; PSO; IDENTIFICATION; DIAGNOSIS; BIOMARKER;
D O I
10.1016/j.compbiomed.2022.105349
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The data-driven modern era has enabled the collection of large amounts of biomedical and clinical data. DNA microarray gene expression datasets have mainly gained significant attention to the research community owing to their ability to identify diseases through the "bio-markers" or specific alterations in the gene sequence that represent that particular disease (for example, different types of cancer). However, gene expression datasets are very high-dimensional, while only a few of those are "bio-markers". Meta-heuristic-based feature selection effectively filters out only the relevant genes from a large set of attributes efficiently to reduce data storage and computation requirements. To this end, in this paper, we propose an Altruistic Whale Optimization Algorithm (AltWOA) for the feature selection problem in high-dimensional microarray data. AltWOA is an improvement on the basic Whale Optimization Algorithm. We embed the concept of altruism in the whale population to help efficient propagation of candidate solutions that can reach the global optima over the iterations. Evaluation of the proposed method on eight high dimensional microarray datasets reveals the superiority of AltWOA compared to popular and classical techniques in the literature on the same datasets both in terms of accuracy and the final number of features selected. The relevant codes for the proposed approach are available publicly at https://gith ub.com/Rohit-Kundu/AltWOA.
引用
收藏
页数:16
相关论文
共 114 条
[1]   Solar photovoltaic parameter estimation using an improved equilibrium optimizer [J].
Abdel-Basset, Mohamed ;
Mohamed, Reda ;
Mirjalili, Seyedali ;
Chakrabortty, Ripon K. ;
Ryan, Michael J. .
SOLAR ENERGY, 2020, 209 :694-708
[2]   RETRACTED: A hybrid whale optimization algorithm based on local search strategy for the permutation flow shop scheduling problem (Retracted article. See vol. 128, pg. 567, 2022) [J].
Abdel-Basset, Mohamed ;
Manogaran, Gunasekaran ;
El-Shahat, Doaa ;
Mirjalili, Seyedali .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 85 :129-145
[3]   Equilibrium optimizer based multi dimensions operation of hybrid AC/DC grids [J].
Abdul-hamied, Dalia T. ;
Shaheen, Abdullah M. ;
Salem, Waleed A. ;
Gabr, Walaa, I ;
El-sehiemy, Ragab A. .
ALEXANDRIA ENGINEERING JOURNAL, 2020, 59 (06) :4787-4803
[4]   Toward a gold standard for promoter prediction evaluation [J].
Abeel, Thomas ;
Van de Peer, Yves ;
Saeys, Yvan .
BIOINFORMATICS, 2009, 25 (12) :I313-I320
[5]   The Arithmetic Optimization Algorithm [J].
Abualigah, Laith ;
Diabat, Ali ;
Mirjalili, Seyedali ;
Elaziz, Mohamed Abd ;
Gandomi, Amir H. .
COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2021, 376
[6]   A new hybrid firefly algorithm and particle swarm optimization for tuning parameter estimation in penalized support vector machine with application in chemometrics [J].
Al-Thanoon, Niam Abdulmunim ;
Qasim, Omar Saber ;
Algamal, Zakariya Yahya .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 184 :142-152
[7]   Optimizing connection weights in neural networks using the whale optimization algorithm [J].
Aljarah, Ibrahim ;
Faris, Hossam ;
Mirjalili, Seyedali .
SOFT COMPUTING, 2018, 22 (01) :1-15
[8]   A Survey on Hybrid Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification [J].
Almugren, Nada ;
Alshamlan, Hala .
IEEE ACCESS, 2019, 7 :78533-78548
[9]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[10]   The curse(s) of dimensionality [J].
Altman, Naomi ;
Krzywinski, Martin .
NATURE METHODS, 2018, 15 (06) :399-400