Analysis of high-dimensional biomedical data using an evolutionary multi-objective emperor penguin optimizer

被引:48
作者
Baliarsingh, Santos Kumar [1 ]
Vipsita, Swati [1 ]
Muhammad, Khan [2 ]
Bakshi, Sambit [3 ]
机构
[1] Int Inst Informat Technol, Dept Comp Sci & Engn, Bhubaneswar, India
[2] Sejong Univ, Dept Software, Seoul 143747, South Korea
[3] Natl Inst Technol, Dept Comp Sci & Engn, Rourkela, India
关键词
Microarray; Fisher score; MOCEPO; mRMR; Kernel ridge regression; CONVOLUTIONAL NEURAL-NETWORKS; DISTRIBUTED FEATURE-SELECTION; GENE-EXPRESSION DATA; FIRE DETECTION; CLASSIFICATION; CANCER; ALGORITHM; ENSEMBLE; EXPLORATION/EXPLOITATION; SURVEILLANCE;
D O I
10.1016/j.swevo.2019.04.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the last two decades, there has been an expeditious expansion in the generation and exploration of high dimensional biomedical data. Identification of biomarkers from the genomics data poses a significant challenge in microarray data analysis. Therefore, for the methodical analysis of the genomics dataset, it is paramount to develop some effective algorithms. In this work, a multi-objective version of the emperor penguin optimization (EPO) algorithm with chaos, namely, multi-objective chaotic EPO (MOCEPO) is proposed. The suggested approach extends the original continuous single objective EPO to a competent binary multi-objective model. The objectives are to minimize the number of selected genes (NSG) and to maximize the classification accuracy (CA). In this work, Fisher score and minimum redundancy maximum relevance (mRMR) are independently used as initial filters. Further, the proposed MOCEPO is employed for the simultaneous optimal feature selection and cancer classification. The proposed algorithm is successfully experimented on seven well-known high-dimensional binary-class as well as multi-class datasets. To evaluate the effectiveness, the proposed method is compared with non-dominated sorting genetic algorithm (NSGA-H), multi-objective particle swarm optimization (MOPSO), chaotic version of GA for multi-objective optimization (CGAMO), and chaotic MOPSO methods. The experimental results show that the proposed framework achieves better CA with minimum NSG compared to the existing schemes. The presented approach exhibits its efficacy with regard to NSG, accuracy, sensitivity, specificity, and F-measure.
引用
收藏
页码:262 / 273
页数:12
相关论文
共 66 条
[1]  
Aiguo Wang, 2014, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), P74, DOI 10.1109/BIBM.2014.6999251
[2]   The exploration/exploitation tradeoff in dynamic cellular genetic algorithms [J].
Alba, E ;
Dorronsoro, B .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2005, 9 (02) :126-142
[3]   An improved class of real-coded Genetic Algorithms for numerical optimization [J].
Ali, Mostafa Z. ;
Awad, Noor H. ;
Suganthan, Ponnuthurai N. ;
Shatnawi, Ali M. ;
Reynolds, Robert G. .
NEUROCOMPUTING, 2018, 275 :155-166
[4]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[5]   mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling [J].
Alshamlan, Hala ;
Badr, Ghada ;
Alohali, Yousef .
BIOMED RESEARCH INTERNATIONAL, 2015, 2015
[6]   Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification [J].
Alshamlan, Hala M. ;
Badr, Ghada H. ;
Alohali, Yousef A. .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 56 :49-60
[7]  
[Anonymous], 2017, CHIN J OCEANOL LIMNO
[8]  
[Anonymous], 2014, J. Power Energy Eng, DOI DOI 10.4236/JPEE.2014.24025
[9]  
[Anonymous], IEEE T CYBERN
[10]  
[Anonymous], 2018, Adv. Data Anal. Classif.