Hybridization of Moth flame optimization algorithm and quantum computing for gene selection in microarray data

被引:47
作者
Dabba, Ali [1 ,2 ,5 ]
Tari, Abdelkamel [2 ,3 ]
Meftali, Samy [4 ,5 ]
机构
[1] Mohamed Boudiaf Univ, Fac Math & Comp Sci, Comp Sci Dept, Msila, Algeria
[2] Abderrahmane Mira Univ, Fac Sci, Comp Sci Dept, Bejaia, Algeria
[3] Med Comp Lab LIMED, Bejaia, Algeria
[4] Univ Lille, Lille, France
[5] Res Ctr Comp Sci Signal & Automat Control Lille C, Lille, France
关键词
Gene expression; Feature selection; Moth flame optimization algorithm; Quantum computing; Microarray data; Cancer classification; Bio-inspired algorithms; Molecular biology; Optimization algorithms; Evolutionary algorithms; Swarm intelligence; MOLECULAR CLASSIFICATION; MUTUAL INFORMATION; EXPRESSION; HYBRID; CANCER; PREDICTION; PATTERNS; TUMOR; CARCINOMAS;
D O I
10.1007/s12652-020-02434-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ever-increasing data in various fields like Bioinformatics field, which has led to the need to find a way to reduce the data dimensionality. Gene selection problem has a large number of genes (relevant, redundant or noise), which needs an effective method to help us in detecting diseases and cancer. In this problem, computational complexity is reduced by selecting a small number of genes, but it is necessary to choose the relevant genes to keep a high level of accuracy. Therefore, in order to find the optimal gene subset, it is essential to devise an effective exploration approach that can investigate a large number of possible gene subsets. In addition, it is required to use a powerful evaluation method to evaluate the relevance of these gene subsets. In this paper, we present a novel swarm intelligence algorithm for gene selection called quantum moth flame optimization algorithm (QMFOA), which based on hybridization between quantum computation and moth flame optimization (MFO) algorithm. The purpose of QMFOA is to identify a small gene subset that can be used to classify samples with high accuracy. The QMFOA has a simple two-phase approach, the first phase is a pre-processing that uses to address the difficulty of high-dimensional data, which measure the redundancy and the relevance of the gene, in order to obtain the relevant gene set. The second phase is a hybridization among MFOA, quantum computing, and support vector machine with leave-one-out cross-validation, etc., in order to solve the gene selection problem. We use quantum computing to guarantee a good trade-off between the exploration and the exploitation of the search space, while a new update moth operation using Hamming distance and Archimedes spiral allows an efficient exploration of all possible gene-subsets. The main objective of the second phase is to determine the best relevant gene subset of all genes obtained in the first phase. In order to assess the performance of the proposed QMFOA, we test QMFOA on thirteen microarray datasets (six binary-class and seven multi-class) to evaluate and compare the classification accuracy and the number of genes selected by the QMFOA against many recently published algorithms. Experimental results show that QMFOA provides greater classification accuracy and the ability to reduce the number of selected genes compared to the other algorithms.
引用
收藏
页码:2731 / 2750
页数:20
相关论文
共 66 条
[1]   HYBRIDIZATION OF GENETIC AND QUANTUM ALGORITHM FOR GENE SELECTION AND CLASSIFICATION OF MICROARRAY DATA [J].
Abderrahim, Allani ;
Talbi, El-Ghazali ;
Khaled, Mellouli .
INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2012, 23 (02) :431-444
[2]   A novel gene selection algorithm for cancer classification using microarray datasets [J].
Alanni, Russul ;
Hou, Jingyu ;
Azzawi, Hasseeb ;
Xiang, Yong .
BMC MEDICAL GENOMICS, 2019, 12 (1)
[3]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[4]   Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification [J].
Alshamlan, Hala M. ;
Badr, Ghada H. ;
Alohali, Yousef A. .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 56 :49-60
[5]  
[Anonymous], 2005, NEUROFUZZY MODELING
[6]  
[Anonymous], 1994, AAAI S INTELLIGENT R
[7]   MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia [J].
Armstrong, SA ;
Staunton, JE ;
Silverman, LB ;
Pieters, R ;
de Boer, ML ;
Minden, MD ;
Sallan, SE ;
Lander, ES ;
Golub, TR ;
Korsmeyer, SJ .
NATURE GENETICS, 2002, 30 (01) :41-47
[8]   Artificial neural network classification of microarray data using new hybrid gene selection method [J].
Aziz, Rabia ;
Verma, C. K. ;
Jha, Manoj ;
Srivastava, Namita .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 17 (01) :42-65
[9]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[10]   Quantum information and computation [J].
Bennett, CH ;
DiVincenzo, DP .
NATURE, 2000, 404 (6775) :247-255