Gene selection and classification of microarray data method based on mutual information and moth flame algorithm

被引:69
作者
Dabba, Ali [1 ,3 ,4 ,6 ]
Tari, Abdelkamel [1 ,5 ]
Meftali, Samy [2 ,4 ]
Mokhtari, Rabah [3 ,6 ]
机构
[1] Abderrahmane Mira Univ, Dept Comp Sci, Fac Exact Sci, Bejaia, Algeria
[2] Lille Univ, Fac Sci & Technol, Lille, France
[3] Mohamed Boudiaf Univ, Dept Comp Sci, Fac Math & Comp Sci, Msila, Algeria
[4] Res Ctr Comp Sci Signal & Automat Control Lille C, Lille, France
[5] Lab Med Comp LIMED, Msila, Algeria
[6] Lab Informat & Its Applicat Msila LIAM, Msila, Algeria
关键词
Gene expression; Feature selection; Microarray; Cancer classification; Moth Flame Algorithm; Mutual information maximization; Bio-inspired algorithms; Bioinformatics; Optimization algorithms; Evolutionary algorithm; Molecular biology; Swarm intelligence; PARTICLE SWARM OPTIMIZATION; HYBRID FEATURE-SELECTION; CANCER CLASSIFICATION; EXPRESSION DATA; MOLECULAR CLASSIFICATION; PREDICTION; PATTERNS; BINARY; TUMOR; CARCINOMAS;
D O I
10.1016/j.eswa.2020.114012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several techniques or methods may help in detecting diseases and cancer. Creating an effective method for extracting disease information is one of the major challenges in the classification of gene expression data as long as there is (in the presence) a massive amount of redundant data and noise. Bio-inspired algorithms are among the most effective when used for solving gene selection. Moth Flame Optimization Algorithm (MFOA) is computationally less expensive and can converge faster than other methods. In this paper, we propose a new extension of the MFOA called the modified Moth Flame Algorithm (mMFA), the mMFA is combined with Mutual Information Maximization (MIM) to solve gene selection in microarray data classification. Our approach Called Mutual Information Maximization - modified Moth Flame Algorithm (MIMmMFA), the MIM based pre-filtering technique is used to measure the relevance and the redundancy of the genes, and the mMFA is used to evolve gene subsets and evaluated by the fitness function, which uses a Support Vector Machine (SVM) with Leave One Out Cross Validation (LOOCV) classifier and the number of selected genes. In order to test the performance of the proposed MIM-mMFA algorithm, we compared the MIM-mMFA algorithm with other recently published algorithms in the literature. The experiment results which have been conducted on sixteen benchmark datasets either binary-class or multi-class, confirm that MIM-mMFA algorithm provides a greater classification accuracy.
引用
收藏
页数:17
相关论文
共 64 条
  • [1] A novel gene selection algorithm for cancer classification using microarray datasets
    Alanni, Russul
    Hou, Jingyu
    Azzawi, Hasseeb
    Xiang, Yong
    [J]. BMC MEDICAL GENOMICS, 2019, 12 (1)
  • [2] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [3] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [4] Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification
    Alshamlan, Hala M.
    Badr, Ghada H.
    Alohali, Yousef A.
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 56 : 49 - 60
  • [5] [Anonymous], 1997, Proceedings of the Fourteenth International Conference on Machine Learning
  • [6] [Anonymous], 2007, P 9 ANN C COMP GEN E
  • [7] MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia
    Armstrong, SA
    Staunton, JE
    Silverman, LB
    Pieters, R
    de Boer, ML
    Minden, MD
    Sallan, SE
    Lander, ES
    Golub, TR
    Korsmeyer, SJ
    [J]. NATURE GENETICS, 2002, 30 (01) : 41 - 47
  • [8] A Clustering Approach for Feature Selection in Microarray Data Classification Using Random forest
    Aydadenta, Husna
    Adiwijaya
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2018, 14 (05): : 1167 - 1175
  • [9] Artificial neural network classification of microarray data using new hybrid gene selection method
    Aziz, Rabia
    Verma, C. K.
    Jha, Manoj
    Srivastava, Namita
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 17 (01) : 42 - 65
  • [10] Bell E. T., 2014, MEN OF MATH