A New Algorithm for Cancer Biomarker Gene Detection Using Harris Hawks Optimization

被引:4
作者
AlMazrua, Halah [1 ]
AlShamlan, Hala [1 ]
机构
[1] King Saud Univ KSU, Coll Comp & Informat Sci, Informat Technol Dept, Riyadh 11451, Saudi Arabia
关键词
bio-inspired algorithms; bioinformatics; cancer classification; evolutionary algorithm; feature selection; gene expression; Harris Hawks Optimization; k-nearest neighbor; support vector machine; CLASSIFICATION; PREDICTION;
D O I
10.3390/s22197273
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper presents two novel swarm intelligence algorithms for gene selection, HHO-SVM and HHO-KNN. Both of these algorithms are based on Harris Hawks Optimization (HHO), one in conjunction with support vector machines (SVM) and the other in conjunction with k-nearest neighbors (k-NN). In both algorithms, the goal is to determine a small gene subset that can be used to classify samples with a high degree of accuracy. The proposed algorithms are divided into two phases. To obtain an accurate gene set and to deal with the challenge of high-dimensional data, the redundancy analysis and relevance calculation are conducted in the first phase. To solve the gene selection problem, the second phase applies SVM and k-NN with leave-one-out cross-validation. A performance evaluation was performed on six microarray data sets using the two proposed algorithms. A comparison of the two proposed algorithms with several known algorithms indicates that both of them perform quite well in terms of classification accuracy and the number of selected genes.
引用
收藏
页数:11
相关论文
共 18 条
  • [1] Almugren N., 2019, 2019 IEEE C COMPUTAT, P1, DOI DOI 10.1109/cais.2019.8769532
  • [2] Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification
    Alshamlan, Hala M.
    Badr, Ghada H.
    Alohali, Yousef A.
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 56 : 49 - 60
  • [3] MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia
    Armstrong, SA
    Staunton, JE
    Silverman, LB
    Pieters, R
    de Boer, ML
    Minden, MD
    Sallan, SE
    Lander, ES
    Golub, TR
    Korsmeyer, SJ
    [J]. NATURE GENETICS, 2002, 30 (01) : 41 - 47
  • [4] Gene-expression profiles predict survival of patients with lung adenocarcinoma
    Beer, DG
    Kardia, SLR
    Huang, CC
    Giordano, TJ
    Levin, AM
    Misek, DE
    Lin, L
    Chen, GA
    Gharib, TG
    Thomas, DG
    Lizyness, ML
    Kuick, R
    Hayasaka, S
    Taylor, JMG
    Iannettoni, MD
    Orringer, MB
    Hanash, S
    [J]. NATURE MEDICINE, 2002, 8 (08) : 816 - 824
  • [5] Gene selection and classification of microarray data method based on mutual information and moth flame algorithm
    Dabba, Ali
    Tari, Abdelkamel
    Meftali, Samy
    Mokhtari, Rabah
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 166
  • [6] Hybridization of Moth flame optimization algorithm and quantum computing for gene selection in microarray data
    Dabba, Ali
    Tari, Abdelkamel
    Meftali, Samy
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (02) : 2731 - 2750
  • [7] Doreswamy M. Umme, 2016, Int. J. Soft Comput., Artif. Intell. Appl., V5, P01, DOI [10.5121/ijscai.2016.5301, DOI 10.5121/IJSCAI.2016.5301]
  • [8] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537
  • [9] Hameed S.S., 2018, J COMPUT SCI-NETH, V14, P868, DOI [10.3844/jcssp.2018.868.880, DOI 10.3844/JCSSP.2018.868.880]
  • [10] Han J., 2006, Data Mining: Concepts and Techniques, V2nd ed.