Gene Selection for Microarray Cancer Classification based on Manta Rays Foraging Optimization and Support Vector Machines

被引:0
作者
Essam H. Houssein
Hager N. Hassan
Mustafa M. Al-Sayed
Emad Nabil
机构
[1] Minia University,Faculty of Computers and Information
[2] Cairo University,Faculty of Computers and Artificial Intelligence
[3] Islamic University of Madinah,Faculty of Computer Science and Information Systems
来源
Arabian Journal for Science and Engineering | 2022年 / 47卷
关键词
Microarray; Gene expression; Gene selection; Cancer classification; Feature selection; Manta Ray Foraging Optimization algorithm; Support vector machines; Minimum Redundancy Maximum Relevance;
D O I
暂无
中图分类号
学科分类号
摘要
In DNA microarray applications, many techniques are proposed for cancer classification in order to detect normal and cancerous humans or classify different types of cancers. Gene selection is usually required as a preliminary step for a cancer classification problem. This step aims to select the most informative genes among a great number of genes, which represent an important issue. Although many studies have been proposed to address this issue, they lack getting the most informative and fewest number of genes with the highest accuracy and little effort from the high dimensionality of microarray datasets. Manta ray foraging optimization(MRFO) algorithm is a new meta-heuristic algorithm that mimics the nature of manta ray fishes in food foraging. MRFO has achieved promising results in other fields, such as solar generating units. Due to the high accuracy results of the support vector machines (SVM), it is the most commonly used classification algorithm in cancer studies, especially with microarray data. For exploiting the pros of both algorithms (i.e., MRFO and SVM), in this paper, a hybrid algorithm is proposed to select the most predictive and informative genes for cancer classification. A binary microarray dataset, which includes colon and leukemia1, and a multi-class microarray dataset that includes SRBCT, lymphoma, and leukemia2, are used to evaluate the accuracy of the proposed technique. Like other optimization techniques, MRFO suffers from some problems related to the high dimensionality and complexity of the microarray data. For solving such problems as well as improving the performance, the minimum redundancy maximum relevance (mRMR) method is used as a preprocessing stage. The proposed technique has been evaluated compared to the most common cancer classification algorithms. The experimental results show that our proposed technique achieves the highest accuracy with the fewest number of informative genes and little effort.
引用
收藏
页码:2555 / 2572
页数:17
相关论文
共 50 条
  • [1] Gene Selection for Microarray Cancer Classification based on Manta Rays Foraging Optimization and Support Vector Machines
    Houssein, Essam H.
    Hassan, Hager N.
    Al-Sayed, Mustafa M.
    Nabil, Emad
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (02) : 2555 - 2572
  • [2] Gene selection for cancer classification using support vector machines
    Guyon, I
    Weston, J
    Barnhill, S
    Vapnik, V
    MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422
  • [3] Gene Selection for Cancer Classification using Support Vector Machines
    Isabelle Guyon
    Jason Weston
    Stephen Barnhill
    Vladimir Vapnik
    Machine Learning, 2002, 46 : 389 - 422
  • [4] A Hybrid Barnacles Mating Optimizer Algorithm With Support Vector Machines for Gene Selection of Microarray Cancer Classification
    Houssein, Essam H.
    Abdelminaam, Diaa Salama
    Hassan, Hager N.
    Al-Sayed, Mustafa M.
    Nabil, Emad
    IEEE ACCESS, 2021, 9 : 64895 - 64905
  • [5] Applications of support vector machines to cancer classification with microarray data
    Chu, F
    Wang, LP
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2005, 15 (06) : 475 - 484
  • [6] Hybrid Firefly based Simultaneous Gene Selection and Cancer Classification using Support Vector Machines and Random Forests
    Srivastava, Atulji
    Chakrabarti, Saurabh
    Das, Subrata
    Ghosh, Shameek
    Jayaraman, V. K.
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONFERENCE ON BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS (BIC-TA 2012), VOL 1, 2013, 201 : 485 - +
  • [7] Saliency Analysis of Support Vector Machines for Gene Selection in Tissue Classification
    L. Cao
    H.P. Lee
    C.K. Seng
    Q. Gu
    Neural Computing & Applications, 2003, 11 : 244 - 249
  • [8] Lung Cancer Classification Tool Using Microarray Data and Support Vector Machines
    Cabrera, Jennifer
    Dionisio, Abigaile
    Solano, Geoffrey
    2015 6TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2015,
  • [9] Saliency analysis of support vector machines for gene selection in tissue classification
    Cao, L
    Seng, CK
    Gu, Q
    Lee, HP
    NEURAL COMPUTING & APPLICATIONS, 2003, 11 (3-4) : 244 - 249
  • [10] Gene Selection Using Interaction Information for Microarray-based Cancer Classification
    Nakariyakul, Songyot
    2016 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2016,