A new filter-based Gene selection method based on dragonfly optimization and correlation-based feature selection

被引:0
作者
Ghoneimy, Mohamed [1 ]
Nabil, Emad [2 ,3 ]
Badr, Amr [2 ]
El-Khamisy, Sherif F. [4 ]
机构
[1] 6th October City, MUST Univ, Fac Informat Technol, Giza, Egypt
[2] Cairo Univ, Fac Comp & Artificial Intelligence, Giza, Egypt
[3] Islamic Univ Madinah, Fac Comp & Informat Syst, Madinah, Saudi Arabia
[4] Zewail City Sci & Technol, Helmy Inst, Ctr Genom, Giza 12588, Egypt
来源
BIOSCIENCE RESEARCH | 2019年 / 16卷 / 03期
关键词
Feature Selection; Filter Approach; Gene Selection; Microarray Data; DragonFly Optimization; Correlation Coefficient; EFFICIENT FEATURE-SELECTION; RANDOM SUBSPACE METHOD; HIGH-DIMENSIONAL DATA; FEATURE SUBSET; CANCER; ALGORITHM; CLASSIFICATION; EXPRESSION; EVOLUTIONARY; INFORMATION;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cancer Diagnosis is considered one of microarray data's most developing applications. But the classification of cancer using microarray data stills a hard problem, this is because of the microarray data consists of a massive number of genes and a small number of cases. In order to tackle this problem a gene selection method must be used which improves the accuracy of classification. A new filter-based gene selection method is proposed in this paper. The proposed method merges the Dragonfly algorithm and the correlation-based feature selection, this is to reduce the redundancy between the genes selected and increase the relevance between the selected genes and the decision. Our proposed method is compared with nine famous feature selection methods. The experiments in this paper are applied to five widely used public microarray datasets. The used evaluation criterion of the selected features is the average accuracy of classification using three different classifiers, which are support vector machine, naive Bayes, and decision tree. Experimental results demonstrate that our proposed method is efficient and performs better than the other nine methods used in the experiment. It also shows that the proposed method can be used with anyone of the three classifiers included in our study to obtain an efficient automatic cancer diagnostic system.
引用
收藏
页码:3139 / 3154
页数:16
相关论文
共 72 条
  • [1] Text feature selection using ant colony optimization
    Aghdam, Mehdi Hosseinzadeh
    Ghasem-Aghaee, Nasser
    Basiri, Mohammad Ehsan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 6843 - 6853
  • [2] [Anonymous], 2009, ACM SIGKDD explorations newsletter, DOI 10.1145/1656274.1656278
  • [3] [Anonymous], 2005, DATA MINING PRACTICA
  • [4] Fast wrapper feature subset selection in high-dimensional datasets by means of filter re-ranking
    Bermejo, Pablo
    de la Ossa, Luis
    Gamez, Jose A.
    Puerta, Jose M.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2012, 25 (01) : 35 - 44
  • [5] Bio-molecular cancer prediction with random subspace ensembles of support vector machines
    Bertoni, A
    Folgieri, R
    Valentini, G
    [J]. NEUROCOMPUTING, 2005, 63 : 535 - 539
  • [6] A review of microarray datasets and applied feature selection methods
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    Benitez, J. M.
    Herrera, F.
    [J]. INFORMATION SCIENCES, 2014, 282 : 111 - 135
  • [7] An efficient gene selection algorithm based on mutual information
    Cai, Ruichu
    Hao, Zhifeng
    Yang, Xiaowei
    Wen, Wen
    [J]. NEUROCOMPUTING, 2009, 72 (4-6) : 991 - 999
  • [8] Towards improving cluster-based feature selection with a simplified silhouette filter
    Covoes, Thiago F.
    Hruschka, Eduardo R.
    [J]. INFORMATION SCIENCES, 2011, 181 (18) : 3766 - 3782
  • [9] Gene selection for microarray cancer classification using a new evolutionary method employing artificial intelligence concepts
    Dashtban, M.
    Balafar, Mohammadali
    [J]. GENOMICS, 2017, 109 (02) : 91 - 107
  • [10] A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms
    Derrac, Joaquin
    Garcia, Salvador
    Molina, Daniel
    Herrera, Francisco
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (01) : 3 - 18