Support Vector Machine Ensembles Using Feature-Subset Selection for Enhancing Microarray Data Classification

Authors
Ahmed, Eman [1]
El Gayar, Neamat [1]
El Azab, Iman A. [1]
Affiliations
[1] Cairo Univ, Fac Comp & Informat, Giza 12613, Egypt
Keywords
Support Vector Machines (SVM); Ensemble classification; SVM fusion; Feature subsets; Feature selection; Microarray data;
DOI
Not available
Chinese Library Classification number
O29 [Applied Mathematics];
Discipline classification code
070104;
Abstract
Support Vector Machines (SVMs) are known to be robust tools for classification and regression in noisy and complex domains. SVM ensembles have been widely used to improve classification accuracy in complicated pattern recognition tasks. A good example is DNA microarray data for tumor classification, which is usually characterized by small sample size, high dimensionality, noise, and large biological variability. In this work we propose applying an ensemble of SVMs coupled with feature-subset selection methods to alleviate the curse of dimensionality associated with expression-based classification of DNA data, in order to achieve stable and reliable results. We compare a single SVM classifier with SVM ensembles that apply two different feature-subset selection techniques, namely random selection and k-means clustering, and that combine the base classifiers using either majority vote or SVM fusion. Two real-world datasets are used as benchmarks to evaluate and compare performance. Experimental results show that the ensemble that uses k-means clustering for feature-subset selection, SVM base classifiers, and an SVM combiner achieves the best classification accuracy, and that feature-subset selection methods can have a considerable impact on classification accuracy.
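As a rough illustration of one configuration the abstract names (random feature-subset selection with majority-vote combination), the following minimal sketch may help. It is not the authors' implementation. Several details are assumptions made only for this example: the feature subsets are taken to be disjoint, each base classifier is a linear SVM trained with a Pegasos-style stochastic subgradient method, the data are synthetic, and all function names are hypothetical. The k-means variant and the SVM combiner are not shown.

```python
import random
from collections import Counter


def random_feature_subsets(n_features, n_subsets, rng):
    """Shuffle feature indices and deal them into disjoint groups (an assumption)."""
    idx = list(range(n_features))
    rng.shuffle(idx)
    return [sorted(idx[k::n_subsets]) for k in range(n_subsets)]


def train_linear_svm(X, y, lam=0.01, epochs=200, rng=None):
    """Pegasos-style subgradient descent on the hinge loss; labels must be -1 or +1.

    A constant feature is appended so that a bias term is learned as well.
    """
    rng = rng or random.Random(0)
    w = [0.0] * (len(X[0]) + 1)
    t = 0
    for _ in range(epochs):
        for i in rng.sample(range(len(X)), len(X)):
            t += 1
            eta = 1.0 / (lam * t)
            xi = list(X[i]) + [1.0]
            margin = y[i] * sum(wj * xj for wj, xj in zip(w, xi))
            w = [(1.0 - eta * lam) * wj for wj in w]  # regularization shrink
            if margin < 1.0:  # margin violated: step toward the sample
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, xi)]
    return w


def svm_predict(w, x):
    score = sum(wj * xj for wj, xj in zip(w, list(x) + [1.0]))
    return 1 if score >= 0 else -1


def fit_ensemble(X, y, n_subsets, seed=0):
    """Train one linear SVM per feature subset."""
    rng = random.Random(seed)
    subsets = random_feature_subsets(len(X[0]), n_subsets, rng)
    models = []
    for cols in subsets:
        X_sub = [[row[c] for c in cols] for row in X]
        models.append(train_linear_svm(X_sub, y, rng=rng))
    return subsets, models


def majority_vote(votes):
    """Return the most frequent label; use an odd member count to avoid ties."""
    return Counter(votes).most_common(1)[0][0]


def predict_ensemble(subsets, models, x):
    votes = [svm_predict(w, [x[c] for c in cols])
             for cols, w in zip(subsets, models)]
    return majority_vote(votes)


def make_toy_data(n_samples=20, n_features=30, seed=1):
    """Synthetic stand-in for expression data: few samples, many features."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = 1 if i % 2 == 0 else -1
        X.append([rng.gauss(0.5 * label, 1.0) for _ in range(n_features)])
        y.append(label)
    return X, y


X, y = make_toy_data()
subsets, models = fit_ensemble(X, y, n_subsets=3)
preds = [predict_ensemble(subsets, models, x) for x in X]
```

Each base classifier sees only its own slice of the features, so no single model has to fit the full high-dimensional space. Replacing `majority_vote` with a second-stage classifier trained on the base classifiers' outputs would be one way to approximate the SVM fusion variant.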
Pages: 1 - 11
Page count: 11