An Efficient Approach for Microarray Data Classification using Filter Wrapper Hybrid Approach

被引:0
作者
Sreepada, Rama Syamala [1 ]
Vipsita, Swati [2 ]
Mohapatra, Puspanjali [2 ]
机构
[1] Univ Coll Enginnering JNTUK, Dept CSE, Vizianagaram, Andhra Pradesh, India
[2] IIIT Bhubaneswar, Dept CSE, Bhubaneswar, Orissa, India
来源
2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC) | 2015年
关键词
Microarray; feature selection; filters; wrappers; Support Vector Machine; CANCER CLASSIFICATION; GENE; SELECTION; ALGORITHM; ENSEMBLE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Microarrays store gene expression data of each cell; thereby microarray contains thousands of features. Each row represents gene samples and each column represents conditions. In any classification task, selection of irrelevant or redundant features greatly reduces the performance of classifier. Therefore, selection of optimal number of significant features is a major challenge for any classification problem. Filter and wrapper approaches are mostly used for feature subset selection. Filters are computationally faster but wrapper approach is more efficient in terms of classification accuracy. This paper proposes a hybrid approach combining the filters and wrappers is proposed which takes the features from both the filters and the wrappers. In the initial step, a feature subset is selected using filters and the feature subset size is further reduced using the wrapper approach. The proposed method is tested on Colon Tumor, B-Cell Lymphoma (DLBCL) and Leukemia datasets. Simulation results show that the hybrid method achieved higher accuracy with a smaller feature subset when compared to the existing methods.
引用
收藏
页码:263 / 267
页数:5
相关论文
共 24 条
  • [1] Microarray gene expression classification with few genes: Criteria to combine attribute selection and classification methods
    Alonso-Gonzalez, Carlos J.
    Isaac Moro-Sancho, Q.
    Simon-Hurtado, Arancha
    Varela-Arrabal, Ricardo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (08) : 7270 - 7280
  • [2] Classifying gene expression data of cancer using classifier ensemble with mutually exclusive features
    Cho, SB
    Ryu, JW
    [J]. PROCEEDINGS OF THE IEEE, 2002, 90 (11) : 1744 - 1753
  • [3] Cancer classification using ensemble of neural networks with multiple significant gene subset's
    Cho, Sung-Bae
    Won, Hong-Hee
    [J]. APPLIED INTELLIGENCE, 2007, 26 (03) : 243 - 250
  • [4] Faraoun K., 2007, INFOCOMP J COMPUT SC, V6, P36
  • [5] Classification of heterogeneous microarray data by maximum entropy kernel
    Fujibuchi, Wataru
    Kato, Tsuyoshi
    [J]. BMC BIOINFORMATICS, 2007, 8 (1)
  • [6] Han J., Data Mining: Concepts and Techniques
  • [7] Hybrid feature selection by combining filters and wrappers
    Hsu, Hui-Huang
    Hsieh, Cheng-Wei
    Lu, Ming-Da
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) : 8144 - 8150
  • [8] Gene selection and sample classification on microarray data based on adaptive genetic algorithm/k-nearest neighbor method
    Lee, Chien-Pang
    Lin, Wen-Shin
    Chen, Yuh-Min
    Kuo, Bo-Jein
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 4661 - 4667
  • [9] An integrated algorithm for gene selection and classification applied to microarray data of ovarian cancer
    Lee, Zne-Jung
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2008, 42 (01) : 81 - 93
  • [10] Selecting subsets of newly extracted features from PCA and PLS in microarray data analysis
    Li, Guo-Zheng
    Bu, Hua-Long
    Yang, Mary Qu
    Zeng, Xue-Qiang
    Yang, Jack Y.
    [J]. BMC GENOMICS, 2008, 9 (Suppl 2)