RFS: Efficient feature selection method based on R-value

被引:17
作者
Lee, Jimin
Batnyam, Nomin
Oh, Sejong [1 ]
机构
[1] Dankook Univ, Dept Nanobiomed Sci, Anseodong 330714, Cheonan, South Korea
关键词
Feature selection; Classification; Dataset; R-value; GENE-EXPRESSION; CANCER;
D O I
10.1016/j.compbiomed.2012.11.010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Feature selection is one of the most important issues in classification. Many filter and wrapper methods have been proposed. Here, we propose a new efficient feature selection method based on the R-value, which is a measure that is used to capture the overlapped areas among classes in a feature. Our strategy was to select features that have low overlapping areas among classes. Proposed idea is simple, but powerful for feature selection. The experiment results showed that the proposed method is better than previous typical methods in many cases. Accordingly, the proposed method can be used in combination with other feature selection methods. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:91 / 99
页数:9
相关论文
共 24 条
  • [1] Berrar D.P., 2009, PRACTICAL APPROACH M, P1
  • [2] Gene expression profiles of prostate cancer reveal involvement of multiple molecular pathways in the metastatic process
    Chandran, Uma R.
    Ma, Changqing
    Dhir, Rajiv
    Bisceglia, Michelle
    Lyons-Weiler, Maureen
    Liang, Wenjing
    Michalopoulos, George
    Becich, Michael
    Monzon, Federico A.
    [J]. BMC CANCER, 2007, 7 (1)
  • [3] Chang C.-C., LIBSVM: a Library for Support Vector Machines
  • [4] Gene profiling in spinal cord injury shows role of cell cycle neuronal death
    Di Giovanni, S
    Knoblach, SM
    Brandoli, C
    Aden, SA
    Hoffman, EP
    Faden, AI
    [J]. ANNALS OF NEUROLOGY, 2003, 53 (04) : 454 - 468
  • [5] Genomic analysis of rodent pulmonary tissue following bis-(2-chloroethyl) sulfide exposure
    Dillman, JF
    Phillips, CS
    Dorsch, LM
    Croxton, MD
    Hege, AI
    Sylvester, AJ
    Moran, TS
    Sciuto, AM
    [J]. CHEMICAL RESEARCH IN TOXICOLOGY, 2005, 18 (01) : 28 - 34
  • [6] Minimum redundancy feature selection from microarray gene expression data
    Ding, C
    Peng, HC
    [J]. PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, : 523 - 528
  • [7] The use of multiple measurements in taxonomic problems
    Fisher, RA
    [J]. ANNALS OF EUGENICS, 1936, 7 : 179 - 188
  • [8] Hoare C. A., 1961, Communications of the ACM, V4, P321, DOI DOI 10.1145/366622.366644
  • [9] Nearest Template Prediction: A Single-Sample-Based Flexible Class Prediction with Confidence Assessment
    Hoshida, Yujin
    [J]. PLOS ONE, 2010, 5 (11):
  • [10] Subclass Mapping: Identifying Common Subtypes in Independent Disease Data Sets
    Hoshida, Yujin
    Brunet, Jean-Philippe
    Tamayo, Pablo
    Golub, Todd R.
    Mesirov, Jill P.
    [J]. PLOS ONE, 2007, 2 (11):