RFS: Efficient feature selection method based on R-value

被引:17
作者
Lee, Jimin
Batnyam, Nomin
Oh, Sejong [1 ]
机构
[1] Dankook Univ, Dept Nanobiomed Sci, Anseodong 330714, Cheonan, South Korea
关键词
Feature selection; Classification; Dataset; R-value; GENE-EXPRESSION; CANCER;
D O I
10.1016/j.compbiomed.2012.11.010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Feature selection is one of the most important issues in classification. Many filter and wrapper methods have been proposed. Here, we propose a new efficient feature selection method based on the R-value, which is a measure that is used to capture the overlapped areas among classes in a feature. Our strategy was to select features that have low overlapping areas among classes. Proposed idea is simple, but powerful for feature selection. The experiment results showed that the proposed method is better than previous typical methods in many cases. Accordingly, the proposed method can be used in combination with other feature selection methods. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:91 / 99
页数:9
相关论文
共 24 条
[1]  
Berrar D.P., 2009, PRACTICAL APPROACH M, P1
[2]   Gene expression profiles of prostate cancer reveal involvement of multiple molecular pathways in the metastatic process [J].
Chandran, Uma R. ;
Ma, Changqing ;
Dhir, Rajiv ;
Bisceglia, Michelle ;
Lyons-Weiler, Maureen ;
Liang, Wenjing ;
Michalopoulos, George ;
Becich, Michael ;
Monzon, Federico A. .
BMC CANCER, 2007, 7 (1)
[3]  
Chang C.-C., LIBSVM: a Library for Support Vector Machines
[4]   Gene profiling in spinal cord injury shows role of cell cycle neuronal death [J].
Di Giovanni, S ;
Knoblach, SM ;
Brandoli, C ;
Aden, SA ;
Hoffman, EP ;
Faden, AI .
ANNALS OF NEUROLOGY, 2003, 53 (04) :454-468
[5]   Genomic analysis of rodent pulmonary tissue following bis-(2-chloroethyl) sulfide exposure [J].
Dillman, JF ;
Phillips, CS ;
Dorsch, LM ;
Croxton, MD ;
Hege, AI ;
Sylvester, AJ ;
Moran, TS ;
Sciuto, AM .
CHEMICAL RESEARCH IN TOXICOLOGY, 2005, 18 (01) :28-34
[6]   Minimum redundancy feature selection from microarray gene expression data [J].
Ding, C ;
Peng, HC .
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, :523-528
[7]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[8]  
Hoare C. A., 1961, Communications of the ACM, V4, P321, DOI DOI 10.1145/366622.366644
[9]   Nearest Template Prediction: A Single-Sample-Based Flexible Class Prediction with Confidence Assessment [J].
Hoshida, Yujin .
PLOS ONE, 2010, 5 (11)
[10]   Subclass Mapping: Identifying Common Subtypes in Independent Disease Data Sets [J].
Hoshida, Yujin ;
Brunet, Jean-Philippe ;
Tamayo, Pablo ;
Golub, Todd R. ;
Mesirov, Jill P. .
PLOS ONE, 2007, 2 (11)