Feature selection based on sparse representation with the measures of classification error rate and complexity of boundary

被引:6
作者
Deng, Yanli [1 ]
Jin, Weidong [1 ]
机构
[1] Southwest Jiao Tong Univ, Sch Elect Engn, Chengdu 610031, Sichuan, Peoples R China
来源
OPTIK | 2015年 / 126卷 / 20期
基金
美国国家科学基金会;
关键词
Feature selection; Sparse representation; Rough sets; Complexity of boundary; Classification;
D O I
10.1016/j.ijleo.2015.06.057
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Feature subset selection plays an important role in pattern recognition, classification systems, and data mining. We study how to select good features by optimizing multivariate performance measures based on sparse representation. In this paper, we first propose a novel feature evaluation measure, called counting region covering (CRC), for estimating classification complexity in different feature subspaces. Then, we present a unified feature selection framework by optimizing the classification error rate and complexity of boundary simultaneously. This allows us to select a compact set of superior features at high classification accuracy. Finally, we discuss the influence of weighting factors in feature selection framework. We perform experimental comparison of our algorithm and other methods using a support vector machine classifier and five different data sets (iris, wine, sonar, iono, and waveform). Experimental results on five real-world datasets demonstrate the effectiveness of our algorithm. (C) 2015 Elsevier GmbH. All rights reserved.
引用
收藏
页码:2634 / 2639
页数:6
相关论文
共 20 条
[1]  
[Anonymous], P INT C MACH LEARN
[2]   Consistency-based search in feature selection [J].
Dash, M ;
Liu, HA .
ARTIFICIAL INTELLIGENCE, 2003, 151 (1-2) :155-176
[3]   Gene selection for cancer classification using support vector machines [J].
Guyon, I ;
Weston, J ;
Barnhill, S ;
Vapnik, V .
MACHINE LEARNING, 2002, 46 (1-3) :389-422
[4]  
Guyon Isabelle, 2003, Journal of Machine Learning, V3, P1157
[5]  
Ho TK, 2002, IEEE T PATTERN ANAL, V24, P289, DOI 10.1109/34.990132
[6]   Neighborhood rough set based heterogeneous feature subset selection [J].
Hu, Qinghua ;
Yu, Daren ;
Liu, Jinfu ;
Wu, Congxin .
INFORMATION SCIENCES, 2008, 178 (18) :3577-3594
[7]   Feature selection for multiclass discrimination via mixed-integer linear programming [J].
Iannarilli, FJ ;
Rubin, PA .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (06) :779-783
[8]   Statistical pattern recognition: A review [J].
Jain, AK ;
Duin, RPW ;
Mao, JC .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (01) :4-37
[9]   Wrappers for feature subset selection [J].
Kohavi, R ;
John, GH .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :273-324
[10]   Input feature selection by mutual information based on Parzen window [J].
Kwak, N ;
Choi, CH .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (12) :1667-1671