Structured variable selection in support vector machines

被引:8
|
作者
Wu, Seongho [1 ]
Zou, Hui [1 ]
Yuan, Ming [2 ]
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[2] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA 30332 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2008年 / 2卷
基金
美国国家科学基金会;
关键词
Classification; Heredity; Nonparametric estimation; Support vector machine; Variable selection;
D O I
10.1214/07-EJS125
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
When applying the support vector machine (SVM) to high-dimensional classification problems, we often impose a sparse structure in the SVM to eliminate the influences of the irrelevant predictors. The lasso and other variable selection techniques have been successfully used in the SVM to perform automatic variable selection. In some problems, there is a natural hierarchical structure among the variables. Thus, in order to have an interpretable SVM classifier, it is important to respect the heredity principle when enforcing the sparsity in the SVM. Many variable selection methods, however, do not respect the heredity principle. In this paper we enforce both sparsity and the heredity principle in the SVM by using the so-called structured variable selection (SVS) framework originally proposed in [20]. We minimize the empirical hinge loss under a set of linear inequality constraints and a lasso-type penalty. The solution always obeys the desired heredity principle and enjoys sparsity. The new SVM classifier can be efficiently fitted, because the optimization problem is a linear program. Another contribution of this work is to present a nonparametric extension of the SVS framework, and we propose nonparametric heredity SVMs. Simulated and real data are used to illustrate the merits of the proposed method.
引用
收藏
页码:103 / 117
页数:15
相关论文
共 50 条
  • [31] Modified Cat Swarm Optimization Algorithm for Feature Selection of Support Vector Machines
    Lin, Kuan-Cheng
    Huang, Yi-Hung
    Hung, Jason C.
    Lin, Yung-Tso
    FRONTIER AND INNOVATION IN FUTURE COMPUTING AND COMMUNICATIONS, 2014, 301 : 328 - 335
  • [32] A meta-learning approach to automatic kernel selection for support vector machines
    Ali, Shawkat
    Smith-Miles, Kate A.
    NEUROCOMPUTING, 2006, 70 (1-3) : 173 - 186
  • [33] Cost-sensitive Feature Selection for Support Vector Machines
    Benitez-Pena, S.
    Blanquero, R.
    Carrizosa, E.
    Ramirez-Cobo, P.
    COMPUTERS & OPERATIONS RESEARCH, 2019, 106 : 169 - 178
  • [34] Comparison on neural networks and support vector machines in suppliers' selection
    Hu Guosheng1
    2. School of Computer and Information
    3. Adult Education Coll.
    Journal of Systems Engineering and Electronics, 2008, (02) : 316 - 320
  • [35] A wrapper method for feature selection using Support Vector Machines
    Maldonado, Sebastian
    Weber, Richard
    INFORMATION SCIENCES, 2009, 179 (13) : 2208 - 2217
  • [36] Fast instance selection for speeding up support vector machines
    Chen, Jingnian
    Zhang, Caiming
    Xue, Xiaoping
    Liu, Cheng-Lin
    KNOWLEDGE-BASED SYSTEMS, 2013, 45 : 1 - 7
  • [37] Comparison on neural networks and support vector machines in suppliers' selection
    Hu Guosheng
    Zhang Guohong
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2008, 19 (02) : 316 - 320
  • [38] Model selection for support vector machines via uniform design
    Huang, Chien-Ming
    Lee, Yuh-Jye
    Lin, Dennis K. J.
    Huang, Su-Yun
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 335 - 346
  • [39] Purity Filtering: An Instance Selection Method for Support Vector Machines
    Moran-Pomes, David
    Belanche-Munoz, Lluis A.
    ARTIFICIAL INTELLIGENCE XXXVI, 2019, 11927 : 21 - 35
  • [40] Bayesian Group Feature Selection for Support Vector Learning Machines
    Du, Changde
    Du, Changying
    Zhe, Shandian
    Luo, Ali
    He, Qing
    Long, Guoping
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I, 2016, 9651 : 239 - 252