A novel embedded min-max approach for feature selection in nonlinear Support Vector Machine classification

被引:59
作者
Jimenez-Cordero, Asuncion [1 ]
Miguel Morales, Juan [1 ]
Pineda, Salvador [1 ]
机构
[1] Univ Malaga, OASYS Grp, Malaga, Spain
基金
欧洲研究理事会;
关键词
Machine learning; Min-max optimization; Duality theory; Feature selection; Nonlinear Support Vector Machine classification; VARIABLE SELECTION; KERNEL;
D O I
10.1016/j.ejor.2020.12.009
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
In recent years, feature selection has become a challenging problem in several machine learning fields, such as classification problems. Support Vector Machine (SVM) is a well-known technique applied in classification tasks. Various methodologies have been proposed in the literature to select the most relevant features in SVM. Unfortunately, all of them either deal with the feature selection problem in the linear classification setting or propose ad-hoc approaches that are difficult to implement in practice. In contrast, we propose an embedded feature selection method based on a min-max optimization problem, where a trade-off between model complexity and classification accuracy is sought. By leveraging duality theory, we equivalently reformulate the min-max problem and solve it without further ado using off-the-shelf software for nonlinear optimization. The efficiency and usefulness of our approach are tested on several benchmark data sets in terms of accuracy, number of selected features and interpretability. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:24 / 35
页数:12
相关论文
共 38 条
  • [1] Feature selection for classification models via bilevel optimization
    Agor, Joseph
    Ozaltin, Osman Y.
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2019, 106 (156-168) : 156 - 168
  • [2] Automatic Feature Selection via Weighted Kernels and Regularization
    Allen, Genevera I.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2013, 22 (02) : 284 - 299
  • [3] Variable selection in regression-a tutorial
    Andersen, C. M.
    Bro, R.
    [J]. JOURNAL OF CHEMOMETRICS, 2010, 24 (11-12) : 728 - 737
  • [4] [Anonymous], 2000, INTRO SUPPORT VECTOR
  • [5] Integer programming models for feature selection: New extensions and a randomized solution algorithm
    Bertolazzi, P.
    Felici, G.
    Festa, P.
    Fiscon, G.
    Weitschek, E.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2016, 250 (02) : 389 - 399
  • [6] Functional-bandwidth kernel for Support Vector Machine with Functional Data: An alternating optimization algorithm
    Blanquero, R.
    Carrizosa, E.
    Jimenez-Cordero, A.
    Martin-Barragan, B.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 275 (01) : 195 - 207
  • [7] Variable selection in classification for multivariate functional data
    Blanquero, Rafael
    Carrizosa, Emilio
    Jimenez-Cordero, Asuncion
    Martin-Barragan, Belen
    [J]. INFORMATION SCIENCES, 2019, 481 : 445 - 462
  • [8] Recent advances and emerging challenges of feature selection in the context of big data
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 86 : 33 - 45
  • [9] A review of microarray datasets and applied feature selection methods
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    Benitez, J. M.
    Herrera, F.
    [J]. INFORMATION SCIENCES, 2014, 282 : 111 - 135
  • [10] Boyd L., 2004, Convex Optimization, DOI DOI 10.1017/CBO9780511804441