Optimal Feature Selection for Support Vector Machine Classifiers

被引:0
作者
Strub, O. [1 ]
机构
[1] Univ Bern, Dept Business Adm, Bern, Switzerland
来源
2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM) | 2020年
关键词
Delayed Constraint Generation; Feature Selection; Mixed-Integer Quadratic Programming; Support Vector Machines; OPTIMIZATION;
D O I
10.1109/ieem45057.2020.9309859
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Binary classification is a fundamental task in machine learning. It consists of learning a relationship between observable features of a set of training objects and their observable membership to either of two classes to predict as accurately as possible the class membership of new test objects whose features are observable but whose class membership is unknown. One of the most successful methods for binary classification is the support vector machine classifier that aims at finding a hyperplane in the feature space separating the training objects of the two classes. However, the accuracy of this classifier in predicting the correct classes strongly depends on the features selected for determining the hyperplane. In this paper, we propose the first exact approach, which is based on mixed-integer quadratic programming and delayed constraint generation, to identify an optimal set of relevant features for determining the hyperplane. The results of a computational experiment demonstrate that the proposed approach is able to successfully select an optimal set of relevant features in a short running time even for classification tasks with over 10,000 objects and 100 features.
引用
收藏
页码:304 / 308
页数:5
相关论文
共 8 条
[1]   A comparative study of the leading machine learning techniques and two new optimization algorithms [J].
Baumann, P. ;
Hochbaum, D. S. ;
Yang, Y. T. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 272 (03) :1041-1057
[2]   OR Forum-An Algorithmic Approach to Linear Regression [J].
Bertsimas, Dimitris ;
King, Angela .
OPERATIONS RESEARCH, 2016, 64 (01) :2-16
[3]   Support Vector Machines with the Ramp Loss and the Hard Margin Loss [J].
Brooks, J. Paul .
OPERATIONS RESEARCH, 2011, 59 (02) :467-479
[4]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[5]   A GA-based feature selection and parameters optimization for support vector machines [J].
Huang, Cheng-Lung ;
Wang, Chieh-Jen .
EXPERT SYSTEMS WITH APPLICATIONS, 2006, 31 (02) :231-240
[6]  
Lerch D, 2019, IN C IND ENG ENG MAN, P144, DOI [10.1109/ieem44572.2019.8978524, 10.1109/IEEM44572.2019.8978524]
[7]   Particle swarm optimization for parameter determination and feature selection of support vector machines [J].
Lin, Shih-Wei ;
Ying, Kuo-Ching ;
Chen, Shih-Chieh ;
Lee, Zne-Jung .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (04) :1817-1824
[8]  
Sharma M., 2017, J TELECOMMUNICATION, V9, P41