Optimal parameter selection in support vector machines

被引:36
作者
Schittkowski, K. [1 ]
机构
[1] Univ Bayreuth, Dept Comp Sci, D-95440 Bayreuth, Germany
关键词
machine learning; support vector machine; SVM; binary classification; Gaussian kernel; kernel parameter; sequential quadratic programming; SQP; nonlinear programming;
D O I
10.3934/jimo.2005.1.465
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The purpose of the paper is to apply a nonlinear programming algorithm for computing kernel and related parameters of a support vector machine (SVM) by a two-level approach. Available training data are split into two groups, one set for formulating a quadratic SVM with L-2-soft margin and another one for minimizing the generalization error, where the optimal SVM variables are inserted. Subsequently, the total generalization error is evaluated for a separate set of test data. Derivatives of functions by which the optimization problem is defined, are evaluated in an analytical way, where an existing Cholesky decomposition needed for solving the quadratic SVM, is exploited. The approach is implemented and tested on a couple of standard data sets with up to 4,800 patterns. The results show a significant reduction of the generalization error, an increase of the margin, and a reduction of the number of support vectors in all cases where the data sets are sufficiently large. By a second set of test runs, kernel parameters are assigned to individual features. Redundant attributes are identified and suitable relative weighting factors are computed.
引用
收藏
页码:465 / 476
页数:12
相关论文
共 18 条
[1]  
[Anonymous], 2004, KERNEL METHODS PATTE
[2]  
[Anonymous], SUPPORT VECTOR MACHI
[3]  
AYAT NE, 2002, P INT WORKSH FRONT H
[4]   Mathematical programming for data mining: Formulations and challenges [J].
Bradley, PS ;
Fayyad, UM ;
Mangasarian, OL .
INFORMS JOURNAL ON COMPUTING, 1999, 11 (03) :217-238
[5]   Choosing multiple parameters for support vector machines [J].
Chapelle, O ;
Vapnik, V ;
Bousquet, O ;
Mukherjee, S .
MACHINE LEARNING, 2002, 46 (1-3) :131-159
[6]   Practical selection of SVM parameters and noise estimation for SVM regression [J].
Cherkassky, V ;
Ma, YQ .
NEURAL NETWORKS, 2004, 17 (01) :113-126
[7]  
Christianini N, 2002, INTRO SUPPORT VECTOR
[8]  
DAI YH, 2005, UNPUB SEQUENTIAL QUA
[9]  
DEBNATH R, 2004, NEURAL INFORM PROCES, V5, P41
[10]   A feature selection Newton method for support vector machine classification [J].
Fung, GM ;
Mangasarian, OL .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2004, 28 (02) :185-202