Efficient optimization of support vector machine learning parameters for unbalanced datasets

被引:93
作者
Eitrich, Tatjana [1 ]
Lang, Bruno
机构
[1] Res Ctr Julich, Cent Inst Appl Math, John Von Neumann Inst Comp, Julich, Germany
[2] Univ Wuppertal, Dept Math, Wuppertal, Germany
关键词
support vector machine; parameter tuning; unbalanced datasets; derivative-free optimization;
D O I
10.1016/j.cam.2005.09.009
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Support vector machines are powerful kernel methods for classification and regression tasks. If trained optimally, they produce excellent separating hyperplanes. The quality of the training, however, depends not only on the given training data but also on additional learning parameters, which are difficult to adjust, in particular for unbalanced datasets. Traditionally, grid search techniques have been used for determining suitable values for these parameters. In this paper, we propose an automated approach to adjusting the learning parameters using a derivative-free numerical optimizer. To make the optimization process more efficient, a new sensitive quality measure is introduced. Numerical tests with a well-known dataset show that our approach can produce support vector machines that are very well tuned to their classification tasks. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:425 / 436
页数:12
相关论文
共 34 条
[1]  
ABOUZAKHAR NS, J DIGITAL EVIDENCE, V3
[2]  
[Anonymous], HPL2002354
[3]  
[Anonymous], 1998, UCI REPOSITORY MACHI
[4]  
[Anonymous], 1990, Report No
[5]  
[Anonymous], P 2 SIAM INT C DAT M
[6]  
[Anonymous], 1998, Encyclopedia of Biostatistics
[7]  
[Anonymous], 2003, P INT FLAIRS C
[8]   The analysis of decomposition methods for support vector machines [J].
Chang, CC ;
Hsu, CW ;
Lin, CJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (04) :1003-1008
[9]  
Cristianini N., 2000, Intelligent Data Analysis: An Introduction, DOI 10.1017/CBO9780511801389
[10]  
Drish J., 2001, OBTAINING CALIBRATED