On the working set selection in gradient projection-based decomposition techniques for support vector machines

被引:18
作者
Serafini, T [1 ]
Zanni, L [1 ]
机构
[1] Univ Modena & Reggio Emilia, Dipartimento Matemat, I-41100 Modena, Italy
关键词
support vector machines; quadratic programs; decomposition techniques; gradient projection methods; large-scale problems;
D O I
10.1080/10556780500140714
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This work deals with special decomposition techniques for the large quadratic program arising in training support vector machines. These approaches split the problem into a sequence of quadratic programming (QP) subproblems which can be solved by efficient gradient projection methods recently proposed. Owing to the ability of decomposing the problem into much larger subproblems than standard decomposition packages, these techniques show promising performance and are well suited for parallelization. Here, we discuss a crucial aspect for their effectiveness: the selection of the working set; that is, the index set of the variables to be optimized at each step through the QP subproblem. We analyze the most popular working set selections and develop a new selection strategy that improves the convergence rate of the decomposition schemes based on large sized working sets. The effectiveness of the proposed strategy within the gradient projection-based decomposition techniques is shown by numerical experiments on large benchmark problems, both in serial and in parallel environments.
引用
收藏
页码:583 / 596
页数:14
相关论文
共 27 条
[1]  
[Anonymous], 2002, LIBSVM LIB SUPPORT V
[2]  
[Anonymous], 1990, SUPPORT VECTOR LEARN
[3]   The analysis of decomposition methods for support vector machines [J].
Chang, CC ;
Hsu, CW ;
Lin, CJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (04) :1003-1008
[4]   SVMTorch: Support vector machines for large-scale regression problems [J].
Collobert, R ;
Bengio, S .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :143-160
[5]  
Cristianini N., 2000, Intelligent Data Analysis: An Introduction, DOI 10.1017/CBO9780511801389
[6]  
DAI YH, 2003, 216 U DUND DEP MATH
[7]   Efficient SVM regression training with SMO [J].
Flake, GW ;
Lawrence, S .
MACHINE LEARNING, 2002, 46 (1-3) :271-290
[8]  
Galligani E, 2003, NONCONVEX OPTIM, V68, P185
[9]   A simple decomposition method for support vector machines [J].
Hsu, CW ;
Lin, CJ .
MACHINE LEARNING, 2002, 46 (1-3) :291-314
[10]  
Joachims T., 1998, ADV KERNEL METHODS S