QP algorithms with guaranteed accuracy and run time for support vector machines

被引:0
作者
Hush, Don [1 ]
Kelly, Patrick [1 ]
Scovel, Clint [1 ]
Steinwart, Ingo [1 ]
机构
[1] Los Alamos Natl Lab, Modeling Algorithms & Informat Grp, Los Alamos, NM 87545 USA
关键词
quadratic programming; decomposition algorithms; approximation algorithms; support vector machines;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We describe polynomial-time algorithms that produce approximate solutions with guaranteed accuracy for a class of QP problems that are used in the design of support vector machine classifiers. These algorithms employ a two-stage process where the first stage produces an approximate solution to a dual QP problem and the second stage maps this approximate dual solution to an approximate primal solution. For the second stage we describe an O(n log n) algorithm that maps an approximate dual solution with accuracy (2 root 2K(n) + 8 root lambda)(-2) lambda epsilon(2)(p) to an approximate primal solution with accuracy epsilon(p) where n is the number of data samples, K-n is the maximum kernel value over the data and lambda > 0 is the SVM regularization parameter. For the first stage we present new results for decomposition algorithms and describe new decomposition algorithms with guaranteed accuracy and run time. In particular, for tau-rate certifying decomposition algorithms we establish the optimality of tau = 1/(n - 1). In addition we extend the recent tau = 1/(n - 1) algorithm of Simon ( 2004) to form two new composite algorithms that also achieve the t = 1/( n- 1) iteration bound of List and Simon ( 2005), but yield faster run times in practice. We also exploit the t - rate certifying property of these algorithms to produce new stopping rules that are computationally efficient and that guarantee a specified accuracy for the approximate dual solution. Furthermore, for the dual QP problem corresponding to the standard classification problem we describe operational conditions for which the Simon and composite algorithms possess an upper bound of O(n) on the number of iterations. For this same problem we also describe general conditions for which a matching lower bound exists for any decomposition algorithm that uses working sets of size 2. For the Simon and composite algorithms we also establish an O(n(2)) bound on the overall run time for the first stage. Combining the first and second stages gives an overall run time of O(n(2)(c(k) + 1)) where c(k) is an upper bound on the computation to perform a kernel evaluation. Pseudocode is presented for a complete algorithm that inputs an accuracy epsilon(p) and produces an approximate solution that satisfies this accuracy in low order polynomial time. Experiments are included to illustrate the new stopping rules and to compare the Simon and composite decomposition algorithms.
引用
收藏
页码:733 / 769
页数:37
相关论文
共 38 条
[11]  
HUSH D, 2005, 057738 LAUR LOS AL N
[12]  
HUSH D, 2003, LAUR034845 LOS AL NA
[13]  
Joachims T, 1999, ADVANCES IN KERNEL METHODS, P169
[14]   Convergence of a generalized SMO algorithm for SVM classifier design [J].
Keerthi, SS ;
Gilbert, EG .
MACHINE LEARNING, 2002, 46 (1-3) :351-360
[15]   Improvements to Platt's SMO algorithm for SVM classifier design [J].
Keerthi, SS ;
Shevade, SK ;
Bhattacharyya, C ;
Murthy, KRK .
NEURAL COMPUTATION, 2001, 13 (03) :637-649
[16]  
KEERTHI SS, 2000, CD0009 NAT U SING DE
[17]  
KEERTHI SS, 2000, IEEE T NEURAL NETWOR, V11, P637
[18]  
LAI D, 2003, MESCE302003 MON U DE
[19]   Feasible direction decomposition algorithms for training support vector machines [J].
Laskov, P .
MACHINE LEARNING, 2002, 46 (1-3) :315-349
[20]   A note on the decomposition methods for support vector regression [J].
Liao, SP ;
Lin, HT ;
Lin, CJ .
NEURAL COMPUTATION, 2002, 14 (06) :1267-1281