A nonlinear least squares quasi-Newton strategy for LP-SVR hyper-parameters selection

被引:8
作者
Rivas-Perea, Pablo [1 ]
Cota-Ruiz, Juan [2 ]
Rosiles, Jose-Gerardo [3 ]
机构
[1] Baylor Univ, Dept Comp Sci, Waco, TX 76798 USA
[2] Autonomous Univ Ciudad Juarez UACJ, Dept Elect & Comp Engn, Ciudad Juarez 32310, Chihuahua, Mexico
[3] Sci Applicat Int Corp, El Paso, TX 79925 USA
关键词
Support vector regression; Hyper-parameters; Large-scale LP-SVR; SUPPORT VECTOR REGRESSION; MODEL SELECTION; PERFORMANCE-MEASURES; HYPERPARAMETERS; CRITERIA; SPACE;
D O I
10.1007/s13042-013-0153-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies the problem of hyper-parameters selection for a linear programming-based support vector machine for regression (LP-SVR). The proposed model is a generalized method that minimizes a linear-least squares problem using a globalization strategy, inexact computation of first order information, and an existing analytical method for estimating the initial point in the hyper-parameters space. The minimization problem consists of finding the set of hyper-parameters that minimizes any generalization error function for different problems. Particularly, this research explores the case of two-class, multi-class, and regression problems. Simulation results among standard data sets suggest that the algorithm achieves statistically insignificant variability when measuring the residual error; and when compared to other methods for hyper-parameters search, the proposed method produces the lowest root mean squared error in most cases. Experimental analysis suggests that the proposed approach is better suited for large-scale applications for the particular case of an LP-SVR. Moreover, due to its mathematical formulation, the proposed method can be extended in order to estimate any number of hyper-parameters.
引用
收藏
页码:579 / 597
页数:19
相关论文
共 71 条
[61]  
Stark H., 2001, PROBABILITY RANDOM P, V3rd
[62]   Decomposition techniques for training linear programming support vector machines [J].
Torii, Yusuke ;
Abe, Shigeo .
NEUROCOMPUTING, 2009, 72 (4-6) :973-984
[63]  
Vapnik V, 1997, ADV NEUR IN, V9, P281
[64]  
Waugh S, 1995, THESIS U TASMANIA TA
[65]   Multiple faults diagnosis in motion system based on SVM [J].
Xiao, Jin-Zhuang ;
Wang, Hong-Rui ;
Yang, Xin-Cai ;
Gao, Zheng .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2012, 3 (01) :77-82
[66]   A novel kernel-based maximum a posteriori classification method [J].
Xu, Zenglin ;
Huang, Kaizhu ;
Zhu, Jianke ;
King, Irwin ;
Lyu, Michael R. .
NEURAL NETWORKS, 2009, 22 (07) :977-987
[67]   Modeling of strength of high-performance concrete using artificial neural networks [J].
Yeh, IC .
CEMENT AND CONCRETE RESEARCH, 1998, 28 (12) :1797-1808
[68]  
Yuan XF, 2008, J SYST ENG ELECTRON, V19, P191
[69]  
Zhang JP, 2005, Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, P1637
[70]   On the sparseness of 1-norm support vector machines [J].
Zhang, Li ;
Zhou, Weida .
NEURAL NETWORKS, 2010, 23 (03) :373-385