Fast optimization of hyperparameters for support vector regression models with highly predictive ability

被引:60
作者
Kaneko, Hiromasa [1 ]
Funatsu, Kimito [1 ]
机构
[1] Univ Tokyo, Dept Chem Syst Engn, Bunkyo Ku, Tokyo 1138656, Japan
基金
日本学术振兴会; 日本科学技术振兴机构;
关键词
Support vector regression; Hyperparameter; Optimization; Predictive ability; Computational time; BIG DATA; SELECTION;
D O I
10.1016/j.chemolab.2015.01.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Support vector regression (SVR) attracts much attention in chemometrics as a nonlinear regression method due to its theoretical background. In SVR modeling, three hyperparameters must be set beforehand. The optimization method based on grid search (GS) and cross-validation (CV) is employed normally in the selection of the SVR hyperparameters. However, this takes enormous time. Although theoretical techniques exist to decide the values of the SVR hyperparameters, predictive ability of SVR models is not considered in the decision. We therefore proposed a method based on the GS and CV method and theoretical techniques for fast optimization of the SVR hyperparameters, considering predictive ability of SVR models. After values of two hyperparameters are decided theoretically, each hyperparameter is optimized independently with GS and CV. The highly predictive ability of SVR models and small computational time for the proposed method are confirmed through the case studies using real data sets. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:64 / 69
页数:6
相关论文
共 19 条
[1]  
[Anonymous], 2006, Pattern recognition and machine learning
[2]   QSAR study of anti-HIV HEPT analogues based on multi-objective genetic programming and counter-propagation neural network [J].
Arakawa, Masamoto ;
Hasegawa, Kiyoshi ;
Funatsu, Kimito .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2006, 83 (02) :91-98
[3]   Cross-validation as the objective function for variable-selection techniques [J].
Baumann, K .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2003, 22 (06) :395-406
[4]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[5]   Visualizing Big data with Compressed Score Plots: Approach and research challenges [J].
Camacho, Jose .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 135 :110-125
[6]  
Chen T, 2007, CHEMOMETR INTELL LAB, V87, P59, DOI 10.1016/j.chemolab.2006.09.004
[7]   Practical selection of SVM parameters and noise estimation for SVM regression [J].
Cherkassky, V ;
Ma, YQ .
NEURAL NETWORKS, 2004, 17 (01) :113-126
[8]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[9]   Beware of q2! [J].
Golbraikh, A ;
Tropsha, A .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2002, 20 (04) :269-276
[10]   ADME evaluation in drug discovery. 4. Prediction of aqueous solubility based on atom contribution approach [J].
Hou, TJ ;
Xia, K ;
Zhang, W ;
Xu, XJ .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (01) :266-275