Empirical comparison of cross-validation and internal metrics for tuning SVM hyperparameters

Cited by: 60
Authors
Duarte, Edson [1]
Wainer, Jacques [1]
Affiliations
[1] Univ Estadual Campinas, Comp Inst, Av Albert Einstein 1251 Cidade Univ Zeferino Vaz, BR-13083852 Campinas, SP, Brazil
Keywords
SVM; Internal metrics; Cross validation; Hyper-parameter tuning; Model selection; SUPPORT; CLASSIFIERS; BOUNDS; SAMPLE;
DOI
10.1016/j.patrec.2017.01.007
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hyperparameter tuning is a mandatory step in building a support vector machine classifier. In this work, we study methods based on metrics computed on the training set itself, rather than on the performance of the classifier on a separate test set, which is the usual cross-validation approach. We compare 5-fold cross-validation with Xi-alpha, the radius-margin bound, generalized approximate cross-validation, maximum discrepancy, and distance between two classes on 110 public binary data sets. Cross-validation resulted in the best selection of hyperparameters, but it is also among the methods with the highest execution times. Distance between two classes (DBTC) is the fastest and the second-best-ranked method. We argue that DBTC is a reasonable alternative to cross-validation when training/hyperparameter-selection time is an issue, and that the loss in accuracy when using DBTC is reasonably small. (C) 2017 Published by Elsevier B.V.
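To illustrate why an internal metric such as DBTC can be much cheaper than cross-validation, the following is a minimal sketch, not the authors' implementation. It assumes that DBTC can be taken as the squared distance between the two class means in the feature space of an RBF kernel, and that the kernel parameter is chosen to maximize it; the abstract does not give the exact definition. The function names, the toy data and the gamma grid are all hypothetical.

```python
import math


def rbf(x, y, gamma):
    """RBF kernel value exp(-gamma * ||x - y||^2)."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))


def mean_kernel(A, B, gamma):
    """Average kernel value over all pairs (a, b) with a in A and b in B."""
    return sum(rbf(a, b, gamma) for a in A for b in B) / (len(A) * len(B))


def class_mean_distance_sq(pos, neg, gamma):
    """Squared distance between the class means in RBF feature space.

    Assumed form: mean(K_pp) + mean(K_nn) - 2 * mean(K_pn).
    """
    return (mean_kernel(pos, pos, gamma)
            + mean_kernel(neg, neg, gamma)
            - 2.0 * mean_kernel(pos, neg, gamma))


def select_gamma(pos, neg, gammas):
    """Return the candidate gamma that maximizes the between-class distance."""
    return max(gammas, key=lambda g: class_mean_distance_sq(pos, neg, g))


# Hypothetical toy data: two small, well-separated 2-D classes.
pos = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3)]
neg = [(1.0, 1.0), (1.2, 0.9), (0.9, 1.1)]

# Hypothetical logarithmic grid of candidate gamma values.
gammas = [2.0 ** k for k in range(-10, 11)]

best_gamma = select_gamma(pos, neg, gammas)
print("selected gamma:", best_gamma)
```

Under these assumptions, each candidate gamma costs only kernel evaluations on the training set, with no SVM training, whereas 5-fold cross-validation trains five classifiers per candidate setting.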
Pages: 6-11
Page count: 6
Related papers
20 in total
[1]  
Anguita D, 2005, STUD FUZZ SOFT COMP, V177, P159
[2]   Hyperparameter design criteria for support vector classifiers [J].
Anguita, D ;
Ridella, S ;
Rivieccio, F ;
Zunino, R .
NEUROCOMPUTING, 2003, 55 (1-2) :109-134
[3]  
Anguita D., 2010, The 2010 International Joint Conference on Neural Networks, P1, DOI DOI 10.1109/IJCNN.2010.5596450
[4]   In-Sample and Out-of-Sample Model Selection and Error Estimation for Support Vector Machines [J].
Anguita, Davide ;
Ghio, Alessandro ;
Oneto, Luca ;
Ridella, Sandro .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (09) :1390-1406
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]   Radius margin bounds for support vector machines with the RBF kernel [J].
Chung, KM ;
Kao, WC ;
Sun, CL ;
Wang, LL ;
Lin, CJ .
NEURAL COMPUTATION, 2003, 15 (11) :2643-2681
[7]  
Cristianini N, 1999, ADV NEUR IN, V11, P204
[8]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[9]   Evaluation of simple performance measures for tuning SVM hyperparameters [J].
Duan, K ;
Keerthi, SS ;
Poo, AN .
NEUROCOMPUTING, 2003, 51 :41-59
[10]  
Fernández-Delgado M, 2014, J MACH LEARN RES, V15, P3133