A comparative study on large scale kernelized support vector machines

被引:0
作者
Daniel Horn
Aydın Demircioğlu
Bernd Bischl
Tobias Glasmachers
Claus Weihs
机构
[1] Technische Universität Dortmund,Fakultät Statistik
[2] Ruhr-Universität Bochum,Department of Statistics
[3] LMU München,undefined
来源
Advances in Data Analysis and Classification | 2018年 / 12卷
关键词
Support vector machine; Multi-objective optimization; Supervised learning; Machine learning; Large scale; Nonlinear SVM; Parameter tuning; 62-07 Data analysis;
D O I
暂无
中图分类号
学科分类号
摘要
Kernelized support vector machines (SVMs) belong to the most widely used classification methods. However, in contrast to linear SVMs, the computation time required to train such a machine becomes a bottleneck when facing large data sets. In order to mitigate this shortcoming of kernel SVMs, many approximate training algorithms were developed. While most of these methods claim to be much faster than the state-of-the-art solver LIBSVM, a thorough comparative study is missing. We aim to fill this gap. We choose several well-known approximate SVM solvers and compare their performance on a number of large benchmark data sets. Our focus is to analyze the trade-off between prediction error and runtime for different learning and accuracy parameter settings. This includes simple subsampling of the data, the poor-man’s approach to handling large scale problems. We employ model-based multi-objective optimization, which allows us to tune the parameters of learning machine and solver over the full range of accuracy/runtime trade-offs. We analyze (differences between) solvers by studying and comparing the Pareto fronts formed by the two objectives classification error and training time. Unsurprisingly, given more runtime most solvers are able to find more accurate solutions, i.e., achieve a higher prediction accuracy. It turns out that LIBSVM with subsampling of the data is a strong baseline. Some solvers systematically outperform others, which allows us to give concrete recommendations of when to use which solver.
引用
收藏
页码:867 / 883
页数:16
相关论文
共 50 条
[31]   A comparative study of artificial neural networks and support vector machines for predicting groundwater levels in a coastal aquifer [J].
Yoon, Heesung ;
Jun, Seong-Chun ;
Hyun, Yunjung ;
Bae, Gwang-Ok ;
Lee, Kang-Kun .
JOURNAL OF HYDROLOGY, 2011, 396 (1-2) :128-138
[32]   A Comprehensive Comparative Study of Artificial Neural Network (ANN) and Support Vector Machines (SVM) on Stock Forecasting [J].
Kurani A. ;
Doshi P. ;
Vakharia A. ;
Shah M. .
Annals of Data Science, 2023, 10 (01) :183-208
[33]   Derivation, Optimization, and Comparative Analysis of Support Vector Machines Application to Multi-Class Image Data [J].
Shekhar, Avi ;
Saeed, Amir K. ;
Johnson, Benjamin A. ;
Rodriguez, Benjamin M. .
MULTIMODAL IMAGE EXPLOITATION AND LEARNING 2024, 2024, 13033
[34]   Comparative Study of Time Series Models, Support Vector Machines, and GMDH in Forecasting Long-Term Evapotranspiration Rates in Northern Iran [J].
Ashrafzadeh, Afshin ;
Kisi, Ozgur ;
Aghelpour, Pouya ;
Biazar, Seyed Mostafa ;
Masouleh, Mohammadreza Askarizad .
JOURNAL OF IRRIGATION AND DRAINAGE ENGINEERING, 2020, 146 (06)
[35]   Support vector machines maximizing geometric margins for multi-class classification [J].
Keiji Tatsumi ;
Tetsuzo Tanino .
TOP, 2014, 22 :815-840
[36]   Fast Support Vector Classification for Large-Scale Problems [J].
Akram-Ali-Hammouri, Ziad ;
Fernandez-Delgado, Manuel ;
Cernadas, Eva ;
Barro, Senen .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) :6184-6195
[37]   Fast Training on Large Genomics Data using Distributed Support Vector Machines [J].
Theera-Ampornpunt, Nawanol ;
Kim, Seong Gon ;
Ghoshal, Asish ;
Bagchi, Saurabh ;
Grama, Ananth ;
Chaterji, Somali .
2016 8TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORKS (COMSNETS), 2016,
[38]   Support vector machines for hyperspectral remote sensing classification [J].
Gualtieri, JA ;
Cromp, RF .
ADVANCES IN COMPUTER-ASSISTED RECOGNITION, 1999, 3584 :221-232
[39]   Support Vector Machines for Characterising Whipple Shield Performance [J].
Ryan, S. ;
Kandanaarachchi, S. ;
Smith-Miles, K. .
PROCEEDINGS OF THE 2015 HYPERVELOCITY IMPACT SYMPOSIUM (HVIS 2015), 2015, 103 :522-529
[40]   Analyzing superstars' power using support vector machines [J].
Suarez-Vazquez, Ana ;
Quevedo, Jose R. .
EMPIRICAL ECONOMICS, 2015, 49 (04) :1521-1542