Automatic Learning Algorithms for Local Support Vector Machines

被引:1
作者
Do T.-N. [1 ,2 ]
机构
[1] College of Information Technology, Can Tho University, Cantho
[2] UMI UMMISCO 209 (IRD/UPMC), Bondy Cedex
关键词
Automatic hyper-parameter tuning; Large data sets; Local support vector machines; Support vector machines;
D O I
10.1007/s42979-019-0006-z
中图分类号
学科分类号
摘要
In this paper, we propose automatic learning algorithms (called auto-kSVM-1p, autokSVM-dp) being possible to automatically tune hyper-parameters of k local support vector machines (SVM) for classifying large data sets. The autokSVM algorithms are able to determine the number of clusters k to partition the large training data, followed which they auto-learn the non-linear SVM model in each cluster to classify the data locally in the parallel way on multi-core computers. The autokSVM algorithms combine the grid search, the.632 bootstrap estimator, the hill climbing heuristic to optimize hyper-parameters in the local non-linear SVM training. The numerical test results on four data sets from UCI repository and three benchmarks of handwritten letters recognition showed that our autokSVM algorithms give very competitive classification results compared to the standard LibSVM and the original kSVM. An example of autokSVM-1p’s effectiveness is given with an accuracy of 96.74% obtained in the classification of Forest cover-type data set having 581,012 datapoints in 54-dimensional input space and 7 classes in 334.45 seconds using a PC Intel(R) Core i7-4790 CPU, 3.6 GHz, 4 cores. © 2019, Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 53 条
[1]  
Vapnik V., The nature of statistical learning theory, (1999)
[2]  
Guyon I. Web page on SVM applications
[3]  
Platt J., Fast training of support vector machines using sequential minimal optimization, Advances in kernel methods—support vector learning, pp. 185-208, (1999)
[4]  
Do T.N., Tran-Nguyen M.T., Automatic hyper-parameters tuning for local support vector machines, Proceedings of 5Th International Conference on Future Data and Security Engineering, FDSE 2018, Ho Chi Minh City, November 28–30, pp. 185-199, (2018)
[5]  
Do T.N., Non-linear classification of massive datasets with a parallel algorithm of local support vector machines, Advanced Computational Methods for Knowledge Engineering, pp. 231-241, (2015)
[6]  
Do T.N., Poulet F., Parallel learning of local SVM algorithms for classifying large datasets, Trans Large-Scale Data Knowl Centered Syst, 31, pp. 67-93, (2016)
[7]  
Bottou L., Vapnik V., Local learning algorithms, Neural Comput, 4, 6, pp. 888-900, (1992)
[8]  
Vapnik V., Bottou L., Local algorithms for pattern recognition and dependencies estimation, Neural Comput, 5, 6, pp. 893-909, (1993)
[9]  
Vapnik V., Principles of risk minimization for learning theory, In: Advances in Neural Information Processing Systems, 4, pp. 831-838, (1991)
[10]  
Staelin C., Parameter selection for support vector machines, (2002)