Multiresolution hierarchical support vector machine for classification of large datasets

被引:0
作者
Safaa Alwajidi
Li Yang
机构
[1] University of North Carolina at Pembroke,Department of Mathematics and Computer Science
[2] Western Michigan University,Department of Computer Science
来源
Knowledge and Information Systems | 2022年 / 64卷
关键词
Support vector machine; Data classification; Multiresolution analysis; Hierarchical analysis;
D O I
暂无
中图分类号
学科分类号
摘要
Support vector machine (SVM) is a popular supervised learning algorithm based on margin maximization. It has a high training cost and does not scale well to a large number of data points. We propose a multiresolution algorithm MRH-SVM that trains SVM on a hierarchical data aggregation structure, which also serves as a common data input to other learning algorithms. The proposed algorithm learns SVM models using high-level data aggregates and only visits data aggregates at more detailed levels where support vectors reside. In addition to performance improvements, the algorithm has advantages such as the ability to handle data streams and datasets with imbalanced classes. Experimental results show significant performance improvements in comparison with existing SVM algorithms.
引用
收藏
页码:3447 / 3462
页数:15
相关论文
共 67 条
[1]  
Arun Kumar M(2010)A hybrid SVM based decision tree Pattern Recognit 43 3977-3987
[2]  
Gopal M(2005)Fast kernel classifiers with online and active learning J Mach Learn Res 6 1579-1619
[3]  
Bordes A(2013)An ontology enhanced parallel SVM for scalable spam filter training Neurocomputing 108 45-57
[4]  
Ertekin S(2015)Data selection based on decision tree for SVM classification on large data sets Appl Soft Comput 37 787-798
[5]  
Weston J(2016)Medical Internet of Things and big data in healthcare Healthc Inform Res 22 156-163
[6]  
Bottou L(2006)Binary tree of SVM: a new fast multiclass training and classification algorithm IEEE Trans Neural Netw 17 696-704
[7]  
Caruana G(2009)Learning from imbalanced data IEEE Trans Knowl Data Eng 21 1263-1284
[8]  
Li M(2011)A novel intrusion detection system based on hierarchical clustering and support vector machines Expert Syst Appl 38 306-313
[9]  
Liu Y(2016)Online decorrelation of humidity and temperature in chemical sensors for continuous monitoring Chemom Intell Lab Syst 157 169-176
[10]  
Cervantes J(2018)A divide-and-conquer method for large scale Neural Comput Appl 29 497-509