Performance of SVM with Multiple Kernel Learning for Classification Tasks of Imbalanced Datasets

被引:0
作者
Saeed, Sana [1 ]
Ong, Hong Choon [2 ]
机构
[1] Univ Punjab, Coll Stat & Actuarial Sci, Lahore 54590, Pakistan
[2] Univ Sains Malaysia, Sch Math Sci, Inst Post Grad Studies, Gelugor 11800, Penang, Malaysia
来源
PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY | 2019年 / 27卷 / 01期
关键词
Hybrid algorithm; imbalanced datasets; multiple kernel learning; oversampling algorithm; support vector machine; SUPPORT VECTOR MACHINES; PARAMETER SELECTION; OPTIMIZATION; MATRIX; SMOTE;
D O I
暂无
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Support vector machine (SVM) is one of the most popular algorithms in machine learning and data mining. However, its reduced efficiency is usually observed for imbalanced datasets. To improve the performance of SVM for binary imbalanced datasets, a new scheme based on oversampling and the hybrid algorithm were introduced. Besides the use of a single kernel function, SVM was applied with multiple kernel learning (MKL). A weighted linear combination was defined based on the linear kernel function, radial basis function (RBF kernel), and sigmoid kernel function for MKL. By generating the synthetic samples in the minority class, searching the best choices of the SVM parameters and identifying the weights of MKL by minimizing the objective function, the improved performance of SVM was observed. To prove the strength of the proposed scheme, an experimental study, including noisy borderline and real imbalanced datasets was conducted. SVM was applied with linear kernel function, RBF kernel, sigmoid kernel function and MKL on all datasets. The performance of SVM with all kernel functions was evaluated by using sensitivity, G Mean, and F measure. A significantly improved performance of SVM with MKL was observed by applying the proposed scheme.
引用
收藏
页码:527 / 545
页数:19
相关论文
共 46 条
[1]  
Abe S., 2005, ADV PTRN RECOGNIT
[2]  
Alcalá-Fdez J, 2011, J MULT-VALUED LOG S, V17, P255
[3]  
Alitouche TA., 2013, J INFORM ENG APPL, V3, P27, DOI DOI 10.5121/IJDKP.2013.3402
[4]  
Alwan H. B., 2013, WSEAS T COMPUTERS, V12, P277
[5]  
[Anonymous], 2001, Learning with Kernels |
[6]  
[Anonymous], 2004, KERNEL METHODS PATTE
[7]  
[Anonymous], 2014, R PACKAGE
[8]  
[Anonymous], P 21 INT C MACH LEAR
[9]  
Ben-Hur A, 2010, METHODS MOL BIOL, V609, P223, DOI 10.1007/978-1-60327-241-4_13
[10]   SMOTE for high-dimensional class-imbalanced data [J].
Blagus, Rok ;
Lusa, Lara .
BMC BIOINFORMATICS, 2013, 14