Weighted support vector machine algorithm for efficient classification and prediction of binary response data

被引:5
作者
Banjoko, A. W. [1 ]
Yahya, W. B. [1 ]
Garba, M. K. [1 ]
Abdulazeez, K. O. [1 ]
机构
[1] Univ Ilorin, Dept Stat, PMB 1515, Ilorin, Kwara State, Nigeria
来源
2ND INTERNATIONAL CONFERENCE ON APPLIED & INDUSTRIAL MATHEMATICS AND STATISTICS | 2019年 / 1366卷
关键词
HYBRID; GENES;
D O I
10.1088/1742-6596/1366/1/012101
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper proposes a weighted Support Vector Machine (w-SVM) method for efficient class prediction in binary response data sets. The proposed method was obtained by introducing weights which utilizes the point biserial correlation between each of the predictors and the dichotomized response variable into the standard SVM algorithm to maximize the classification accuracy. The optimal value of the proposed w-SVM cost and each of the kernels parameters were determined by grid search in a 10-fold cross validation resampling method. Monte-Carlo Cross Validation method was employed to examine the predictive power of the proposed method by partitioning the data into train and test samples using different sampling splitting ratios. Application of the proposed method on the simulated data sets yielded high prediction accuracy on the test sample. Results from other performance indices further gave credence to the efficiency of the proposed method. The performance of the proposed method was compared with three of the state-of-the art machine learning methods including the standard SVM and the result showed the superiority of this method over others. Finally, the results generally show that the modified algorithm with Radial Basis Function (RBF) Kernel perform excellently and achieved the best predictive performance than any of the existing classifiers considered.
引用
收藏
页数:7
相关论文
共 17 条
[1]   Feature weighting and SVM parameters optimization based on genetic algorithms for classification problems [J].
Anh Viet Phan ;
Minh Le Nguyen ;
Lam Thu Bui .
APPLIED INTELLIGENCE, 2017, 46 (02) :455-469
[2]  
[Anonymous], 2002, LS SVMLAB MATLAB C T
[3]  
Banjoko A, 2015, ANN COMPUT SCI SER, V13, P69
[4]  
Banjoko A W, 2017, P INT C NIG STAT SOC, V1, P104
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[7]  
Fan RE, 2008, J MACH LEARN RES, V9, P1871
[8]   An EnKF-based scheme to optimize hyper-parameters and features for SVM classifier [J].
Ji, Yingsheng ;
Chen, Yushu ;
Fu, Haohuan ;
Yang, Guangwen .
PATTERN RECOGNITION, 2017, 62 :202-213
[9]   AIMQ: a methodology for information quality assessment [J].
Lee, YW ;
Strong, DM ;
Kahn, BK ;
Wang, RY .
INFORMATION & MANAGEMENT, 2002, 40 (02) :133-146
[10]   Selecting differentially expressed genes using minimum probability of classification error [J].
Mahata, Pritha ;
Mahata, Kaushik .
JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (06) :775-786