Concave-Convex Programming for Ramp Loss-Based Maximum Margin and Minimum Volume Twin Spheres Machine

被引:1
作者
Wang, Qian [1 ]
Xu, Yitian [1 ]
机构
[1] China Agr Univ, Coll Sci, Beijing 100083, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Maximum margin; Minimum volume; Hyper-sphere; Ramp loss; Concave-convex programming; SUPPORT VECTOR MACHINES; CLASSIFIER; SELECTION;
D O I
10.1007/s11063-018-9903-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Twin hyper-sphere support vector machine (THSVM) classifies two classes of samples via two hyper-spheres instead of a pair of nonparallel hyper-planes in the conventional twin support vector machine. It avoids the matrix inverse operation. However, the THSVM uses hinge loss which easily leads to sensitivity of the noises. In this paper, we propose a ramp loss-based maximum margin and minimum volume twin spheres machine ((RMTSM)-T-3) to enhance the ability of noise resistance. (RMTSM)-T-3 is robust because of introducing the ramp loss function, but it is non-convex. (RMTSM)-T-3 has several admirable advantages. For example, it can explicitly incorporate noises in the training process. In addition, the ramp loss function can be decomposed into the difference of a convex hinge loss and another convex loss. Then the concave-convex programming procedure is employed to efficiently solve the non-convex problems of (RMTSM)-T-3 by solving a sequence of convex programs iteratively. Experimental results on one artificial data set, 14 imbalanced binary datasets and 3 multiclass datasets indicate that our proposed (RMTSM)-T-3 yields a good generalization performance.
引用
收藏
页码:1093 / 1114
页数:22
相关论文
共 36 条
[1]  
[Anonymous], 2004, KERNEL METHODS PATTE
[2]  
Chang W.C., 2013, TECHNICAL REPORT
[3]  
Collobert R., 2006, P 23 INT C MACH LEAR, P201, DOI DOI 10.1145/1143844.1143870
[4]  
Cristianini N., 2000, INTRO SUPPORT VECTOR, DOI DOI 10.1017/CBO9780511801389
[5]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[6]   Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power [J].
Garcia, Salvador ;
Fernandez, Alberto ;
Luengo, Julian ;
Herrera, Francisco .
INFORMATION SCIENCES, 2010, 180 (10) :2044-2064
[7]   A simple generalisation of the area under the ROC curve for multiple class classification problems [J].
Hand, DJ ;
Till, RJ .
MACHINE LEARNING, 2001, 45 (02) :171-186
[8]  
Huang X, 2014, J MACH LEARN RES, V15, P2185
[9]   Twin support vector machines for pattern classification [J].
Jayadeva ;
Khemchandani, R. ;
Chandra, Suresh .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (05) :905-910
[10]  
Jayadeva, 2017, SPRINGER SERIES STUD