A mathematical programming approach to SVM-based classification with label noise

被引:35
作者
Blanco, Victor [1 ,2 ]
Japon, Alberto [3 ,4 ]
Puerto, Justo [3 ,4 ]
机构
[1] Univ Granada, Inst Math IMAG, Granada, Spain
[2] Univ Granada, Dept Quantitat Methods Econ & Business, Granada, Spain
[3] Univ Seville, Inst Math IMUS, Seville, Spain
[4] Univ Seville, Dept Stats & OR, Seville, Spain
关键词
Supervised classification; SVM; Mixed integer non linear programming; Label noise; SUPPORT VECTOR MACHINES; FEATURE-SELECTION; PREDICTION;
D O I
10.1016/j.cie.2022.108611
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper we propose novel methodologies to optimally construct Support Vector Machine-based classifiers that take into account that label noise occur in the training sample. We propose different alternatives based on solving Mixed Integer Linear and Non Linear models by incorporating decisions on relabeling some of the observations in the training dataset. The first method incorporates relabeling directly in the SVM model while a second family of methods combines clustering with classification at the same time, giving rise to a model that applies simultaneously similarity measures and SVM. Extensive computational experiments are reported based on a battery of standard datasets taken from UCI Machine Learning repository, showing the effectiveness of the proposed approaches.
引用
收藏
页数:9
相关论文
共 52 条
[1]  
[Anonymous], 2017, CONSUMER SENTINEL NE
[2]   On-line handwriting recognition with support vector machines - A kernel approach [J].
Bahlmann, C ;
Haasdonk, B ;
Burkhardt, H .
EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, :49-54
[3]   Tightening big Ms in integer programming formulations for support vector machines with ramp loss [J].
Baldomero-Naranjo, Marta ;
Martinez-Merino, Luisa, I ;
Rodriguez-Chia, Antonio M. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2020, 286 (01) :84-100
[4]  
Bertsimas D., 2019, INFORMS Journal on Optimization, V1, P2, DOI DOI 10.1287/IJOO.2018.0001
[5]  
Bi J., 2005, Advances in neural information processing systems, P161
[6]  
Biggio B., 2011, ASIAN C MACHINE LEAR, VVolume 20, P97
[7]  
Blanco V, 2020, J MACH LEARN RES, V21
[8]   Robust optimal classification trees under noisy labels [J].
Blanco, Victor ;
Japon, Alberto ;
Puerto, Justo .
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (01) :155-179
[9]   On the multisource hyperplanes location problem to fitting set of points [J].
Blanco, Victor ;
Japon, Alberto ;
Ponce, Diego ;
Puerto, Justo .
COMPUTERS & OPERATIONS RESEARCH, 2021, 128
[10]   Optimal arrangements of hyperplanes for SVM-based multiclass classification [J].
Blanco, Victor ;
Japon, Alberto ;
Puerto, Justo .
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2020, 14 (01) :175-199