Capped L2,p-norm metric based robust least squares twin support vector machine for pattern classification

被引:25
作者
Yuan, Chao [1 ]
Yang, Liming [2 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Haidian, Peoples R China
[2] China Agr Univ, Coll Sci, Beijing 100083, Haidian, Peoples R China
基金
中国国家自然科学基金;
关键词
Classification; Robustness; Capped L-2; L-p-norm; Iterative algorithm;
D O I
10.1016/j.neunet.2021.06.028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Least squares twin support vector machine (LSTSVM) is an effective and efficient learning algorithm for pattern classification. However, the distance in LSTSVM is measured by squared L-2-norm metric that may magnify the influence of outliers. In this paper, a novel robust least squares twin support vector machine framework is proposed for binary classification, termed as CL2,p-LSTSVM, which utilizes capped L-2,L-p-norm distance metric to reduce the influence of noise and outliers. The goal of CL2,p-LSTSVM is to minimize the capped L-2,L-p-norm intra-class distance dispersion, and eliminate the influence of outliers during training process, where the value of the metric is controlled by the capped parameter, which can ensure better robustness. The proposed metric includes and extends the traditional metrics by setting appropriate values of p and capped parameter. This strategy not only retains the advantages of LSTSVM, but also improves the robustness in solving a binary classification problem with outliers. However, the nonconvexity of metric makes it difficult to optimize. We design an effective iterative algorithm to solve the CL2,p-LSTSVM. In each iteration, two systems of linear equations are solved. Simultaneously, we present some insightful analyses on the computational complexity and convergence of algorithm. Moreover, we extend the CL2,p-LSTSVM to nonlinear classifier and semi-supervised classification. Experiments are conducted on artificial datasets, UCI benchmark datasets, and image datasets to evaluate our method. Under different noise settings and different evaluation criteria, the experiment results show that the CL2,p -LSTSVM has better robustness than state-of-the-art approaches in most cases, which demonstrates the feasibility and effectiveness of the proposed method. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页码:457 / 478
页数:22
相关论文
共 52 条
[21]   Knowledge based Least Squares Twin support vector machines [J].
Kumar, M. Arun ;
Khemchandani, Reshma ;
Gopal, M. ;
Chandra, Suresh .
INFORMATION SCIENCES, 2010, 180 (23) :4606-4618
[22]   Least squares twin support vector machines for pattern classification [J].
Kumar, M. Arun ;
Gopal, M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) :7535-7543
[23]   Principal component analysis based on L1-norm maximization [J].
Kwak, Nojun .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (09) :1672-1680
[24]   Series DC Arc Fault Detection Based on Ensemble Machine Learning [J].
Le, Vu ;
Yao, Xiu ;
Miller, Chad ;
Tsao, Bang-Hung .
IEEE TRANSACTIONS ON POWER ELECTRONICS, 2020, 35 (08) :7826-7839
[25]   Robust L1-norm non-parallel proximal support vector machine [J].
Li, Chun-Na ;
Shao, Yuan-Hai ;
Deng, Nai-Yang .
OPTIMIZATION, 2016, 65 (01) :169-183
[26]   Robust learning with imperfect privileged information [J].
Li, Xue ;
Du, Bo ;
Xu, Chang ;
Zhang, Yipeng ;
Zhang, Lefei ;
Tao, Dacheng .
ARTIFICIAL INTELLIGENCE, 2020, 282 (282)
[27]   Few-Shot Hyperspectral Image Classification With Unknown Classes Using Multitask Deep Learning [J].
Liu, Shengjie ;
Shi, Qian ;
Zhang, Liangpei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06) :5085-5102
[28]   L2P-Norm Distance Twin Support Vector Machine [J].
Ma, Xu ;
Ye, Qiaolin ;
Yan, He .
IEEE ACCESS, 2017, 5 :23473-23483
[29]   Lagrangian support vector machines [J].
Mangasaian, OL ;
Musicant, DR .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (03) :161-177
[30]   Multisurface proximal support vector machine classification via generalized eigenvalues [J].
Mangasarian, OL ;
Wild, EW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (01) :69-74