A New Fuzzy Set and Nonkernel SVM Approach for Mislabeled Binary Classification With Applications

被引:34
作者
Tian, Ye [1 ,2 ]
Sun, Miao [1 ,2 ]
Deng, Zhibin [3 ,4 ]
Luo, Jian [5 ]
Li, Yueqing [6 ]
机构
[1] Southwest Univ Finance & Econ, Sch Business Adm, Chengdu 611130, Sichuan, Peoples R China
[2] Southwest Univ Finance & Econ, Res Ctr Big Data, Chengdu 611130, Sichuan, Peoples R China
[3] Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
[5] Dongbei Univ Finance & Econ, Sch Management Sci & Engn, Dalian 116025, Peoples R China
[6] Lamar Univ, Dept Ind Engn, Beaumont, TX 77710 USA
基金
中国国家自然科学基金;
关键词
Binary classification; intuitionistic fuzzy set (IFS); mislabeled information; real applications; semisupervised quadratic surface support vector machine (SSQSSVM); SUPPORT VECTOR MACHINE;
D O I
10.1109/TFUZZ.2017.2752138
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new approach based on the kernel-free quadratic surface support vector machine model to handle a binary classification problem with mislabeled information. Unlike the traditional fuzzy and robust support vector machine models that reduce the weights of suspectable mislabeled points or even discard them, our new method first adopts the intuitionistic fuzzy set method to detect those suspectable mislabeled points, then deletes their labels, and indiscriminately utilizes their full position information to build a semisupervised model. In this way, we can not only eliminate the negative effect of mislabeled information but also avoid the difficult task of searching proper kernel functions in classical SVM models. Besides, to improve the efficiency and accuracy, a branch-and-bound algorithm is designed to accelerate the solving process. After that, we conduct some numerical tests with both artificial and real-world datasets to verify the superior performance of our proposed method among several benchmark methods. Furthermore, the proposed method is applied to brain-computer interface and credit risk assessment. The promising results strongly demonstrate the effectiveness of our method and show its big potential in some real applications.
引用
收藏
页码:1536 / 1545
页数:10
相关论文
共 33 条
[1]   Fuzzy support vector machine based on within-class scatter for classification problems with outliers or noises [J].
An, Wenjuan ;
Liang, Mangui .
NEUROCOMPUTING, 2013, 110 :101-110
[2]  
[Anonymous], 1990, PROC S APPL MATH
[3]  
[Anonymous], 2002, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond
[4]  
[Anonymous], 2009, Introduction to semi-supervised learning
[5]  
[Anonymous], Learning from Labeled and Unlabeled Data with Label Propagation
[6]  
Bache K., 2013, UCI Machine Learning Repository
[7]   New SDP models for protein homology detection with semi-supervised SVM [J].
Bai, Y. Q. ;
Niu, B. L. ;
Chen, Y. .
OPTIMIZATION, 2013, 62 (04) :561-572
[8]  
Chapelle O, 2008, J MACH LEARN RES, V9, P203
[9]  
Chen SC, 2017, IOP EXPAND PHYS, P1, DOI 10.1088/978-0-7503-1674-3
[10]   Support vector learning for fuzzy rule-based classification systems [J].
Chen, YX ;
Wang, JZ .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2003, 11 (06) :716-728