Fuzzy support vector machine with graph for classifying imbalanced datasets

被引:11
作者
Chen, Baihua [1 ]
Fan, Yuling [1 ]
Lan, Weiyao [1 ]
Liu, Jinghua [2 ]
Cao, Chao [3 ,4 ]
Gao, Yunlong [1 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen 361102, Peoples R China
[2] Huaqiao Univ, Coll Comp Sci & Technol, Xiamen 361021, Peoples R China
[3] Minist Nat Resources, Inst Oceanog 3, Xiamen 361005, Peoples R China
[4] Fujian Prov Key Lab Marine Ecol Conservat & Restor, Xiamen 361005, Peoples R China
基金
中国国家自然科学基金;
关键词
Fuzzy support vector machines; Class imbalance; The curse of dimensionality; Kernel space; Graph; CLASSIFICATION; RECOGNITION; ROBUST; MODELS;
D O I
10.1016/j.neucom.2022.09.139
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since support vector machine (SVM) considers all the training samples equally, it suffers from the problems of noise/outliers and class imbalance. Although many fuzzy support vector machines (FSVMs) have been proposed to suppress the effect of noise/outliers and class imbalance, most of them ignore the impact of the curse of dimensionality on the discriminative performance of fuzzy membership function and do not give the fuzzy membership function corresponding to the kernel space, which seriously reduces the performance of FSVM. To solve these problems, we propose the fuzzy support vector machine with graph (GraphFSVM) in this paper. Specifically, we first design a graph-based fuzzy membership function to accurately assess the importance of samples in original feature space and prove that the function can mine discriminative information between samples in high-dimensional data. Additionally, since the data distribution in kernel space is different from those in the original feature space, a method is provided to calculate the fuzzy membership function in the kernel space. Finally, the GraphFSVM model analyzes samples of each class independently, this suppresses the effect of class imbalance. Following the above principles, we design the graph-based fuzzy support vector machine and propose a detailed optimization method. Experimental results on UCI, gene expression, and image datasets show that the GraphFSVM has better generalization and robustness than other state-of-the-art methods.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:296 / 312
页数:17
相关论文
共 56 条
[1]   Fuzzy support vector machine based on within-class scatter for classification problems with outliers or noises [J].
An, Wenjuan ;
Liang, Mangui .
NEUROCOMPUTING, 2013, 110 :101-110
[2]  
[Anonymous], 1999, Proceedings of the international joint conference on AI
[3]  
[Anonymous], 1999, NATURE STAT LEARNING, DOI DOI 10.1007/978-1-4757-2440-0
[4]  
Batista G. E., 2004, ACM SIGKDD Explorations Newsletter, V6, P20, DOI [10.1145/1007730.1007735, DOI 10.1145/1007730.1007735]
[5]   FSVM-CIL: Fuzzy Support Vector Machines for Class Imbalance Learning [J].
Batuwita, Rukshan ;
Palade, Vasile .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2010, 18 (03) :558-571
[6]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[7]  
Bellman R.E., 1961, ADAPTIVE CONTROL PRO, V2456
[8]  
Beyer K, 1999, LECT NOTES COMPUT SC, V1540, P217
[9]  
Bhavani Raskutti, 2004, SIGKDD Explor. Newsl, V6, P60, DOI 10.1145/1007730.1007739
[10]   Joint Learning of Spectral Clustering Structure and Fuzzy Similarity Matrix of Data [J].
Bian, Zekang ;
Ishibuchi, Hisao ;
Wang, Shitong .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (01) :31-44