Relative Density-Based Intuitionistic Fuzzy SVM for Class Imbalance Learning

被引:6
作者
Fu, Cui [1 ]
Zhou, Shuisheng [1 ]
Zhang, Dan [1 ]
Chen, Li [2 ]
机构
[1] Xidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
[2] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
基金
中国国家自然科学基金;
关键词
fuzzy support vector machine (FSVM); class imbalance learning; intuitionistic fuzzy number (IFN); relative density; SUPPORT VECTOR MACHINE; K-NEAREST-NEIGHBOR;
D O I
10.3390/e25010034
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The support vector machine (SVM) has been combined with the intuitionistic fuzzy set to suppress the negative impact of noises and outliers in classification. However, it has some inherent defects, resulting in the inaccurate prior distribution estimation for datasets, especially the imbalanced datasets with non-normally distributed data, further reducing the performance of the classification model for imbalance learning. To solve these problems, we propose a novel relative density-based intuitionistic fuzzy support vector machine (RIFSVM) algorithm for imbalanced learning in the presence of noise and outliers. In our proposed algorithm, the relative density, which is estimated by adopting the k-nearest-neighbor distances, is used to calculate the intuitionistic fuzzy numbers. The fuzzy values of the majority class instances are designed by multiplying the score function of the intuitionistic fuzzy number by the imbalance ratio, and the fuzzy values of minority class instances are assigned the intuitionistic fuzzy membership degree. With the help of the strong capture ability of the relative density to prior information and the strong recognition ability of the intuitionistic fuzzy score function to noises and outliers, the proposed RIFSVM not only reduces the influence of class imbalance but also suppresses the impact of noises and outliers, and further improves the classification performance. Experiments on the synthetic and public imbalanced datasets show that our approach has better performance in terms of G-Means, F-Measures, and AUC than the other class imbalance classification algorithms.
引用
收藏
页数:21
相关论文
共 38 条
[1]   INTUITIONISTIC FUZZY-SETS [J].
ATANASSOV, KT .
FUZZY SETS AND SYSTEMS, 1986, 20 (01) :87-96
[2]   FSVM-CIL: Fuzzy Support Vector Machines for Class Imbalance Learning [J].
Batuwita, Rukshan ;
Palade, Vasile .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2010, 18 (03) :558-571
[3]  
Biau Gerard, 2015, Lectures on the Nearest Neighbor Method, P25, DOI 10.1007/ 978-3-319- 25388-6_3
[4]   Affinity and transformed class probability-based fuzzy least squares support vector machines [J].
Borah, Parashjyoti ;
Gupta, Deepak .
FUZZY SETS AND SYSTEMS, 2022, 443 :203-235
[5]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[6]   A data mining based system for credit-card fraud detection in e-tail [J].
Carneiro, Nuno ;
Figueira, Goncalo ;
Costa, Miguel .
DECISION SUPPORT SYSTEMS, 2017, 95 :91-101
[7]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[8]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[9]   Entropy-based fuzzy support vector machine for imbalanced datasets [J].
Fan, Qi ;
Wang, Zhe ;
Li, Dongdong ;
Gao, Daqi ;
Zha, Hongyuan .
KNOWLEDGE-BASED SYSTEMS, 2017, 115 :87-99
[10]   Risk-Averse support vector classifier machine via moments penalization [J].
Fu, Cui ;
Zhou, Shuisheng ;
Zhang, Junna ;
Han, Banghe ;
Chen, Yuxue ;
Ye, Feng .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (11) :3341-3358