MVQS: Robust multi-view instance-level cost-sensitive learning method for imbalanced data classification

被引:6
作者
Hou, Zhaojie [1 ,2 ]
Tang, Jingjing [1 ,2 ]
Li, Yan [1 ,2 ]
Fu, Saiji [3 ]
Tian, Yingjie [4 ,5 ,6 ,7 ]
机构
[1] Southwestern Univ Finance & Econ, Fac Business Adm, Sch Business Adm, Chengdu 611130, Peoples R China
[2] Southwestern Univ Finance & Econ, Inst Big Data, Chengdu 611130, Peoples R China
[3] Beijing Univ Posts & Telecommun, Sch Econ & Management, Beijing 100876, Peoples R China
[4] Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China
[5] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
[6] Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
[7] UCAS, MOE Social Sci Lab Digital Econ Forecasts & Policy, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-view learning; Imbalanced classes; Noisy samples; Support vector machine; QTSE loss function; SUPPORT VECTOR MACHINE; KERNEL-METHOD; CONSENSUS; BLINEX; NOISE;
D O I
10.1016/j.ins.2024.120467
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-view imbalanced learning is to handle the datasets with multi-view representations and imbalanced classes. Existing multi-view imbalanced learning methods can be divided into two main categories: multi-view ensemble learning and multi-view cost-sensitive learning. However, these methods suffer from the following problems: 1) neglecting either consensus or complementary information, 2) complex preprocessing and information fusion in multiview ensemble learning and manual assignment of misclassification costs in multi-view costsensitive learning, and 3) limited ability to handle noisy samples. Therefore, we aim to design a concise and unified framework to grapple with the multi-view representations, imbalanced classes and noisy samples simultaneously. Inspired by the merits of support vector machine (SVM) and quadratic type squared error (QTSE) loss function, we propose a robust multi-view instance-level cost-sensitive SVM with QTSE loss (MVQS) for imbalanced data classification. The consensus regularization term and combination weight strategy are employed to fully exploit multi-view information. The QTSE loss can adaptively impose instance-level penalties to the misclassification of samples, and make MVQS be robust to noisy samples. We solve MVQS with the alternating direction method of multipliers (ADMM) and the gradient descent (GD) algorithm. Comprehensive experiments validate that MVQS is more competitive and robust than other benchmark approaches.
引用
收藏
页数:25
相关论文
共 50 条
[21]   Incremental Cost-Sensitive Support Vector Machine With Linear-Exponential Loss [J].
Ma, Yue ;
Zhao, Kun ;
Wang, Qi ;
Tian, Yingjie .
IEEE ACCESS, 2020, 8 :149899-149914
[22]   Adaptive Margin Aware Complement-Cross Entropy Loss for Improving Class Imbalance in Multi-View Sleep Staging Based on EEG Signals [J].
Miao, Fahui ;
Yao, Li ;
Zhao, Xiaojie .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2022, 30 :2927-2938
[23]   INFFC: An iterative class noise filter based on the fusion of classifiers with noise sensitivity control [J].
Saez, Jose A. ;
Galar, Mike ;
Luengo, Julian ;
Herrera, Francisco .
INFORMATION FUSION, 2016, 27 :19-32
[24]   A unifying view of class overlap and imbalance: Key concepts, multi-view panorama, and open avenues for research [J].
Santos, Miriam Seoane ;
Abreu, Pedro Henriques ;
Japkowicz, Nathalie ;
Fernandez, Alberto ;
Santos, Joao .
INFORMATION FUSION, 2023, 89 :228-253
[25]   An empirical study of the classification performance of learners on imbalanted and noisy software quality data [J].
Seiffert, Chris ;
Khoshgoftaar, Taghi M. ;
Van Hulse, Jason ;
Folleco, Andres .
INFORMATION SCIENCES, 2014, 259 :571-595
[26]   Support vector machine classifier with truncated pinball loss [J].
Shen, Xin ;
Niu, Lingfeng ;
Qi, Zhiquan ;
Tian, Yingjie .
PATTERN RECOGNITION, 2017, 68 :199-210
[27]   A stable variant of linex loss SVM for handling noise with reduced hyperparameters [J].
Shrivastava, Saurabh ;
Shukla, Sanyam ;
Khare, Nilay .
INFORMATION SCIENCES, 2023, 646
[28]   Multi-view ensemble learning based on distance-to-model and adaptive clustering for imbalanced credit risk assessment in P2P lending [J].
Song, Yu ;
Wang, Yuyan ;
Ye, Xin ;
Wang, Dujuan ;
Yin, Yunqiang ;
Wang, Yanzhang .
INFORMATION SCIENCES, 2020, 525 (525) :182-204
[29]   A survey of multi-view machine learning [J].
Sun, Shiliang .
NEURAL COMPUTING & APPLICATIONS, 2013, 23 (7-8) :2031-2038
[30]   Multi-view representation learning with Kolmogorov-Smirnov to predict default based on imbalanced and complex dataset [J].
Tan, Yandan ;
Zhao, Guangcai .
INFORMATION SCIENCES, 2022, 596 :380-394