Cost-Sensitive Online Adaptive Kernel Learning for Large-Scale Imbalanced Classification

Cited by: 9
Authors
Chen, Yingying [1]
Hong, Zijie [1]
Yang, Xiaowei [1]
Affiliations
[1] South China Univ Technol, Sch Software Engn, Guangzhou 510006, Peoples R China
Keywords
Classification algorithms; Kernel; Costs; Adaptation models; Machine learning algorithms; Approximation algorithms; Task analysis; Adaptive algorithms; Cost function; SVM; Prediction
DOI
10.1109/TKDE.2023.3266648
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Imbalanced classification is a challenging task in machine learning, data mining, and pattern recognition. Cost-sensitive online algorithms are important methods for large-scale imbalanced classification problems. At present, most cost-sensitive classification algorithms focus on the accuracy of the minority class and ignore the accuracy of the majority class. To better balance accuracy between the minority class and the majority class, this article presents a misclassification cost that allows a cost-sensitive online algorithm to handle imbalanced classification problems without significantly reducing the accuracy of the majority class. Based on the proposed misclassification cost, a novel cost-sensitive online adaptive kernel learning algorithm is proposed to improve the adaptability of the kernel function when data arrive one by one. Based on the essential characteristics of imbalanced binary classification, a cost-sensitive online adaptive kernel learning algorithm is also given to handle large-scale imbalanced multi-class classification problems. Theoretical analyses of the proposed algorithms are provided. Extensive experiments demonstrate that, compared with state-of-the-art imbalanced classification algorithms, the proposed algorithms significantly improve classification performance on most of the large-scale imbalanced data sets.
Pages: 10554-10568
Number of pages: 15