Multi-Class Imbalance Classification Based on Data Distribution and Adaptive Weights

被引:7
|
作者
Li, Shuxian [1 ,2 ,3 ]
Song, Liyan [1 ,2 ,4 ]
Wu, Xiaoyu [5 ]
Hu, Zheng [5 ]
Cheung, Yiu-ming [3 ]
Yao, Xin [6 ,7 ]
机构
[1] Southern Univ Sci & Technol, Res Inst Trustworthy Autonomous Syst, Shenzhen 518055, Guangdong, Peoples R China
[2] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Guangdong Prov Key Lab Brain Inspired Intelligent, Shenzhen 518055, Guangdong, Peoples R China
[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[4] Harbin Inst Technol, Fac Comp, Harbin 150001, Heilongjiang, Peoples R China
[5] Huawei Technol Co Ltd, RAMS Reliabil Technol Lab, Shenzhen 518129, Guangdong, Peoples R China
[6] Lingnan Univ, Dept Comp & Decis Sci, Hong Kong, Peoples R China
[7] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, England
基金
中国国家自然科学基金;
关键词
Training; Ensemble learning; Costs; Computer science; Linear programming; Learning systems; Classification algorithms; Multi-class imbalance classification; ensembles; AdaBoost; adaptive weight; data density; OVER-SAMPLING TECHNIQUE; SMOTE;
D O I
10.1109/TKDE.2024.3384961
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
AdaBoost approaches have been used for multi-class imbalance classification with an imbalance ratio measured on class sizes. However, such ratio would assign each training sample of the same class with the same weight, thus failing to reflect the data distribution within a class. We propose to incorporate the density information of training samples into the class imbalance ratio so that samples of the same class could have different weights. As one could use the entire training set to calculate the imbalance and density factors, the weight of a training sample resulting from the two factors remains static throughout the training epochs. However, static weights could not reflect the up-to-date training status of base learners. To deal with this, we propose to design an adaptive weighting mechanism by making use of up-to-date training status to further alleviate the multi-class imbalance issue. Ultimately, we incorporate the class imbalance ratio, the density-based factor, and the adaptive weighting mechanism into a single variable, based on which the adaptive weights of all training samples are computed. Experimental studies are carried out to investigate the effectiveness of the proposed approach and each of the three components in dealing with multi-class imbalance classification problem.
引用
收藏
页码:5265 / 5279
页数:15
相关论文
共 50 条
  • [1] ARConvL: Adaptive Region-Based Convolutional Learning for Multi-class Imbalance Classification
    Li, Shuxian
    Song, Liyan
    Wu, Xiaoyu
    Hu, Zheng
    Cheung, Yiu-ming
    Yao, Xin
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 103 - 120
  • [2] Improved Ensemble Methods to Solve Multi-class Imbalance Problem Using Adaptive Weights
    Kokilam, K. Vasantha
    Latha, D. Ponmary Pushpa
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING, 2018, 9 : 333 - 343
  • [3] CLASSIFICATION OF LIDAR DATA BASED ON MULTI-CLASS SVM
    Samadzadegan, F.
    Bigdeli, B.
    Ramzi, P.
    2010 CANADIAN GEOMATICS CONFERENCE AND SYMPOSIUM OF COMMISSION I, ISPRS CONVERGENCE IN GEOMATICS - SHAPING CANADA'S COMPETITIVE LANDSCAPE, 2010, 38
  • [4] Kernel Based Data-Adaptive Support Vector Machines for Multi-Class Classification
    Shao, Jianli
    Liu, Xin
    He, Wenqing
    MATHEMATICS, 2021, 9 (09)
  • [5] An oversampling method for multi-class imbalanced data based on composite weights
    Deng, Mingyang
    Guo, Yingshi
    Wang, Chang
    Wu, Fuwei
    PLOS ONE, 2021, 16 (11):
  • [6] An adaptive Bagging algorithm based on lightweight transformer for multi-class imbalance recognition
    Wang, Junyi
    Jiang, Xuezheng
    Liu, Hailian
    Cai, Haibin
    Meng, Qinggang
    MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [7] An adaptive Bagging algorithm based on lightweight transformer for multi-class imbalance recognition
    Junyi Wang
    Xuezheng Jiang
    Hailian Liu
    Haibin Cai
    Qinggang Meng
    Multimedia Systems, 2024, 30
  • [8] Boosting with Adaptive Sampling for Multi-class Classification
    Chen, Jianhua
    2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 667 - 672
  • [9] Imbalance accuracy metric for model selection in multi-class imbalance classification problems
    Mortaz, Ebrahim
    KNOWLEDGE-BASED SYSTEMS, 2020, 210
  • [10] A comparison of improving multi-class imbalance for internet traffic classification
    Qiong Liu
    Zhen Liu
    Information Systems Frontiers, 2014, 16 : 509 - 521