A survey of multi-class imbalanced data classification methods

Cited by: 4
Authors
Han, Meng [1]
Li, Ang [1]
Gao, Zhihui [1]
Mu, Dongliang [1]
Liu, Shujuan [1]
Affiliation
[1] North Minzu Univ, Sch Comp Sci & Engn, Yinchuan, Ningxia, Peoples R China
Keywords
Classification; multi-class imbalance data; data preprocessing method; algorithm-level classification method; EXTREME LEARNING-MACHINE; SELECTION; ALGORITHM; CNN
DOI
10.3233/JIFS-221902
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In practice, the data generated in many fields, such as fraud detection, network intrusion detection and disease diagnosis, are often imbalanced. The class with fewer instances is called the minority class, and in some applications the minority class contains the most significant information. Many classification methods and strategies have been proposed for binary imbalanced data, but multi-class imbalanced data still pose many problems and challenges that need to be solved urgently. This survey analyzes and summarizes classification methods for multi-class imbalanced data in terms of data preprocessing methods and algorithm-level classification methods, and separately compares the performance of algorithms that use the same dataset. Among the data preprocessing methods, oversampling, under-sampling, hybrid sampling and feature selection are mainly introduced. Algorithm-level classification methods are comprehensively introduced in four aspects: ensemble learning, neural networks, support vector machines and multi-class decomposition techniques. All data preprocessing methods and algorithm-level classification methods are also analyzed in detail in terms of the techniques used, the comparison algorithms, and their pros and cons. Moreover, the evaluation metrics commonly used for multi-class imbalanced data classification methods are described comprehensively. Finally, future directions for multi-class imbalanced data classification are given.
Pages: 2471-2501
Number of pages: 31