A survey of multi-class imbalanced data classification methods

Cited by: 4
Authors
Han, Meng [1]
Li, Ang [1]
Gao, Zhihui [1]
Mu, Dongliang [1]
Liu, Shujuan [1]
Affiliations
[1] North Minzu Univ, Sch Comp Sci & Engn, Yinchuan, Ningxia, Peoples R China
Keywords
Classification; multi-class imbalance data; data preprocessing method; algorithm-level classification method; EXTREME LEARNING-MACHINE; SELECTION; ALGORITHM; CNN
DOI
10.3233/JIFS-221902
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In practice, the data generated in many fields, such as fraud detection, network intrusion detection and disease diagnosis, are often imbalanced. The class with fewer instances is called the minority class, and in some applications the minority class contains the significant information. Many classification methods and strategies for binary imbalanced data have been proposed, but many problems and challenges in multi-class imbalanced data still urgently need to be solved. Classification methods for multi-class imbalanced data are analyzed and summarized in terms of data preprocessing methods and algorithm-level classification methods, and the performance of the algorithms on the same dataset is compared separately. Among the data preprocessing methods, oversampling, under-sampling, hybrid sampling and feature selection are the main methods introduced. Algorithm-level classification methods are introduced comprehensively in four areas: ensemble learning, neural networks, support vector machines and multi-class decomposition techniques. All data preprocessing methods and algorithm-level classification methods are also analyzed in detail in terms of the techniques used, the comparison algorithms, and their pros and cons. Moreover, the evaluation metrics commonly used for multi-class imbalanced data classification methods are described comprehensively. Finally, future directions for multi-class imbalanced data classification are given.
Pages: 2471-2501
Page count: 31
Related papers
50 in total
  • [31] Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data
    Lango, Mateusz
    Stefanowski, Jerzy
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (01) : 97 - 127
  • [32] On the class overlap problem in imbalanced data classification
    Vuttipittayamongkol, Pattaramon
    Elyan, Eyad
    Petrovski, Andrei
    KNOWLEDGE-BASED SYSTEMS, 2021, 212 (212)
  • [33] Progressive Learning Strategies for Multi-class Classification
    Er, Meng Joo
    Venkatesan, Rajasekar
    Wang, Ning
    Chien, Chiang-Ju
    2017 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2017,
  • [34] Evolutionary optimization of the area under precision-recall curve for classifying imbalanced multi-class data
    Chabbouh, Marwa
    Bechikh, Slim
    Mezura-Montes, Efren
    Ben Said, Lamjed
    JOURNAL OF HEURISTICS, 2025, 31 (01)
  • [35] Dynamic affinity-based classification of multi-class imbalanced data with one-versus-one decomposition: a fuzzy rough set approach
    Vluymans, Sarah
    Fernandez, Alberto
    Saeys, Yvan
    Cornelis, Chris
    Herrera, Francisco
    KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 56 (01) : 55 - 84
  • [36] Multi-class Classification of EEG Spectral Data for Artifact Detection
    Tokovarov, Mikhail
    Plechawska-Wojcik, Malgorzata
    Kaczorowska, Monika
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2019, PT II, 2019, 11509 : 305 - 316
  • [37] A divide and conquer approach for imbalanced multi-class classification and its application to medical decision making
    Li, Hu
    PAKISTAN JOURNAL OF PHARMACEUTICAL SCIENCES, 2016, 29 (02) : 743 - 751
  • [38] A Hybrid Sampling Approach for Imbalanced Binary and Multi-Class Data Using Clustering Analysis
    Palli, Abdul Sattar
    Jaafar, Jafreezal
    Hashmani, Manzoor Ahmed
    Gomes, Heitor Murilo
    Gilal, Abdul Rehman
    IEEE ACCESS, 2022, 10 : 118639 - 118653
  • [39] SCUT-DS: Learning from Multi-class Imbalanced Canadian Weather Data
    Olaitan, Olubukola M.
    Viktor, Herna L.
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2018), 2018, 11177 : 291 - 301
  • [40] A novel progressive learning technique for multi-class classification
    Venkatesan, Rajasekar
    Er, Meng Joo
    NEUROCOMPUTING, 2016, 207 : 310 - 321