An oversampling method for multi-class imbalanced data based on composite weights

被引:9
|
作者
Deng, Mingyang [1 ,2 ]
Guo, Yingshi [1 ]
Wang, Chang [1 ]
Wu, Fuwei [1 ]
机构
[1] Changan Univ, Sch Automobile, Xian, Peoples R China
[2] Changchun Univ Technol, Coll Automobile Engn, Coll Humanities & Informat, Changchun, Peoples R China
来源
PLOS ONE | 2021年 / 16卷 / 11期
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
ALGORITHM; CLASSIFICATION; SMOTE;
D O I
10.1371/journal.pone.0259227
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
To solve the oversampling problem of multi-class small samples and to improve their classification accuracy, we develop an oversampling method based on classification ranking and weight setting. The designed oversampling algorithm sorts the data within each class of dataset according to the distance from original data to the hyperplane. Furthermore, iterative sampling is performed within the class and inter-class sampling is adopted at the boundaries of adjacent classes according to the sampling weight composed of data density and data sorting. Finally, information assignment is performed on all newly generated sampling data. The training and testing experiments of the algorithm are conducted by using the UCI imbalanced datasets, and the established composite metrics are used to evaluate the performance of the proposed algorithm and other algorithms in comprehensive evaluation method. The results show that the proposed algorithm makes the multi-class imbalanced data balanced in terms of quantity, and the newly generated data maintain the distribution characteristics and information properties of the original samples. Moreover, compared with other algorithms such as SMOTE and SVMOM, the proposed algorithm has reached a higher classification accuracy of about 90%. It is concluded that this algorithm has high practicability and general characteristics for imbalanced multi-class samples.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] A quantum-based oversampling method for classification of highly imbalanced and overlapped data
    Yang, Bei
    Tian, Guilan
    Luttrell, Joseph
    Gong, Ping
    Zhang, Chaoyang
    EXPERIMENTAL BIOLOGY AND MEDICINE, 2023, 248 (24) : 2500 - 2513
  • [32] Clustering-Based Oversampling Algorithm for Multi-class Imbalance Learning
    Zhao, Haixia
    Wu, Jian
    JOURNAL OF CLASSIFICATION, 2025, 42 (01) : 205 - 220
  • [33] A Hybrid Sampling Approach for Imbalanced Binary and Multi-Class Data Using Clustering Analysis
    Palli, Abdul Sattar
    Jaafar, Jafreezal
    Hashmani, Manzoor Ahmed
    Gomes, Heitor Murilo
    Gilal, Abdul Rehman
    IEEE ACCESS, 2022, 10 : 118639 - 118653
  • [34] A hybrid multi-class imbalanced learning method for predicting the quality level of diesel engines
    Qin, Wei
    Zhuang, Zilong
    Guo, Liangxun
    Sun, Yanning
    JOURNAL OF MANUFACTURING SYSTEMS, 2022, 62 : 846 - 856
  • [35] Improved multi-class classification approach for imbalanced big data on spark
    Singh, Tinku
    Khanna, Riya
    Satakshi
    Kumar, Manish
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (06) : 6583 - 6611
  • [36] Oversampling With Reliably Expanding Minority Class Regions for Imbalanced Data Learning
    Zhu, Tuanfei
    Liu, Xinwang
    Zhu, En
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 6167 - 6181
  • [37] A new imbalanced data oversampling method based on Bootstrap method and Wasserstein Generative Adversarial Network
    Hou, Binjie
    Chen, Gang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (03) : 4309 - 4327
  • [38] A Novel and Effective Multi-Class Classification Method for Imbalanced Medical Transcriptions
    Bhardwaj, Priti
    Baliyan, Niyati
    IETE JOURNAL OF RESEARCH, 2024, 8 (6734-6744) : 6734 - 6744
  • [39] Oversampling the minority class in a multi-linear feature space for imbalanced data classification
    Liang, Peifeng
    Li, Weite
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2018, 13 (10) : 1483 - 1491
  • [40] Radial-Based Approach to Imbalanced Data Oversampling
    Koziarski, Michal
    Krawczyk, Bartosz
    Wozniak, Michal
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2017, 2017, 10334 : 318 - 327