Neural Network-Based Undersampling Techniques

被引:32
|
作者
Arefeen, Md Adnan [1 ,2 ]
Nimi, Sumaiya Tabassum [1 ,2 ]
Rahman, M. Sohel [3 ]
机构
[1] Univ Missouri, Dept Comp Sci Elect Engn, Kansas City, MO 64110 USA
[2] United Int Univ, Dept Comp Sci & Engn, Dhaka 1209, Bangladesh
[3] Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1205, Bangladesh
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2022年 / 52卷 / 02期
关键词
Task analysis; Noise measurement; Neurons; Machine learning algorithms; Computer science; Genetic algorithms; Autoencoder; class imbalance; classification; neural network; undersampling; CLASSIFICATION; IMBALANCE; FRAUD; SMOTE;
D O I
10.1109/TSMC.2020.3016283
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning models have gained popularity nowadays for their potential to solve real-life issues when trained on pertinent data. In many cases, the real-life data are class imbalanced and hence the corresponding machine learning models trained on the data tend to perform poorly on metrics like precision, recall, AUC, F1, and G-mean score. Since class imbalance issue poses serious challenges to the performance of trained models, a multitude of research works have addressed this issue. Two common data-based sampling techniques have mostly been proposed-undersampling the data of the majority class and oversampling the data of the minority class. In this article, we focus on the former approach. We propose two novel algorithms that employ neural network-based approaches to remove majority samples that are found to reside in the vicinity of the minority samples, thereby undersampling the former to remove (or alleviate) the imbalance issue. We delineate the proposed algorithms and then test the proposed algorithms on some publicly available imbalanced datasets. We then compare the performance of our proposed algorithms to other popular undersampling algorithms. Finally, we conclude that our proposed algorithms outperform most of the existing undersampling approaches on most performance metrics.
引用
收藏
页码:1111 / 1120
页数:10
相关论文
共 50 条
  • [41] Neural network-based geometry classification for navigation satellite selection
    Jwo, DJ
    Lai, CC
    JOURNAL OF NAVIGATION, 2003, 56 (02) : 291 - 304
  • [42] A Review of Recurrent Neural Network-Based Methods in Computational Physiology
    Mao, Shitong
    Sejdic, Ervin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 6983 - 7003
  • [43] Design and Optimization of a Neural Network-based Driver Recognition System by means of a Multiobjective Genetic Algorithm
    Echanobe, Javier
    del Campo, Ines
    Victoria Martinez, M.
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3745 - 3750
  • [44] An efficient neural network-based method for patient-specific information involved arrhythmia detection
    Liu, Yunqing
    Qin, Chengjin
    Liu, Jinlei
    Jin, Yanrui
    Li, Zhiyuan
    Liu, Chengliang
    KNOWLEDGE-BASED SYSTEMS, 2022, 250
  • [45] NNWarp: Neural Network-Based Nonlinear Deformation
    Luo, Ran
    Shao, Tianjia
    Wang, Huamin
    Xu, Weiwei
    Chen, Xiang
    Zhou, Kun
    Yang, Yin
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (04) : 1745 - 1759
  • [46] A neural network-based ionospheric model for Arecibo
    Friedrich, M.
    Fankhauser, M.
    Oyeyemi, E.
    McKinnell, L. A.
    ADVANCES IN SPACE RESEARCH, 2008, 42 (04) : 776 - 781
  • [47] Neural Network-Based Adaptive Polar Coding
    Miloslavskaya, Vera
    Li, Yonghui
    Vucetic, Branka
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (04) : 1881 - 1894
  • [48] Spatial Distribution-Based Imbalanced Undersampling
    Yan, Yuanting
    Zhu, Yuanwei
    Liu, Ruiqing
    Zhang, Yiwen
    Zhang, Yanping
    Zhang, Ling
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 6376 - 6391
  • [49] Neural Network-Based Diagnostics for PV Plant
    Cristaldi, Loredana
    Leone, Giacomo
    Vergura, Silvano
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING (EEEIC), 2016,
  • [50] Recurrent Neural Network-Based Autoencoder for Problems of Automatic Time Series Analysis at Power Facilities
    Matrenin, P., V
    Khalyasmaa, A., I
    Potachits, Y. V.
    PROBLEMELE ENERGETICII REGIONALE, 2023, (02): : 61 - 71