Neural Network-Based Undersampling Techniques

Cited by: 32
Authors
Arefeen, Md Adnan [1, 2]
Nimi, Sumaiya Tabassum [1, 2]
Rahman, M. Sohel [3]
Affiliations
[1] Univ Missouri, Dept Comp Sci Elect Engn, Kansas City, MO 64110 USA
[2] United Int Univ, Dept Comp Sci & Engn, Dhaka 1209, Bangladesh
[3] Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1205, Bangladesh
Source
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2022 / Vol. 52 / No. 02
Keywords
Task analysis; Noise measurement; Neurons; Machine learning algorithms; Computer science; Genetic algorithms; Autoencoder; class imbalance; classification; neural network; undersampling; CLASSIFICATION; IMBALANCE; FRAUD; SMOTE;
DOI
10.1109/TSMC.2020.3016283
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Machine learning models have gained popularity for their potential to solve real-life problems when trained on pertinent data. In many cases, real-life data are class imbalanced, and models trained on such data tend to perform poorly on metrics like precision, recall, AUC, F1, and G-mean score. Since the class imbalance issue poses serious challenges to the performance of trained models, a multitude of research works have addressed it. Two data-based sampling techniques have most commonly been proposed: undersampling the majority class and oversampling the minority class. In this article, we focus on the former approach. We propose two novel algorithms that employ neural network-based approaches to remove majority samples that are found to reside in the vicinity of the minority samples, thereby undersampling the former to remove (or alleviate) the imbalance issue. We describe the proposed algorithms and test them on some publicly available imbalanced datasets. We then compare their performance to that of other popular undersampling algorithms. Finally, we conclude that our proposed algorithms outperform most of the existing undersampling approaches on most performance metrics.
Pages: 1111-1120
Page count: 10