HSDLM: A Hybrid Sampling With Deep Learning Method for Imbalanced Data Classification

被引:34
|
作者
Hasib, Khan Md [1 ]
Towhid, Nurul Akter [2 ]
Islam, Md Rafiqul [3 ]
机构
[1] Ahsanullah Univ Sci & Engn, Dhaka, Bangladesh
[2] Jahangirnagar Univ, Dhaka, Bangladesh
[3] Univ Technol Sydney UTS, Sydney, NSW, Australia
关键词
Class Imbalance; Classification; Deep Learning; ENN; LSTM; Sampling; SMOTE; SUPPORT; SMOTE;
D O I
10.4018/IJCAC.2021100101
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Imbalanced data presents many difficulties, as the majority of learners will be prejudice against the majority class, and in severe cases, may fully disregard the minority class. Over the last few decades, class inequality has been extensively researched using traditional machine learning techniques. However, there is relatively little analytical research in the field of deep learning with class inequality. In this article, the authors classify the imbalanced data with the combination of both sampling method and deep learning method. They propose a novel sampling-based deep learning method (HSDLM) to address the class imbalance problem. They preprocess the data with label encoding and remove the noisy data with the under-sampling technique edited nearest neighbor (ENN) algorithm. They also balance the data using the over-sampling technique SMOTE and apply parallelly three types of long short-term memory networks, which is a deep learning classifier. The experimental findings indicate that HSDLM is a promising and fruitful solution to working with strongly imbalanced datasets.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [21] High-Resolution Remote Sensing Imagery Classification of Imbalanced Data Using Multistage Sampling Method and Deep Neural Networks
    Xia, Wei
    Ma, Caihong
    Liu, Jianbo
    Liu, Shibin
    Chen, Fu
    Yang, Zhi
    Duan, Jianbo
    REMOTE SENSING, 2019, 11 (21)
  • [22] An Effective Over-sampling Method for Imbalanced Data Sets Classification
    Zhai Yun
    Ma Nan
    Ruan Da
    An Bing
    CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (03): : 489 - 494
  • [23] Exploratory parallel hybrid sampling framework for imbalanced data classification
    Zheng, Ming
    Zhao, Zhuo
    Wang, Fei
    Hu, Xiaowen
    Xu, Sheng
    Li, Wanggen
    Li, Tong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [24] A Hybrid Under-Sampling Method (HUSBoost) to Classify Imbalanced Data
    Popel, Mahmudul Hasan
    Hasib, Khan Md
    Habib, Syed Ahsan
    Shah, Faisal Muhammad
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [25] Robust hybrid data-level sampling approach to handle imbalanced data during classification
    Kaur, Prabhjot
    Gosain, Anjana
    SOFT COMPUTING, 2020, 24 (20) : 15715 - 15732
  • [26] Online Extreme Learning Machine with Hybrid Sampling Strategy for Sequential Imbalanced Data
    Mao, Wentao
    Jiang, Mengxue
    Wang, Jinwan
    Li, Yuan
    COGNITIVE COMPUTATION, 2017, 9 (06) : 780 - 800
  • [27] Hybrid sampling-based contrastive learning for imbalanced node classification
    Cui, Caixia
    Wang, Jie
    Wei, Wei
    Liang, Jiye
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (03) : 989 - 1001
  • [28] Class imbalanced data handling with cyberattack classification using Hybrid Salp Swarm Algorithm with deep learning approach
    Alabduallah, Bayan
    Maray, Mohammed
    Alruwais, Nuha
    Alabdan, Rana
    Darem, Abdulbasit A.
    Alallah, Fouad Shoie
    Alsini, Raed
    Yafoz, Ayman
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 106 : 654 - 663
  • [29] Parallel selective sampling method for imbalanced and large data classification
    D'Addabbo, Annarita
    Maglietta, Rosalia
    PATTERN RECOGNITION LETTERS, 2015, 62 : 61 - 67
  • [30] A cluster-based hybrid sampling approach for imbalanced data classification
    Feng, Shou
    Zhao, Chunhui
    Fu, Ping
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2020, 91 (05)