HSDLM: A Hybrid Sampling With Deep Learning Method for Imbalanced Data Classification

Cited by: 34
|
Authors
Hasib, Khan Md [1]
Towhid, Nurul Akter [2]
Islam, Md Rafiqul [3]
Affiliations
[1] Ahsanullah Univ Sci & Engn, Dhaka, Bangladesh
[2] Jahangirnagar Univ, Dhaka, Bangladesh
[3] Univ Technol Sydney UTS, Sydney, NSW, Australia
Keywords
Class Imbalance; Classification; Deep Learning; ENN; LSTM; Sampling; SMOTE; SUPPORT
DOI
10.4018/IJCAC.2021100101
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Imbalanced data presents many difficulties: most learners are biased toward the majority class and, in severe cases, may disregard the minority class entirely. Over the last few decades, class imbalance has been extensively researched using traditional machine learning techniques. However, there is relatively little analytical research on class imbalance in the field of deep learning. In this article, the authors classify imbalanced data by combining a sampling method with a deep learning method. They propose a novel sampling-based deep learning method (HSDLM) to address the class imbalance problem. They preprocess the data with label encoding and remove noisy data with the under-sampling technique edited nearest neighbor (ENN). They then balance the data using the over-sampling technique SMOTE and apply three types of long short-term memory (LSTM) networks, a deep learning classifier, in parallel. The experimental findings indicate that HSDLM is a promising solution for working with strongly imbalanced datasets.
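The preprocessing order described in the abstract (label encoding, then ENN cleaning, then SMOTE balancing) can be illustrated with a minimal sketch. This is not the authors' implementation: the helper names (`label_encode`, `enn`, `smote`), the toy data, the value `k=3`, and the choice to apply the ENN rule to every class are all assumptions made for illustration, and the LSTM classification stage is omitted entirely.

```python
import random
from collections import Counter

def _dist2(a, b):
    # Squared Euclidean distance between two feature vectors.
    return sum((x - y) ** 2 for x, y in zip(a, b))

def _neighbors(i, points, k):
    # Indices of the k points nearest to points[i], excluding i itself.
    others = (j for j in range(len(points)) if j != i)
    return sorted(others, key=lambda j: _dist2(points[i], points[j]))[:k]

def label_encode(labels):
    # Map each distinct label to an integer, in sorted label order.
    mapping = {lab: idx for idx, lab in enumerate(sorted(set(labels)))}
    return [mapping[lab] for lab in labels], mapping

def enn(X, y, k=3):
    # Edited nearest neighbour rule (assumed variant: applied to all classes).
    # A sample is kept only if the majority of its k neighbours share its label.
    keep = []
    for i in range(len(X)):
        votes = Counter(y[j] for j in _neighbors(i, X, k))
        if votes.most_common(1)[0][0] == y[i]:
            keep.append(i)
    return [X[i] for i in keep], [y[i] for i in keep]

def smote(X, y, k=3, seed=0):
    # Basic SMOTE: for each under-represented class, create synthetic points
    # by interpolating between a sample and one of its k same-class neighbours
    # until the class matches the size of the largest class.
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        pts = [x for x, lab in zip(X, y) if lab == label]
        if n == target or len(pts) < 2:
            continue
        kk = min(k, len(pts) - 1)
        for _ in range(target - n):
            i = rng.randrange(len(pts))
            j = rng.choice(_neighbors(i, pts, kk))
            gap = rng.random()
            X_out.append([a + gap * (b - a) for a, b in zip(pts[i], pts[j])])
            y_out.append(label)
    return X_out, y_out

# Hypothetical toy data: 8 "normal" points, 4 "attack" points, and one
# mislabelled "normal" point sitting inside the "attack" cluster.
X = [[0, 0], [0, 1], [1, 0], [1, 1], [0, 2], [2, 0], [2, 1], [1, 2],
     [5, 5], [5, 6], [6, 5], [6, 6],
     [5.5, 5.5]]
labels = ["normal"] * 8 + ["attack"] * 4 + ["normal"]

y, mapping = label_encode(labels)      # step 1: label encoding
X_clean, y_clean = enn(X, y, k=3)      # step 2: ENN noise removal
X_bal, y_bal = smote(X_clean, y_clean) # step 3: SMOTE over-sampling
print(Counter(y_clean), Counter(y_bal))
```

In this sketch ENN runs before SMOTE, following the order given in the abstract, so synthetic minority points are interpolated only from samples that survived cleaning. In practice, a maintained library implementation of ENN and SMOTE would normally be preferred over hand-written helpers like these.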
Pages: 1-13
Number of pages: 13
Related papers
50 in total
  • [31] A Learning Objective Controllable Sphere-Based Method for Balanced and Imbalanced Data Classification
    Park, Yeontark
    Lee, Jong-Seok
    IEEE ACCESS, 2021, 9 : 158010 - 158026
  • [32] Skin Lesion Classification on Imbalanced Data Using Deep Learning with Soft Attention
    Viet Dung Nguyen
    Ngoc Dung Bui
    Hoang Khoi Do
    SENSORS, 2022, 22 (19)
  • [33] An ensemble imbalanced classification method based on model dynamic selection driven by data partition hybrid sampling
    Gao, Xin
    Ren, Bing
    Zhang, Hao
    Sun, Bohao
    Li, Junliang
    Xu, Jianhang
    He, Yang
    Li, Kangsheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160
  • [34] A Hybrid Approach for Binary Classification of Imbalanced Data
    Tsai, Hsinhan
    Yang, Ta-Wei
    Wong, Wai-Man
    Kao, Han-Yi
    Chou, Cheng-Fu
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2024, 23 (03)
  • [35] A Constructive Method for Data Reduction and Imbalanced Sampling
    Liu, Fei
    Yan, Yuanting
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT III, 2024, 14489 : 476 - 489
  • [36] Entropy-based hybrid sampling ensemble learning for imbalanced data
    Dongdong, Li
    Ziqiu, Chi
    Bolu, Wang
    Zhe, Wang
    Hai, Yang
    Wenli, Du
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (07) : 3039 - 3067
  • [37] Imbalanced Data Classification Based on a Hybrid Resampling SVM Method
    Cao, Lu
    Zhai, Yikui
    IEEE 12TH INT CONF UBIQUITOUS INTELLIGENCE & COMP/IEEE 12TH INT CONF ADV & TRUSTED COMP/IEEE 15TH INT CONF SCALABLE COMP & COMMUN/IEEE INT CONF CLOUD & BIG DATA COMP/IEEE INT CONF INTERNET PEOPLE AND ASSOCIATED SYMPOSIA/WORKSHOPS, 2015, : 1533 - 1536
  • [38] A Hybrid Learning Framework for Imbalanced Classification
    Jiang, Eric P.
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2022, 18 (01)
  • [39] An Effective Sampling Strategy for Ensemble Learning with Imbalanced Data
    Zhang, Chen
    Zhang, Xiaolong
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 377 - 388
  • [40] EHSO: Evolutionary Hybrid Sampling in overlapping scenarios for imbalanced learning
    Zhu, Yuanwei
    Yan, Yuanting
    Zhang, Yiwen
    Zhang, Yanping
    NEUROCOMPUTING, 2020, 417 : 333 - 346