Effective Class-Imbalance Learning Based on SMOTE and Convolutional Neural Networks

被引:41
|
作者
Joloudari, Javad Hassannataj [1 ]
Marefat, Abdolreza [2 ]
Nematollahi, Mohammad Ali [3 ]
Oyelere, Solomon Sunday [4 ]
Hussain, Sadiq [5 ]
机构
[1] Univ Birjand, Fac Engn, Dept Comp Engn, Birjand 9717434765, Iran
[2] Islamic Azad Univ, Tech & Engn Fac, Dept Artificial Intelligence, South Tehran Branch, Tehran 1477893780, Iran
[3] Fasa Univ, Dept Comp Sci, Fasa 7461686131, Iran
[4] Lulea Univ Technol, Dept Comp Sci Elect & Space Engn, S-93187 Skelleftea, Sweden
[5] Dibrugarh Univ, Examinat Branch, Dibrugarh 786004, Assam, India
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 06期
关键词
imbalanced data; resampling; normalization; deep neural network; convolutional neural network; CORONARY-ARTERY-DISEASE; CLASSIFICATION; DIAGNOSIS; CLASSIFIERS; ALGORITHMS;
D O I
10.3390/app13064006
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Imbalanced Data (ID) is a problem that deters Machine Learning (ML) models from achieving satisfactory results. ID is the occurrence of a situation where the quantity of the samples belonging to one class outnumbers that of the other by a wide margin, making such models' learning process biased towards the majority class. In recent years, to address this issue, several solutions have been put forward, which opt for either synthetically generating new data for the minority class or reducing the number of majority classes to balance the data. Hence, in this paper, we investigate the effectiveness of methods based on Deep Neural Networks (DNNs) and Convolutional Neural Networks (CNNs) mixed with a variety of well-known imbalanced data solutions meaning oversampling and undersampling. Then, we propose a CNN-based model in combination with SMOTE to effectively handle imbalanced data. To evaluate our methods, we have used KEEL, breast cancer, and Z-Alizadeh Sani datasets. In order to achieve reliable results, we conducted our experiments 100 times with randomly shuffled data distributions. The classification results demonstrate that the mixed Synthetic Minority Oversampling Technique (SMOTE)-Normalization-CNN outperforms different methodologies achieving 99.08% accuracy on the 24 imbalanced datasets. Therefore, the proposed mixed model can be applied to imbalanced binary classification problems on other real datasets.
引用
收藏
页数:34
相关论文
共 50 条
  • [21] Output Layer Multiplication for Class Imbalance Problem in Convolutional Neural Networks
    Zhao Yang
    Yuanxin Zhu
    Tie Liu
    Sai Zhao
    Yunyan Wang
    Dapeng Tao
    Neural Processing Letters, 2020, 52 : 2637 - 2653
  • [22] A Cost Sensitive and Class-imbalance Classification Method based on Neural Network for Disease Diagnosis
    He, Fei
    Yang, Huamin
    Miao, Yu
    Louis, Rainbow
    2016 8TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME), 2016, : 7 - 10
  • [23] Output Layer Multiplication for Class Imbalance Problem in Convolutional Neural Networks
    Yang, Zhao
    Zhu, Yuanxin
    Liu, Tie
    Zhao, Sai
    Wang, Yunyan
    Tao, Dapeng
    NEURAL PROCESSING LETTERS, 2020, 52 (03) : 2637 - 2653
  • [24] SWSEL: Sliding Window-based Selective Ensemble Learning for class-imbalance problems
    Dai, Qi
    Liu, Jian-wei
    Yang, Jia-Peng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
  • [25] Ensemble of Cost-Sensitive Hypernetworks for Class-Imbalance Learning
    Wang, Jin
    Huang, Ping-li
    Sun, Kai-wei
    Cao, Bao-lin
    Zhao, Rui
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 1883 - 1888
  • [26] Towards Class-Imbalance Aware Multi-Label Learning
    Zhang, Min-Ling
    Li, Yu-Kun
    Liu, Xu-Ying
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 4041 - 4047
  • [27] An Ensemble Learning Approach with Gradient Resampling for Class-Imbalance Problems
    Zhao, Hongke
    Zhao, Chuang
    Zhang, Xi
    Liu, Nanlin
    Zhu, Hengshu
    Liu, Qi
    Xiong, Hui
    INFORMS JOURNAL ON COMPUTING, 2023, 35 (04) : 747 - 763
  • [28] Towards Class-Imbalance Aware Multi-Label Learning
    Zhang, Min-Ling
    Li, Yu-Kun
    Yang, Hao
    Liu, Xu-Ying
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) : 4459 - 4471
  • [29] A systematic review for class-imbalance in semi-supervised learning
    de Oliveira, Willian Dihanster Gomes
    Berton, Lilian
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 2) : 2349 - 2382
  • [30] Tackling the Class-Imbalance Learning Problem in Semantic Web Knowledge Bases
    Rizzo, Giuseppe
    d'Amato, Claudia
    Fanizzi, Nicola
    Esposito, Floriana
    KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, EKAW 2014, 2014, 8876 : 453 - 468