Ensemble learning method based on CNN for class imbalanced data

Cited by: 0
Authors
Xin Zhong
Nan Wang
Affiliation
[1] School of Mathematical Sciences, Heilongjiang University
Source
The Journal of Supercomputing | 2024 / Vol. 80
Keywords
Convolutional neural networks; Imbalanced data; Machine learning; Big data
Abstract
Classifying imbalanced data presents a significant challenge, and many studies have proposed methodologies to address this issue. Among them, Convolutional Neural Networks have demonstrated superior performance for imbalanced image classification. This paper initially employs various data pre-processing methods such as over-sampling, under-sampling, and SMOTE to enhance the original dataset. Subsequently, an Ensemble CNN learning model is used to train and predict the data. In order to comprehensively evaluate models trained on imbalanced data, we used metrics such as Accuracy, Recall, Precision, F1-score, and G-mean. On the CIFAR-10 and Fashion-MNIST datasets, different samples from each category were extracted as imbalanced data for experimental research. Compared to the AdaBoost-DenseNet model, our proposed methodology increases the test accuracy on the CIFAR-10 dataset by 9%. Similarly, the F1-score and G-mean improved by 0.096 and 0.069, respectively. Compared to traditional methodologies, our proposed method significantly improves accuracy, recall, precision, and other performance indicators.
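The abstract names its evaluation metrics without defining them. As a rough illustration only, the sketch below computes accuracy, macro-averaged recall, precision and F1-score, and G-mean from a confusion matrix. It assumes G-mean is the geometric mean of per-class recalls and that the other metrics are macro-averaged; the paper may define or average them differently. The function names `confusion_matrix` and `imbalance_metrics` are hypothetical and not taken from the paper.

```python
import math


def confusion_matrix(y_true, y_pred, n_classes):
    """Build an n_classes x n_classes matrix; rows are true labels, columns are predictions."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def imbalance_metrics(y_true, y_pred, n_classes):
    """Return accuracy, macro recall/precision/F1, and G-mean.

    Assumption: G-mean is the geometric mean of per-class recalls,
    and recall, precision, and F1 are macro-averaged over classes.
    """
    cm = confusion_matrix(y_true, y_pred, n_classes)
    total = sum(sum(row) for row in cm)
    recalls, precisions, f1s = [], [], []
    for k in range(n_classes):
        tp = cm[k][k]
        actual = sum(cm[k])                                   # samples whose true label is k
        predicted = sum(cm[i][k] for i in range(n_classes))   # samples predicted as k
        r = tp / actual if actual else 0.0
        p = tp / predicted if predicted else 0.0
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        recalls.append(r)
        precisions.append(p)
        f1s.append(f)
    return {
        "accuracy": sum(cm[k][k] for k in range(n_classes)) / total,
        "recall": sum(recalls) / n_classes,
        "precision": sum(precisions) / n_classes,
        "f1": sum(f1s) / n_classes,
        "g_mean": math.prod(recalls) ** (1.0 / n_classes),
    }


if __name__ == "__main__":
    # Toy imbalanced example: four samples of class 0, two of class 1.
    y_true = [0, 0, 0, 0, 1, 1]
    y_pred = [0, 0, 0, 1, 1, 0]
    print(imbalance_metrics(y_true, y_pred, n_classes=2))
```

Under this definition, G-mean falls to zero as soon as any single class has zero recall, which is why it is often reported alongside accuracy on imbalanced data: a model that ignores a minority class can still score high accuracy.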
Pages: 10090-10121
Page count: 31