Ensemble learning method based on CNN for class imbalanced data

被引:0
作者
Xin Zhong
Nan Wang
机构
[1] Heilongjiang University,School of Mathematical Sciences
来源
The Journal of Supercomputing | 2024年 / 80卷
关键词
Convolutional neural networks; Imbalanced data; Machine learning; Big data;
D O I
暂无
中图分类号
学科分类号
摘要
Classifying imbalanced data presents a significant challenge, and many studies have proposed methodologies to address this issue. Among them, Convolutional Neural Networks have demonstrated superior performance for imbalanced image classification. This paper initially employs various data pre-processing methods such as over-sampling, under-sampling, and SMOTE to enhance the original dataset. Subsequently, an Ensemble CNN learning model is used to train and predict the data. In order to comprehensively evaluate models trained on imbalanced data, we used metrics such as Accuracy, Recall, Precision, F1-score, and G-mean. On the CIFAR-10 and Fashion-MNIST datasets, different samples from each category were extracted as imbalanced data for experimental research. Compared to the AdaBoost-DenseNet model, our proposed methodology increases the test accuracy on the CIFAR-10 dataset by 9%. Similarly, the F1-score and G-mean improved by 0.096 and 0.069, respectively. Compared to traditional methodologies, our proposed method significantly improves accuracy, recall, precision, and other performance indicators.
引用
收藏
页码:10090 / 10121
页数:31
相关论文
共 101 条
[1]  
He X(2005)Face recognition using Laplacianfaces IEEE Trans Pattern Anal Mach Intell 27 328-340
[2]  
Yan S(2014)A hybrid intelligent system for medical data classification Expert Syst Appl 41 2239-2249
[3]  
Hu Y(2017)Instance categorization by support vector machines to adjust weights in AdaBoost for imbalanced data classification Inf Sci 381 92-103
[4]  
Niyogi P(2016)Ordering-based pruning for improving the performance of ensembles of classifiers in the framework of imbalanced datasets Inf Sci 354 178-196
[5]  
Zhang H(1993)The CNN paradigm IEEE Trans Circuits Syst I: Fundam Theory Appl 40 147-156
[6]  
Seera M(2015)Deep convolutional neural networks for multi-modality isointense infant brain image segmentation Neuroimage 108 214-224
[7]  
Lim CP(2020)A weighted hybrid ensemble method for classifying imbalanced data Knowl-Based Syst 203 106087-74777
[8]  
Lee W(2021)Radius-SMOTE: a new oversampling technique of minority samples based on radius distance for learning from imbalanced data IEEE Access 9 74763-259
[9]  
Jun CH(2018)A systematic study of the class imbalance problem in convolutional neural networks Neural Netw 106 249-11159
[10]  
Lee JS(2023)Enhanced pre-processing approach using ensemble machine learning algorithms for detecting liver disease Biomedicines 11 581-232