Ensemble learning method based on CNN for class imbalanced data

被引:0
作者
Xin Zhong
Nan Wang
机构
[1] Heilongjiang University,School of Mathematical Sciences
来源
The Journal of Supercomputing | 2024年 / 80卷
关键词
Convolutional neural networks; Imbalanced data; Machine learning; Big data;
D O I
暂无
中图分类号
学科分类号
摘要
Classifying imbalanced data presents a significant challenge, and many studies have proposed methodologies to address this issue. Among them, Convolutional Neural Networks have demonstrated superior performance for imbalanced image classification. This paper initially employs various data pre-processing methods such as over-sampling, under-sampling, and SMOTE to enhance the original dataset. Subsequently, an Ensemble CNN learning model is used to train and predict the data. In order to comprehensively evaluate models trained on imbalanced data, we used metrics such as Accuracy, Recall, Precision, F1-score, and G-mean. On the CIFAR-10 and Fashion-MNIST datasets, different samples from each category were extracted as imbalanced data for experimental research. Compared to the AdaBoost-DenseNet model, our proposed methodology increases the test accuracy on the CIFAR-10 dataset by 9%. Similarly, the F1-score and G-mean improved by 0.096 and 0.069, respectively. Compared to traditional methodologies, our proposed method significantly improves accuracy, recall, precision, and other performance indicators.
引用
收藏
页码:10090 / 10121
页数:31
相关论文
共 50 条
[21]   A review of ensemble learning and data augmentation models for class imbalanced problems: Combination, implementation and evaluation [J].
Khan, Azal Ahmad ;
Chaudhari, Omkar ;
Chandra, Rohitash .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 244
[22]   An Ensemble Learning Algorithm Based on Density Peaks Clustering and Fitness for Imbalanced Data [J].
Xu, Hui ;
Liu, Qicheng .
IEEE ACCESS, 2022, 10 :116120-116128
[23]   MCNN-LSTM: Combining CNN and LSTM to Classify Multi-Class Text in Imbalanced News Data [J].
Hasib, Khan Md ;
Azam, Sami ;
Karim, Asif ;
Marouf, Ahmed Al ;
Shamrat, F. M. Javed Mehedi ;
Montaha, Sidratul ;
Yeo, Kheng Cher ;
Jonkman, Mirjam ;
Alhajj, Reda ;
Rokne, Jon G. .
IEEE ACCESS, 2023, 11 :93048-93063
[24]   RRBoost: a new ensemble method for classifying imbalanced data [J].
Park, Hyejoon ;
Kim, Hyunjoong .
KNOWLEDGE AND INFORMATION SYSTEMS, 2025,
[25]   GIR-based canonical forest: An ensemble method for imbalanced big data [J].
Han, Solji ;
Myung, Jaesung ;
Kim, Hyunjoong .
KOREAN JOURNAL OF APPLIED STATISTICS, 2024, 37 (05)
[26]   Evaluating Integrated Weight Linear Method to Class Imbalanced Learning in Video Data [J].
Apandi, Ziti Fariha Mohd ;
Mustapha, Norwati ;
Affendey, Lilly Suriani .
2011 3RD CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2011, :243-247
[27]   Empowering one-vs-one decomposition with ensemble learning for multi-class imbalanced data [J].
Zhang, Zhongliang ;
Krawczyk, Bartosz ;
Garcia, Salvador ;
Rosales-Perez, Alejandro ;
Herrera, Francisco .
KNOWLEDGE-BASED SYSTEMS, 2016, 106 :251-263
[28]   A machine learning method for incomplete and imbalanced medical data [J].
Salman, Issam ;
Vomlel, Jiri .
PROCEEDINGS OF THE 20TH CZECH-JAPAN SEMINAR ON DATA ANALYSIS AND DECISION MAKING UNDER UNCERTAINTY, 2017, :188-195
[29]   BPSO-Adaboost-KNN ensemble learning algorithm for multi-class imbalanced data classification [J].
Guo Haixiang ;
Li Yijing ;
Li Yanan ;
Liu Xiao ;
Li Jinling .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 49 :176-193
[30]   Combining One-vs-One Decomposition and Ensemble Learning for Multi-class Imbalanced Data [J].
Krawczyk, Bartosz .
PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 :27-36