Image classification method on class imbalance datasets using multi-scale CNN and two-stage transfer learning

被引：12

作者：

Liu, Jiahuan ^{[1
]}

Guo, Fei ^{[1
]}

Gao, Huang ^{[1
]}

Huang, Zhigao ^{[1
]}

Zhang, Yun ^{[1
]}

Zhou, Huamin ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Sch Mat Sci & Engn, State Key Lab Mat Proc & Die & Mould Technol, Wuhan 430074, Peoples R China

来源：

NEURAL COMPUTING & APPLICATIONS | 2021年 / 33卷 / 21期

关键词：

Image classification; Class imbalance; Convolutional neural network; Transfer learning;

D O I：

10.1007/s00521-021-06066-8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image classification tasks widely exist in many actual scenarios, including medicine, security, manufacture and finance. A major problem that hurts algorithm performance of image classification is the class imbalance of training datasets, which is caused by the difficulty in collecting minority class samples. Current methods handle this class imbalance problem from three aspects: data resample, cost-sensitive loss function and ensemble learning. However, the average accuracy of these common methods is about 95% and performance gets degenerating dramatically when the training datasets are extremely imbalanced. We propose an image classification method on class imbalance datasets using multi-scale convolutional neural network and two-stage transfer learning. Proposed methods extract multi-scale image features using convolutional kernels with different receptive fields and reuse image knowledge of other classification task to improve model representation capability using two-stage transfer strategy. Comparison experiments are carried to verify the performance of proposed methods on DAGM texture dataset, MURA medical dataset and an industrial dataset. The average accuracy obtained by proposed methods reaches about 99% which is 2.32% higher than commonly used methods over all the cases of different imbalance ratio, accuracy increase of 4.0% is achieved when some datasets are extremely imbalanced. Besides, proposed method can also achieve best accuracy of more than 99% on the industrial dataset containing only several negative samples. In addition, visualization technique is applied to prove that the accuracy boost comes from advantage of proposed architecture and training strategy.

引用

页码：14179 / 14197

页数：19

共 44 条

[1] Abadi Martin, 2016, arXiv
[2] MFC-GAN: Class-imbalanced dataset classification using Multiple Fake Class Generative Adversarial Network
Ali-Gombe, Adamu
Elyan, Eyad
[J]. NEUROCOMPUTING, 2019, 361 : 212 - 221
[3] Software defect prediction using cost-sensitive neural network
Arar, Omer Faruk
Ayan, Kursat
[J]. APPLIED SOFT COMPUTING, 2015, 33 : 263 - 277
[4] A systematic study of the class imbalance problem in convolutional neural networks
Buda, Mateusz
Maki, Atsuto
Mazurowski, Maciej A.
[J]. NEURAL NETWORKS, 2018, 106 : 249 - 259
[5] Chollet F., 2015, Keras: Deep learning library for theano and tensorflow
[6] Optimize TSK Fuzzy Systems for Classification Problems: Minibatch Gradient Descent With Uniform Regularization and Batch Normalization
Cui, Yuqi
Wu, Dongrui
Huang, Jian
[J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (12) : 3065 - 3075
[7] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8] Diversity techniques improve the performance of the best imbalance learning ensembles
Diez-Pastor, Jose F.
Rodriguez, Juan J.
Garcia-Osorio, Cesar I.
Kuncheva, Ludmila I.
[J]. INFORMATION SCIENCES, 2015, 325 : 98 - 117
[9] Crowd counting by using multi-level density-based spatial information: A Multi-scale CNN framework
Dong, Li
Zhang, Haijun
Ji, Yuzhu
Ding, Yuxin
[J]. INFORMATION SCIENCES, 2020, 528 (528) : 79 - 91
[10] HaoWu Gareth JF Jones, 2018, TRECVID

← 1 2 3 4 5 →