A new image classification method using CNN transfer learning and web data augmentation

被引：315

作者：

Han, Dongmei ^{[1
,2
]}

Liu, Qigang ^{[1
]}

Fan, Weiguo ^{[3
]}

机构：

[1] ShangHai Univ Finance & Econ, Sch Informat Management & Engn, 777 Guoding Rd, Shanghai 200433, Peoples R China

[2] Shanghai Key Lab Financial Informat Technol, 777 Guoding Rd, Shanghai 200433, Peoples R China

[3] Virginia Tech, Accounting & Informat Syst Dept, 3007 Pamplin Hall, Blacksburg, VA 24061 USA

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2018年 / 95卷

关键词：

Feature transferring; Data augmentation; Convolutional neural network; Feature representation; Parameter fine-tuning; Bayesian optimization;

D O I：

10.1016/j.eswa.2017.11.028

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Since Convolutional Neural Network (CNN) won the image classification competition 202 (ILSVRC12), a lot of attention has been paid to deep layer CNN study. The success of CNN is attributed to its superior multi-scale high-level image representations as opposed to hand-engineering low-level features. However, estimating millions of parameters of a deep CNN requires a large number of annotated samples, which currently prevents many superior deep CNNs (such as AlexNet, VGG, ResNet) being applied to problems with limited training data. To address this problem, a novel two-phase method combining CNN transfer learning and web data augmentation is proposed. With our method, the useful feature presentation of pre-trained network can be efficiently transferred to target task, and the original dataset can be augmented with the most valuable Internet images for classification. Our method not only greatly reduces the requirement of a large training data, but also effectively expand the training dataset. Both of method features contribute to the considerable over-fitting reduction of deep CNNs on small dataset. In addition, we successfully apply Bayesian optimization to solve the tuff problem, hyper-parameter tuning, in network fine-tuning. Our solution is applied to six public small datasets. Extensive experiments show that, comparing to traditional methods, our solution can assist the popular deep CNNs to achieve better performance. Particularly, ResNet can outperform all the state-of-the-art models on six small datasets. The experiment results prove that the proposed solution will be the great tool for dealing with practice problems which are related to use deep CNNs on small dataset. (C) 2017 Elsevier Ltd. All rights reserved.

引用

页码：43 / 56

页数：14

共 50 条

[11] Chai YN, 2011, IEEE I CONF COMP VIS, P2579, DOI 10.1109/ICCV.2011.6126546
[12] Dai W., 2007, 24 INT C MACHINELEAR, P193, DOI [10.1145/1273496.1273521, DOI 10.1145/1273496.1273521]
[13] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[14] DeVries T., 2017, ARXIV170205538
[15] Duchi J, 2011, J MACH LEARN RES, V12, P2121
[16] Learning Object Categories From Internet Image Searches
Fergus, Rob
Fei-Fei, Li
Perona, Pietro
Zisserman, Andrew
[J]. PROCEEDINGS OF THE IEEE, 2010, 98 (08) : 1453 - 1466
[17] PATTERN-RECOGNITION BY AFFINE MOMENT INVARIANTS
FLUSSER, J
SUK, T
[J]. PATTERN RECOGNITION, 1993, 26 (01) : 167 - 174
[18] Gong YC, 2014, LECT NOTES COMPUT SC, V8695, P392, DOI 10.1007/978-3-319-10584-0_26
[19] Generative Adversarial Networks
Goodfellow, Ian
Pouget-Abadie, Jean
Mirza, Mehdi
Xu, Bing
Warde-Farley, David
Ozair, Sherjil
Courville, Aaron
Bengio, Yoshua
[J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
[20] Hauagge DC, 2012, PROC CVPR IEEE, P206, DOI 10.1109/CVPR.2012.6247677

← 1 2 3 4 5 →