A new image classification method using CNN transfer learning and web data augmentation

被引:315
作者
Han, Dongmei [1 ,2 ]
Liu, Qigang [1 ]
Fan, Weiguo [3 ]
机构
[1] ShangHai Univ Finance & Econ, Sch Informat Management & Engn, 777 Guoding Rd, Shanghai 200433, Peoples R China
[2] Shanghai Key Lab Financial Informat Technol, 777 Guoding Rd, Shanghai 200433, Peoples R China
[3] Virginia Tech, Accounting & Informat Syst Dept, 3007 Pamplin Hall, Blacksburg, VA 24061 USA
关键词
Feature transferring; Data augmentation; Convolutional neural network; Feature representation; Parameter fine-tuning; Bayesian optimization;
D O I
10.1016/j.eswa.2017.11.028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since Convolutional Neural Network (CNN) won the image classification competition 202 (ILSVRC12), a lot of attention has been paid to deep layer CNN study. The success of CNN is attributed to its superior multi-scale high-level image representations as opposed to hand-engineering low-level features. However, estimating millions of parameters of a deep CNN requires a large number of annotated samples, which currently prevents many superior deep CNNs (such as AlexNet, VGG, ResNet) being applied to problems with limited training data. To address this problem, a novel two-phase method combining CNN transfer learning and web data augmentation is proposed. With our method, the useful feature presentation of pre-trained network can be efficiently transferred to target task, and the original dataset can be augmented with the most valuable Internet images for classification. Our method not only greatly reduces the requirement of a large training data, but also effectively expand the training dataset. Both of method features contribute to the considerable over-fitting reduction of deep CNNs on small dataset. In addition, we successfully apply Bayesian optimization to solve the tuff problem, hyper-parameter tuning, in network fine-tuning. Our solution is applied to six public small datasets. Extensive experiments show that, comparing to traditional methods, our solution can assist the popular deep CNNs to achieve better performance. Particularly, ResNet can outperform all the state-of-the-art models on six small datasets. The experiment results prove that the proposed solution will be the great tool for dealing with practice problems which are related to use deep CNNs on small dataset. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:43 / 56
页数:14
相关论文
共 50 条
  • [11] Chai YN, 2011, IEEE I CONF COMP VIS, P2579, DOI 10.1109/ICCV.2011.6126546
  • [12] Dai W., 2007, 24 INT C MACHINELEAR, P193, DOI [10.1145/1273496.1273521, DOI 10.1145/1273496.1273521]
  • [13] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [14] DeVries T., 2017, ARXIV170205538
  • [15] Duchi J, 2011, J MACH LEARN RES, V12, P2121
  • [16] Learning Object Categories From Internet Image Searches
    Fergus, Rob
    Fei-Fei, Li
    Perona, Pietro
    Zisserman, Andrew
    [J]. PROCEEDINGS OF THE IEEE, 2010, 98 (08) : 1453 - 1466
  • [17] PATTERN-RECOGNITION BY AFFINE MOMENT INVARIANTS
    FLUSSER, J
    SUK, T
    [J]. PATTERN RECOGNITION, 1993, 26 (01) : 167 - 174
  • [18] Gong YC, 2014, LECT NOTES COMPUT SC, V8695, P392, DOI 10.1007/978-3-319-10584-0_26
  • [19] Generative Adversarial Networks
    Goodfellow, Ian
    Pouget-Abadie, Jean
    Mirza, Mehdi
    Xu, Bing
    Warde-Farley, David
    Ozair, Sherjil
    Courville, Aaron
    Bengio, Yoshua
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
  • [20] Hauagge DC, 2012, PROC CVPR IEEE, P206, DOI 10.1109/CVPR.2012.6247677