Comparison of Different Image Data Augmentation Approaches

被引:46
作者
Nanni, Loris [1 ]
Paci, Michelangelo [2 ]
Brahnam, Sheryl [3 ]
Lumini, Alessandra [4 ]
机构
[1] Univ Padua, Dept Informat Engn, Via Gradenigo 6, I-35131 Padua, Italy
[2] Tampere Univ, Fac Med & Hlth Technol, BioMediTech, Arvo Ylpon Katu 34, FI-33520 Tampere, Finland
[3] Missouri State Univ, Comp Informat Syst, 901 S Natl, Springfield, MO 65804 USA
[4] Univ Bologna, Dipartimento Informat Sci & Ingn DISI, Via Univ 50, I-47521 Cesena, Italy
关键词
data augmentation; deep learning; convolutional neural networks; ensemble; COLOR; CLASSIFICATION;
D O I
10.3390/jimaging7120254
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Convolutional neural networks (CNNs) have gained prominence in the research literature on image classification over the last decade. One shortcoming of CNNs, however, is their lack of generalizability and tendency to overfit when presented with small training sets. Augmentation directly confronts this problem by generating new data points providing additional information. In this paper, we investigate the performance of more than ten different sets of data augmentation methods, with two novel approaches proposed here: one based on the discrete wavelet transform and the other on the constant-Q Gabor transform. Pretrained ResNet50 networks are finetuned on each augmentation method. Combinations of these networks are evaluated and compared across four benchmark data sets of images representing diverse problems and collected by instruments that capture information at different scales: a virus data set, a bark data set, a portrait dataset, and a LIGO glitches data set. Experiments demonstrate the superiority of this approach. The best ensemble proposed in this work achieves state-of-the-art (or comparable) performance across all four data sets. This result shows that varying data augmentation is a feasible way for building an ensemble of classifiers for image classification.
引用
收藏
页数:13
相关论文
共 57 条
  • [1] [Anonymous], 2019, ARXIV PREPRINT ARXIV
  • [2] Backes AR, 2020, INT CONF SYST SIGNAL, P290, DOI [10.1109/iwssip48289.2020.9145325, 10.1109/IWSSIP48289.2020.9145325]
  • [3] Color texture analysis based on fractal descriptors
    Backes, Andre Ricardo
    Casanova, Dalcimar
    Bruno, Odemir Martinez
    [J]. PATTERN RECOGNITION, 2012, 45 (05) : 1984 - 1992
  • [4] Machine learning for Gravity Spy: Glitch classification and dataset
    Bahaadini, S.
    Noroozi, V.
    Rohani, N.
    Coughlin, S.
    Zevin, M.
    Smith, J. R.
    Kalogera, V.
    Katsaggelos, A.
    [J]. INFORMATION SCIENCES, 2018, 444 : 172 - 186
  • [5] Performance analysis of colour descriptors for parquet sorting
    Bianconi, Francesco
    Fernandez, Antonio
    Gonzalez, Elena
    Saetta, Stefano A.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) : 1636 - 1644
  • [6] A set of statistical radial binary patterns for tree species identification based on bark images
    Boudra, Safia
    Yahiaoui, Itheri
    Behloul, Ali
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22373 - 22404
  • [7] Class-specific material categorisation
    Caputo, B
    Hayman, E
    Mallikarjuna, P
    [J]. TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1597 - 1604
  • [8] Carpentier M, 2018, IEEE INT C INT ROBOT, P1075, DOI 10.1109/IROS.2018.8593514
  • [9] Chatfield K., 2014, ARXIV
  • [10] A review of medical image data augmentation techniques for deep learning applications
    Chlap, Phillip
    Min, Hang
    Vandenberg, Nym
    Dowling, Jason
    Holloway, Lois
    Haworth, Annette
    [J]. JOURNAL OF MEDICAL IMAGING AND RADIATION ONCOLOGY, 2021, 65 (05) : 545 - 563