Varied Image Data Augmentation Methods for Building Ensemble

被引:18
作者
Bravin, Riccardo [1 ]
Nanni, Loris [2 ]
Loreggia, Andrea [3 ]
Brahnam, Sheryl [4 ]
Paci, Michelangelo [5 ]
机构
[1] Politecn Milan, Dept Elect Informat & Bioengn, I-20133 Milan, Italy
[2] Univ Padua, Dept Informat Engn DEI, I-35122 Padua, Italy
[3] Univ Brescia, Dept Informat Engn DII, I-25121 Brescia, Italy
[4] Missouri State Univ, Informat Technol & Cybersecur, Springfield, MO 65804 USA
[5] Tampere Univ, Fac Med & Hlth Technol, BioMediTech, Tampere 33520, Finland
关键词
Convolutional neural networks; data augmentation; ensemble; MICROSCOPE IMAGES; CLASSIFICATION;
D O I
10.1109/ACCESS.2023.3239816
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks (CNNs) are used in many domains but the requirement of large datasets for robust training sessions and no overfitting makes them hard to apply in medical fields and similar fields. However, when large quantities of samples cannot be easily collected, various methods can still be applied to stem the problem depending on the sample type. Data augmentation, rather than other methods, has recently been under the spotlight mostly because of the simplicity and effectiveness of some of the more adopted methods. The research question addressed in this work is whether data augmentation techniques can help in developing robust and efficient machine learning systems to be used in different domains for classification purposes. To do that, we introduce new image augmentation techniques that make use of different methods like Fourier Transform (FT), Discrete Cosine Transform (DCT), Radon Transform (RT), Hilbert Transform (HT), Singular Value Decomposition (SVD), Local Laplacian Filters (LLF) and Hampel filter (HF). We define different ensemble methods by combining various classical data augmentation methods with the newer ones presented here. We performed an extensive empirical evaluation on 15 different datasets to validate our proposal. The obtained results show that the newly proposed data augmentation methods can be very effective even when used alone. The ensembles trained with different augmentations methods can outperform some of the best approaches reported in the literature as well as compete with state-of-the-art custom methods. All resources are available at https://github.com/LorisNanni.
引用
收藏
页码:8810 / 8823
页数:14
相关论文
共 77 条
[1]   A dataset of microscopic peripheral blood cell images for development of automatic recognition systems [J].
Acevedo, Andrea ;
Merino, Anna ;
Alferez, Santiago ;
Molina, Angel ;
Boldu, Laura ;
Rodellar, Jose .
DATA IN BRIEF, 2020, 30
[2]  
[Anonymous], 2018, arxiv preprint arxiv:1801.02929
[3]  
[Anonymous], 2014, Comput. Sci.
[4]   Crop pest classification with a genetic algorithm-based weighted ensemble of deep convolutional neural networks [J].
Ayan, Enes ;
Erbay, Hasan ;
Varcin, Fatih .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 179
[5]  
Backes AR, 2020, INT CONF SYST SIGNAL, P290, DOI [10.1109/iwssip48289.2020.9145325, 10.1109/IWSSIP48289.2020.9145325]
[6]   Machine learning for Gravity Spy: Glitch classification and dataset [J].
Bahaadini, S. ;
Noroozi, V. ;
Rohani, N. ;
Coughlin, S. ;
Zevin, M. ;
Smith, J. R. ;
Kalogera, V. ;
Katsaggelos, A. .
INFORMATION SCIENCES, 2018, 444 :172-186
[7]   A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of HeLa cells [J].
Boland, MV ;
Murphy, RF .
BIOINFORMATICS, 2001, 17 (12) :1213-1223
[8]   Tree trunk texture classification using multi-scale statistical macro binary patterns and CNN [J].
Boudra, Safia ;
Yahiaoui, Itheri ;
Behloul, Ali .
APPLIED SOFT COMPUTING, 2022, 118
[9]   A set of statistical radial binary patterns for tree species identification based on bark images [J].
Boudra, Safia ;
Yahiaoui, Itheri ;
Behloul, Ali .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) :22373-22404
[10]  
Carpentier M, 2018, IEEE INT C INT ROBOT, P1075, DOI 10.1109/IROS.2018.8593514