Occlusions for Effective Data Augmentation in Image Classification

被引:9
作者
Fong, Ruth C. [1 ,3 ]
Vedaldi, Andrea [2 ]
机构
[1] Univ Oxford, Oxford, England
[2] Facebook AI Res, Menlo Pk, CA USA
[3] FAIR, Menlo Pk, CA USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW) | 2019年
关键词
D O I
10.1109/ICCVW.2019.00511
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep networks for visual recognition are known to leverage "easy to recognise" portions of objects such as faces and distinctive texture patterns. The lack of a holistic understanding of objects may increase fragility and overfitting. In recent years, several papers have proposed to address this issue by means of occlusions as a form of data augmentation. However, successes have been limited to tasks such as weak localization and model interpretation, but no benefit was demonstrated on image classification on large-scale datasets. In this paper, we show that, by using a simple technique based on batch augmentation, occlusions as data augmentation can result in better performance on ImageNet for high-capacity models (e.g., ResNet50). We also show that varying amounts of occlusions used during training can be used to study the robustness of different neural network architectures.
引用
收藏
页码:4158 / 4166
页数:9
相关论文
共 24 条
[1]  
[Anonymous], 2016, RETHINKING INCEPTION
[2]  
[Anonymous], 2018, DISTILL
[3]  
[Anonymous], 2014, JMLR
[4]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.90
[5]  
Devries T., 2017, ABS17080 CORR
[6]  
Fong R., 2017, P CVPR
[7]  
Ghiasi G., 2018, P NEURIPS
[8]  
Hinton G., 2014, CIFAR-10
[9]  
Hoffer E., 2019, ARXIV190109335
[10]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90