Commonsense Oriented Fine-Grained Data Augmentation

被引:0
作者
Li, Huachao [1 ]
Kang, Bin [1 ]
Wang, Lei [2 ]
机构
[1] School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing
[2] School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing
关键词
commonsense map; data augmentation; multi-branch convolutional neural network;
D O I
10.3778/j.issn.1002-8331.2210-0361
中图分类号
学科分类号
摘要
The representative researches on data augmentation are mainly carried out on common classification benchmark datasets such as ImageNet. Considering intra- class and inter- class relation in fine- grained visual classification (FGVC) datasets is so different from ordinary classification datasets, data augmentation methods for FGVC need to be further studied. Therefore, this paper proposes a fine- grained semantic image patch mixing method by commonsense (ComSipmix), starting from the fine-grained recognition task and the special properties of the dataset. The proposed method exploits common sense knowledge to mine potential associations between sample labels, and designs a multi-branch convolutional neural network structure for structured image mixing strategy based on this, so that the image mixing process pays more attention to the subtle differences of targets. Through extensive performance tests, it can be verified that the performance of the proposed method is significantly better than the mainstream image mixing-based data augmentation methods. At the same time, through experimental verification, the common sense knowledge proposed in this paper helps to improve the performance of various data augmentation models based on mixed image classes. © 2017 Editorial Office of Tunnel Construction. All rights reserved.
引用
收藏
页码:214 / 221
页数:7
相关论文
共 35 条
[1]  
ALBAWI S, MOHAMMED T A, AL-ZAWI S., Unders tanding of a convolutional neural network, Proceedings of the 2017 International Conference on Engineering and Technology (ICET), pp. 1-6, (2017)
[2]  
MIKOLAJCZYK A, GROCHOWSKI M., Data augmentation for improving deep learning in image classification problem, Proceedings of the 2018 International Interdisciplinary PhD Workshop (IIPhDW), pp. 117-122, (2018)
[3]  
ZHANG H Y, CISSE M, DAUPHIN Y N, Et al., Mixup: beyond empirical risk minimization, (2017)
[4]  
YUN S, HAN D, CHUN S, Et al., CutMix: regularization strategy to train strong classifiers with localizable features, Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 6022-6031, (2019)
[5]  
HUANG S L, WANG X Z, TAO D C., SnapMix: semantically proportional mixing for augmenting fine- grained data, Proceedings of the AAAI Conference on Artificial Intelligence, pp. 1628-1636, (2021)
[6]  
ZHANG N, DONAHUE J, GIRSHICK R, Et al., Part-based R-CNNs for fine-grained category detection, Proceedings of the European Conference on Computer Vision (ECCV), pp. 834-849, (2014)
[7]  
LIN T Y, ROYCHOWDHURY A, MAJI S., Bilinear CNN models for fine-grained visual recognition, Proceedings of the 2014 IEEE International Conference on Computer Vision (ICCV), pp. 1449-1457, (2014)
[8]  
ZHANG X P, XIONG H K, ZHOU W G, Et al., Picking deep filter responses for fine- grained image recognition, Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1134-1142, (2016)
[9]  
GAO Y, HAN X T, WANG X, Et al., Channel interaction networks for fine-grained image categorization, Proceedings of the AAAI Conference on Artificial Intelligence, pp. 10818-10825, (2020)
[10]  
WANG Z H, WANG S J, LI H J, Et al., Graph-propagation based correlation learning for weakly supervised fine-grained image classification, Proceedings of the AAAI Conference on Artificial Intelligence, pp. 12289-12296, (2020)