Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-grained Recognition

被引：42

作者：

Huang, Shaoli ^{[1
]}

Wang, Xinchao ^{[2
]}

Tao, Dacheng ^{[1
,3
]}

机构：

[1] Univ Sydney, Sydney, NSW, Australia

[2] Natl Univ Singapore, Singapore, Singapore

[3] JD Explore Acad, Beijing, Peoples R China

来源：

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年

关键词：

D O I：

10.1109/ICCV48922.2021.00066

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning mid-level representation for fine-grained recognition is easily dominated by a limited number of highly discriminative patterns, degrading its robustness and generalization capability. To this end, we propose a novel Stochastic Partial Swap (SPS)(1) scheme to address this issue. Our method performs element-wise swapping for partial features between samples to inject noise during training. It equips a regularization effect similar to Dropout, which promotes more neurons to represent the concepts. Furthermore, it also exhibits other advantages: 1) suppressing over-activation to some part patterns to improve feature representativeness, and 2) enriching pattern combination and simulating noisy cases to enhance classifier generalization. We verify the effectiveness of our approach through comprehensive experiments across four network backbones and three fine-grained datasets. Moreover, we demonstrate its ability to complement high-level representations, allowing a simple model to achieve performance comparable to the top-performing technologies in fine-grained recognition, indoor scene recognition, and material recognition while improving model interpretability.

引用

页码：600 / 609

页数：10

共 55 条

[1]

[Anonymous], 2013, P ADV NEUR INF PROC

[2]

Bossard L, 2014, LECT NOTES COMPUT SC, V8694, P446, DOI 10.1007/978-3-319-10599-4_29

[3] Destruction and Construction Learning for Fine-grained Image Recognition [J].

Chen, Yue ;

Bai, Yalong ;

Zhang, Wei ;

Mei, Tao .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5152-5161

[4]

Cimpoi M, 2015, PROC CVPR IEEE, P3828, DOI 10.1109/CVPR.2015.7299007

[5] Kernel Pooling for Convolutional Neural Networks [J].

Cui, Yin ;

Zhou, Feng ;

Wang, Jiang ;

Liu, Xiao ;

Lin, Yuanqing ;

Belongie, Serge .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3049-3058

[6] FASON: First and Second Order Information Fusion Network for Texture Recognition [J].

Dai, Xiyang ;

Ng, Joe Yue-Hei ;

Davis, Larry S. .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6100-6108

[7] Selective Sparse Sampling for Fine-grained Image Recognition [J].

Ding, Yao ;

Zhou, Yanzhao ;

Zhu, Yi ;

Ye, Qixiang ;

Jiao, Jianbin .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6598-6607

[8] Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition [J].

Fu, Jianlong ;

Zheng, Heliang ;

Mei, Tao .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4476-4484

[9] Weakly Supervised Complementary Parts Models for Fine-Grained Image Classification from the Bottom Up [J].

Ge, Weifeng ;

Lin, Xiangru ;

Yu, Yizhou .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3029-3038

[10] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

← 1 2 3 4 5 6 →