Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-grained Recognition

Cited by: 42

Authors:

Huang, Shaoli [1]

Wang, Xinchao [2]

Tao, Dacheng [1,3]

Affiliations:

[1] Univ Sydney, Sydney, NSW, Australia

[2] Natl Univ Singapore, Singapore, Singapore

[3] JD Explore Acad, Beijing, Peoples R China

Source:

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021

DOI:

10.1109/ICCV48922.2021.00066

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Mid-level representation learning for fine-grained recognition is easily dominated by a limited number of highly discriminative patterns, which degrades its robustness and generalization capability. To address this issue, we propose a novel Stochastic Partial Swap (SPS) scheme. Our method swaps partial features element-wise between samples to inject noise during training. It has a regularization effect similar to Dropout, which encourages more neurons to represent the concepts. It also offers two further advantages: 1) suppressing over-activation on some part patterns to improve feature representativeness, and 2) enriching pattern combinations and simulating noisy cases to enhance classifier generalization. We verify the effectiveness of our approach through comprehensive experiments across four network backbones and three fine-grained datasets. Moreover, we demonstrate its ability to complement high-level representations, allowing a simple model to achieve performance comparable to top-performing methods in fine-grained recognition, indoor scene recognition, and material recognition while improving model interpretability.
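The element-wise swapping described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `stochastic_partial_swap`, the `swap_ratio` parameter, the per-element Bernoulli mask, and the rule for choosing a partner sample are all assumptions made for illustration, since this record does not give those details. Each sample takes a random subset of feature elements from one other sample in the batch, which is one simple reading of "swapping partial features between samples".

```python
import random


def stochastic_partial_swap(features, swap_ratio=0.3, rng=None):
    """Return a noisy copy of a batch of feature vectors.

    features:   list of equal-length lists of floats, one per sample.
    swap_ratio: hypothetical parameter (an assumption, not from the paper);
                the probability that a given element is taken from the
                partner sample instead of the original sample.
    rng:        optional random.Random instance for reproducibility.
    """
    rng = rng or random.Random()
    n = len(features)
    if n < 2:
        # Nothing to swap with; return an unmodified copy.
        return [list(vec) for vec in features]

    noisy = []
    for i, vec in enumerate(features):
        # Assumed partner rule: pick a different sample uniformly at random.
        j = rng.randrange(n - 1)
        if j >= i:
            j += 1
        partner = features[j]
        # Element-wise choice: partner's value with probability swap_ratio.
        noisy.append([
            partner[k] if rng.random() < swap_ratio else vec[k]
            for k in range(len(vec))
        ])
    return noisy


if __name__ == "__main__":
    batch = [[1.0, 2.0, 3.0], [10.0, 20.0, 30.0], [100.0, 200.0, 300.0]]
    print(stochastic_partial_swap(batch, swap_ratio=0.5, rng=random.Random(0)))
```

In this sketch the swap would be applied only during training, in the same way Dropout is disabled at inference time; the input batch is left unmodified and a new list is returned.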

Pages: 600-609

Page count: 10
