Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization

被引:19
作者
Xu, Kunran [1 ]
Lai, Rui [1 ]
Gu, Lin [2 ]
Li, Yishi [1 ]
机构
[1] Xidian Univ, Sch Microelect, Xian 710071, Peoples R China
[2] RIKEN, Ctr Adv Intelligence Project, Tokyo 1030027, Japan
基金
日本科学技术振兴机构; 国家重点研发计划;
关键词
Manifolds; Visualization; Training; Testing; Spatial resolution; Computational modeling; Standards; Fine-grained visual categorization (FGVC); knowledge distillation; mixup;
D O I
10.1109/TNNLS.2021.3112768
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization (FGVC) is a challenging task because there are many hard examples existing between fine-grained classes which differ subtly in particular local regions. To address this issue, many methods have recourse to high-resolution source images and others adopt effective regularization like ``mixup'' or ``between class learning.'' Despite their promising achievements, mixup tends to cause the manifold intrusion problem which would result in under-fitting and degradation of the model performance and high-resolution input inevitably leads to high computational costs. In view of this, we present a multiresolution discriminative mixup network (MRDMN). Different from standard mixup, the proposed discriminative mixup strategy mixes discriminative regions linearly instead of entire images to avoid manifold intrusion, which makes it learn the local detail features more effectively and contributes to more precise categorization. Furthermore, an innovative resolution-based distillation strategy is designed to transfer the multiresolution detail feature representations to a low-resolution network, which speeds up the testing and boosts the categorization accuracy simultaneously. Extensive experiments demonstrate that our proposed MRDMN remarkably outperforms most competitive approaches with less computation time on the CUB-200-2011, Stanford-Cars, Stanford-Dogs, Food-101, and iNaturalist 2017 datasets. The codes are in https://github.com/aztc/MRDMN.
引用
收藏
页码:3488 / 3500
页数:13
相关论文
共 52 条
[1]  
Bossard L, 2014, LECT NOTES COMPUT SC, V8694, P446, DOI 10.1007/978-3-319-10599-4_29
[2]  
Branson S., 2014, BMVC
[3]   Symbiotic Segmentation and Part Localization for Fine-Grained Categorization [J].
Chai, Yuning ;
Lempitsky, Victor ;
Zisserman, Andrew .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :321-328
[4]   High Prevalence of Assisted Injection Among Street-Involved Youth in a Canadian Setting [J].
Cheng, Tessa ;
Kerr, Thomas ;
Small, Will ;
Dong, Huiru ;
Montaner, Julio ;
Wood, Evan ;
DeBeck, Kora .
AIDS AND BEHAVIOR, 2016, 20 (02) :377-384
[5]   Attention-based Dropout Layer for Weakly Supervised Object Localization [J].
Choe, Junsuk ;
Shim, Hyunjung .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2214-2223
[6]   Learning a similarity metric discriminatively, with application to face verification [J].
Chopra, S ;
Hadsell, R ;
LeCun, Y .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546
[7]  
Fei-Fei, 2011, P 1 WORKSH FINEGRAIN
[8]   Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition [J].
Fu, Jianlong ;
Zheng, Heliang ;
Mei, Tao .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4476-4484
[9]   Compact Bilinear Pooling [J].
Gao, Yang ;
Beijbom, Oscar ;
Zhang, Ning ;
Darrell, Trevor .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :317-326
[10]   Fine-Grained Categorization by Alignments [J].
Gavves, E. ;
Fernando, B. ;
Snoek, C. G. M. ;
Smeulders, A. W. M. ;
Tuytelaars, T. .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1713-1720