Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization

被引：15

作者：

Xu, Kunran ^{[1
]}

Lai, Rui ^{[1
]}

Gu, Lin ^{[2
]}

Li, Yishi ^{[1
]}

机构：

[1] Xidian Univ, Sch Microelect, Xian 710071, Peoples R China

[2] RIKEN, Ctr Adv Intelligence Project, Tokyo 1030027, Japan

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023年 / 34卷 / 07期

基金：

日本科学技术振兴机构; 国家重点研发计划;

关键词：

Manifolds; Visualization; Training; Testing; Spatial resolution; Computational modeling; Standards; Fine-grained visual categorization (FGVC); knowledge distillation; mixup;

D O I：

10.1109/TNNLS.2021.3112768

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Fine-grained visual categorization (FGVC) is a challenging task because there are many hard examples existing between fine-grained classes which differ subtly in particular local regions. To address this issue, many methods have recourse to high-resolution source images and others adopt effective regularization like ``mixup'' or ``between class learning.'' Despite their promising achievements, mixup tends to cause the manifold intrusion problem which would result in under-fitting and degradation of the model performance and high-resolution input inevitably leads to high computational costs. In view of this, we present a multiresolution discriminative mixup network (MRDMN). Different from standard mixup, the proposed discriminative mixup strategy mixes discriminative regions linearly instead of entire images to avoid manifold intrusion, which makes it learn the local detail features more effectively and contributes to more precise categorization. Furthermore, an innovative resolution-based distillation strategy is designed to transfer the multiresolution detail feature representations to a low-resolution network, which speeds up the testing and boosts the categorization accuracy simultaneously. Extensive experiments demonstrate that our proposed MRDMN remarkably outperforms most competitive approaches with less computation time on the CUB-200-2011, Stanford-Cars, Stanford-Dogs, Food-101, and iNaturalist 2017 datasets. The codes are in https://github.com/aztc/MRDMN.

引用

页码：3488 / 3500

页数：13

共 50 条

[1] Discriminative Suprasphere Embedding for Fine-Grained Visual Categorization
Ye, Shuo
Peng, Qinmu
Sun, Wenju
Xu, Jiamiao
Wang, Yu
You, Xinge
Cheung, Yiu-Ming
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5092 - 5102
[2] DSP: Discriminative Spatial Part modeling for Fine-Grained Visual Categorization
Yao, Hantao
Zhang, Dongming
Li, Jintao
Zhou, Jianshe
Zhang, Shiliang
Zhang, Yongdong
IMAGE AND VISION COMPUTING, 2017, 63 : 24 - 37
[3] Alignment Enhancement Network for Fine-grained Visual Categorization
Hu, Yutao
Liu, Xuhui
Zhang, Baochang
Han, Jungong
Cao, Xianbin
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
[4] Learning more discriminative clues with gradual attention for fine-grained visual categorization
Xu, Qin
Zhang, Mengquan
Li, Yun
Tao, Zhifu
IMAGE AND VISION COMPUTING, 2023, 136
[5] Multiscale attention dynamic aware network for fine-grained visual categorization
Ou, Jichu
Li, Wanyi
Huang, Jingmin
Huang, Xiaojie
Xie, Xuan
ELECTRONICS LETTERS, 2023, 59 (01)
[6] PFNet: a novel part fusion network for fine-grained visual categorization
Jingyun Liang
Jinlin Guo
Yanming Guo
Songyang Lao
Multimedia Tools and Applications, 2020, 79 : 33397 - 33416
[7] Increasingly Specialized Generative Adversarial Network for fine-grained visual categorization
Lin, Zhongqi
Gao, Wanlin
Huang, Feng
Jia, Jingdun
KNOWLEDGE-BASED SYSTEMS, 2021, 232
[8] PFNet: a novel part fusion network for fine-grained visual categorization
Liang, Jingyun
Guo, Jinlin
Guo, Yanming
Lao, Songyang
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 33397 - 33416
[9] Feathers Dataset for Fine-Grained Visual Categorization
Belko, Alina
Dobratulin, Konstantin
Kuznetsov, Andrey
THIRTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2020), 2021, 11605
[10] Which and How Many Regions to Gaze: Focus Discriminative Regions for Fine-Grained Visual Categorization
He, Xiangteng
Peng, Yuxin
Zhao, Junjie
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (09) : 1235 - 1255

← 1 2 3 4 5 →