Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization

被引:15
|
作者
Xu, Kunran [1 ]
Lai, Rui [1 ]
Gu, Lin [2 ]
Li, Yishi [1 ]
机构
[1] Xidian Univ, Sch Microelect, Xian 710071, Peoples R China
[2] RIKEN, Ctr Adv Intelligence Project, Tokyo 1030027, Japan
基金
日本科学技术振兴机构; 国家重点研发计划;
关键词
Manifolds; Visualization; Training; Testing; Spatial resolution; Computational modeling; Standards; Fine-grained visual categorization (FGVC); knowledge distillation; mixup;
D O I
10.1109/TNNLS.2021.3112768
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization (FGVC) is a challenging task because there are many hard examples existing between fine-grained classes which differ subtly in particular local regions. To address this issue, many methods have recourse to high-resolution source images and others adopt effective regularization like ``mixup'' or ``between class learning.'' Despite their promising achievements, mixup tends to cause the manifold intrusion problem which would result in under-fitting and degradation of the model performance and high-resolution input inevitably leads to high computational costs. In view of this, we present a multiresolution discriminative mixup network (MRDMN). Different from standard mixup, the proposed discriminative mixup strategy mixes discriminative regions linearly instead of entire images to avoid manifold intrusion, which makes it learn the local detail features more effectively and contributes to more precise categorization. Furthermore, an innovative resolution-based distillation strategy is designed to transfer the multiresolution detail feature representations to a low-resolution network, which speeds up the testing and boosts the categorization accuracy simultaneously. Extensive experiments demonstrate that our proposed MRDMN remarkably outperforms most competitive approaches with less computation time on the CUB-200-2011, Stanford-Cars, Stanford-Dogs, Food-101, and iNaturalist 2017 datasets. The codes are in https://github.com/aztc/MRDMN.
引用
收藏
页码:3488 / 3500
页数:13
相关论文
共 50 条
  • [1] Discriminative Suprasphere Embedding for Fine-Grained Visual Categorization
    Ye, Shuo
    Peng, Qinmu
    Sun, Wenju
    Xu, Jiamiao
    Wang, Yu
    You, Xinge
    Cheung, Yiu-Ming
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5092 - 5102
  • [2] DSP: Discriminative Spatial Part modeling for Fine-Grained Visual Categorization
    Yao, Hantao
    Zhang, Dongming
    Li, Jintao
    Zhou, Jianshe
    Zhang, Shiliang
    Zhang, Yongdong
    IMAGE AND VISION COMPUTING, 2017, 63 : 24 - 37
  • [3] Alignment Enhancement Network for Fine-grained Visual Categorization
    Hu, Yutao
    Liu, Xuhui
    Zhang, Baochang
    Han, Jungong
    Cao, Xianbin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
  • [4] Learning more discriminative clues with gradual attention for fine-grained visual categorization
    Xu, Qin
    Zhang, Mengquan
    Li, Yun
    Tao, Zhifu
    IMAGE AND VISION COMPUTING, 2023, 136
  • [5] Multiscale attention dynamic aware network for fine-grained visual categorization
    Ou, Jichu
    Li, Wanyi
    Huang, Jingmin
    Huang, Xiaojie
    Xie, Xuan
    ELECTRONICS LETTERS, 2023, 59 (01)
  • [6] PFNet: a novel part fusion network for fine-grained visual categorization
    Jingyun Liang
    Jinlin Guo
    Yanming Guo
    Songyang Lao
    Multimedia Tools and Applications, 2020, 79 : 33397 - 33416
  • [7] Increasingly Specialized Generative Adversarial Network for fine-grained visual categorization
    Lin, Zhongqi
    Gao, Wanlin
    Huang, Feng
    Jia, Jingdun
    KNOWLEDGE-BASED SYSTEMS, 2021, 232
  • [8] PFNet: a novel part fusion network for fine-grained visual categorization
    Liang, Jingyun
    Guo, Jinlin
    Guo, Yanming
    Lao, Songyang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 33397 - 33416
  • [9] Feathers Dataset for Fine-Grained Visual Categorization
    Belko, Alina
    Dobratulin, Konstantin
    Kuznetsov, Andrey
    THIRTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2020), 2021, 11605
  • [10] Which and How Many Regions to Gaze: Focus Discriminative Regions for Fine-Grained Visual Categorization
    He, Xiangteng
    Peng, Yuxin
    Zhao, Junjie
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (09) : 1235 - 1255