Learning more discriminative clues with gradual attention for fine-grained visual categorization

被引:1
作者
Xu, Qin [1 ,2 ]
Zhang, Mengquan [1 ,2 ]
Li, Yun [1 ,2 ]
Tao, Zhifu [3 ]
机构
[1] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[3] Anhui Univ, Sch Big Data & Stat, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
Fine-grained visual categorization; Convolutional neural network; Visual attention; Self -calibrated convolution; IMAGE CLASSIFICATION; NETWORK; MODEL; CNN;
D O I
10.1016/j.imavis.2023.104753
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization, which aims to identify the different subcategories of images within the same category, is a very challenging task due to the large intra-class differences and subtle inter-class variances. The existing methods mostly focus on the salient local regions and ignore other features which probably help to recognize the images more precisely. To address this issue, in this paper, we propose a novel end-to-end network composed of the self-calibrated convolution, gradual attention module and feature inverse module for fine-grained visual categorization. To extract the salient features, the self-calibrated convolution is exploited which can avoid the influence of irrelevant information and locate salient regions more accurately. In aiming to extract the discriminative features, we propose the gradual attention module which consists of alternate channel-spatial attention and hierarchical feature grouping. The gradual attention module can extract the subtle discriminative features gradually even when the semantic information of shallow stages is not rich. Moreover, we design the feature inverse module which forces the next stage of network to search for other different useful features by feature inverse. The gradual attention module combined with the feature inverse module is capable of finding more detailed regions and of benefit to improving classification performance. Finally, the stage features and fused features are jointly used for classification. The proposed method is evaluated on three classical fine-grained image datasets and compared with a number of state-of-the-art methods. Our method achieves 89.5%, 95.2% and 93.9% accuracies on CUB-200-2011, Stanford Cars and FGVC-Aircraft datasets respectively. The experimental results demonstrate the effectiveness and superiority of the proposed method.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] A collaborative gated attention network for fine-grained visual classification
    Zhu, Qiangxi
    Kuang, Wenlan
    Li, Zhixin
    DISPLAYS, 2023, 79
  • [42] Diversified Visual Attention Networks for Fine-Grained Object Classification
    Zhao, Bo
    Wu, Xiao
    Feng, Jiashi
    Peng, Qiang
    Yan, Shuicheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (06) : 1245 - 1256
  • [43] A Novel Smart Lightweight Visual Attention Model for Fine-Grained Vehicle Recognition
    Boukerche, Azzedine
    Ma, Xiren
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 13846 - 13862
  • [44] Fine-Grained Visual Categorization Using Meta-learning Optimization with Sample Selection of Auxiliary Data
    Zhang, Yabin
    Tang, Hui
    Jia, Kui
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 241 - 256
  • [45] Fine-Grained Visual-Textual Representation Learning
    He, Xiangteng
    Peng, Yuxin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (02) : 520 - 531
  • [46] Posture-guided part learning for fine-grained image categorization
    Song, Wei
    Chen, Dongmei
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03) : 33013
  • [47] Hierarchical deep transfer learning for fine-grained categorization on micro datasets
    Wang, Ronggui
    Yao, Xuchen
    Yang, Juan
    Xue, Lixia
    Hu, Min
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 : 129 - 139
  • [48] Searching and Learning Discriminative Regions for Fine-Grained Image Retrieval and Classification
    Sun, Kangbo
    Zhu, Jie
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (01) : 141 - 149
  • [49] The Image Data and Backbone in Weakly Supervised Fine-Grained Visual Categorization: A Revisit and Further Thinking
    Ye, Shuo
    Wang, Yu
    Peng, Qinmu
    You, Xinge
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 2 - 16
  • [50] Global Information-Assisted Fine-Grained Visual Categorization in Internet of Things
    Li, Ang
    Kang, Bin
    Chen, Jianxin
    Wu, Dan
    Zhou, Liang
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (01) : 940 - 952