Learning more discriminative clues with gradual attention for fine-grained visual categorization

Cited by: 1
Authors
Xu, Qin [1, 2]
Zhang, Mengquan [1, 2]
Li, Yun [1, 2]
Tao, Zhifu [3]
Affiliations
[1] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[3] Anhui Univ, Sch Big Data & Stat, Hefei 230601, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Fine-grained visual categorization; Convolutional neural network; Visual attention; Self-calibrated convolution; IMAGE CLASSIFICATION; NETWORK; MODEL; CNN;
DOI
10.1016/j.imavis.2023.104753
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Fine-grained visual categorization, which aims to distinguish subcategories of images within the same category, is very challenging because of large intra-class differences and subtle inter-class variances. Existing methods mostly focus on salient local regions and ignore other features that could help recognize images more precisely. To address this issue, we propose a novel end-to-end network composed of self-calibrated convolution, a gradual attention module, and a feature inverse module for fine-grained visual categorization. To extract salient features, we exploit self-calibrated convolution, which can avoid the influence of irrelevant information and locate salient regions more accurately. To extract discriminative features, we propose the gradual attention module, which consists of alternating channel-spatial attention and hierarchical feature grouping. This module can extract subtle discriminative features gradually, even when the semantic information of shallow stages is not rich. Moreover, we design the feature inverse module, which forces the next stage of the network to search for other useful features by feature inverse. Combined with the feature inverse module, the gradual attention module is capable of finding more detailed regions, which helps improve classification performance. Finally, the stage features and fused features are jointly used for classification. The proposed method is evaluated on three classical fine-grained image datasets and compared with a number of state-of-the-art methods. Our method achieves accuracies of 89.5%, 95.2%, and 93.9% on the CUB-200-2011, Stanford Cars, and FGVC-Aircraft datasets, respectively. The experimental results demonstrate the effectiveness and superiority of the proposed method.
Pages: 14