A weakly supervised spatial group attention network for fine-grained visual recognition

被引:8
|
作者
Xie, Jiangjian [1 ,2 ,3 ]
Zhong, Yujie [1 ]
Zhang, Junguo [1 ,2 ]
Zhang, Changchun [1 ,2 ]
Schuller, Bjoern W. [3 ,4 ,5 ]
机构
[1] Beijing Forestry Univ, Sch Technol, Beijing 100083, Peoples R China
[2] Beijing Forestry Univ, Res Ctr Biodivers Intelligent Monitoring, Beijing 100083, Peoples R China
[3] Univ Augsburg, Chair Embedded Intelligence Hlth Care & Wellbeing, D-86159 Augsburg, Germany
[4] Imperial Coll London, GLAM Grp Language Audio & Mus, London SW7 2AZ, England
[5] Univ Augsburg, Ctr Interdisciplinary Hlth Res, D-86159 Augsburg, Germany
关键词
Classification; Fine-grained image; Bird recognition; Weakly supervised network; Moment exchange; Spatial group attention;
D O I
10.1007/s10489-023-04627-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The fine-grained visual recognition is to classify several sub-categories affiliated to the same basic-level category, which is highly challenging because the same sub-category with large variance and different sub-categories with small variance. Previously approaches generally localize the targets or parts first, then determine which sub-category the image is attached to. They depend on target or part annotations, which are labor-intensive and a barrier to moving towards practical use. Other methods indirectly extract recognizable areas from the high-level feature maps, ignoring the spatial relationships between the target and its parts, which may cause inaccurate recognition. In this paper, we propose a weakly supervised spatial group attention network (WSSGA-Net) for fine-grained bird recognition. According to the spatial relationships between the target and its parts, we embed the spatial group attention (SGA) module into the WSSGA-Net to highlight the correct semantic feature regions by establishing a semantic feature space enhancement mechanism. In addition, we apply moment exchange (MoEx) to generate new feature maps by exchanging two input image feature moments for data augmentation. Comprehensive experiments indicate that our approach significantly has a better performance than the state-of-the-art approaches on the standard bird image datasets Bird-65, CUB200-2011 and fine-grained dataset Stanford Cars.
引用
收藏
页码:23301 / 23315
页数:15
相关论文
共 50 条
  • [1] A weakly supervised spatial group attention network for fine-grained visual recognition
    Jiangjian Xie
    Yujie Zhong
    Junguo Zhang
    Changchun Zhang
    Björn W Schuller
    Applied Intelligence, 2023, 53 : 23301 - 23315
  • [2] Weakly Supervised Fine-grained Recognition in a Segmentation-attention Network
    Yu, Nannan
    Zhang, Wenfeng
    Cai, Huanhuan
    ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 324 - 329
  • [3] Weakly supervised fine-grained recognition based on spatial-channel aware attention filters
    Yu, Nannan
    Huang, Lei
    Wei, Zhiqiang
    Zhang, Wenfeng
    Wang, Bin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) : 14409 - 14427
  • [4] Weakly supervised fine-grained recognition based on spatial-channel aware attention filters
    Nannan Yu
    Lei Huang
    Zhiqiang Wei
    Wenfeng Zhang
    Bin Wang
    Multimedia Tools and Applications, 2021, 80 : 14409 - 14427
  • [5] The Weakly Supervised Network of Hierarchical Attention Mechanism for Fine-Grained Classification
    Long, Qian
    Wang, Gaihua
    Qu, Hongwei
    Yao, Jingxuan
    Zhu, Bolun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VII, ICIC 2024, 2024, 14868 : 257 - 265
  • [6] Supervised Spatial Transformer Networks for Attention Learning in Fine-grained Action Recognition
    Liu, Dichao
    Wang, Yu
    Kato, Jien
    VISAPP: PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4, 2019, : 311 - 318
  • [7] AP-CNN: Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification
    Ding, Yifeng
    Ma, Zhanyu
    Wen, Shaoguo
    Xie, Jiyang
    Chang, Dongliang
    Si, Zhongwei
    Wu, Ming
    Ling, Haibin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2826 - 2836
  • [8] Attention-Guided Spatial Transformer Networks for Fine-Grained Visual Recognition
    Liu, Dichao
    Wang, Yu
    Kato, Jien
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2577 - 2586
  • [9] DACBN: Dual attention convolutional broad network for fine-grained visual recognition
    Chen, Tao
    Wang, Lijie
    Liu, Yang
    Yu, Haisheng
    PATTERN RECOGNITION, 2024, 156
  • [10] Weakly supervised instance attention for multisource fine-grained object recognition with an application to tree species classification
    Aygunes, Bulut
    Cinbis, Ramazan Gokberk
    Aksoy, Selim
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 176 : 262 - 274