A weakly supervised spatial group attention network for fine-grained visual recognition

Cited by: 8
Authors
Xie, Jiangjian [1, 2, 3]
Zhong, Yujie [1]
Zhang, Junguo [1, 2]
Zhang, Changchun [1, 2]
Schuller, Bjoern W. [3, 4, 5]
Affiliations
[1] Beijing Forestry Univ, Sch Technol, Beijing 100083, Peoples R China
[2] Beijing Forestry Univ, Res Ctr Biodivers Intelligent Monitoring, Beijing 100083, Peoples R China
[3] Univ Augsburg, Chair Embedded Intelligence Hlth Care & Wellbeing, D-86159 Augsburg, Germany
[4] Imperial Coll London, GLAM Grp Language Audio & Mus, London SW7 2AZ, England
[5] Univ Augsburg, Ctr Interdisciplinary Hlth Res, D-86159 Augsburg, Germany
Keywords
Classification; Fine-grained image; Bird recognition; Weakly supervised network; Moment exchange; Spatial group attention;
DOI
10.1007/s10489-023-04627-z
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Fine-grained visual recognition aims to classify sub-categories that belong to the same basic-level category. It is highly challenging because variance within a sub-category is large while variance between different sub-categories is small. Previous approaches generally localize the targets or parts first, then determine which sub-category the image belongs to. They depend on target or part annotations, which are labor-intensive to produce and a barrier to practical use. Other methods indirectly extract recognizable areas from the high-level feature maps, ignoring the spatial relationships between the target and its parts, which may cause inaccurate recognition. In this paper, we propose a weakly supervised spatial group attention network (WSSGA-Net) for fine-grained bird recognition. Based on the spatial relationships between the target and its parts, we embed the spatial group attention (SGA) module into the WSSGA-Net to highlight the correct semantic feature regions by establishing a semantic feature space enhancement mechanism. In addition, we apply moment exchange (MoEx) for data augmentation, generating new feature maps by exchanging the feature moments of two input images. Comprehensive experiments indicate that our approach performs significantly better than state-of-the-art approaches on the standard bird image datasets Bird-65 and CUB200-2011 and on the fine-grained dataset Stanford Cars.
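The moment-exchange step mentioned in the abstract can be illustrated with a minimal NumPy sketch. It assumes the moments are the per-position mean and standard deviation taken across channels, as in the commonly described MoEx formulation; the abstract does not state which moments WSSGA-Net exchanges, so the function name `moment_exchange`, the `(C, H, W)` layout, and the `eps` value are illustrative assumptions rather than the authors' implementation.

```python
import numpy as np

def moment_exchange(feat_a, feat_b, eps=1e-5):
    """Hypothetical sketch: give feat_a the feature moments of feat_b.

    feat_a, feat_b: arrays of shape (C, H, W).
    Assumption: moments are computed across the channel axis at each
    spatial position; the paper may use a different normalization.
    """
    # First and second moments of each feature map, per spatial position.
    mean_a = feat_a.mean(axis=0, keepdims=True)
    std_a = np.sqrt(feat_a.var(axis=0, keepdims=True) + eps)
    mean_b = feat_b.mean(axis=0, keepdims=True)
    std_b = np.sqrt(feat_b.var(axis=0, keepdims=True) + eps)

    # Normalize feat_a with its own moments, then re-inject feat_b's moments.
    normalized_a = (feat_a - mean_a) / std_a
    return normalized_a * std_b + mean_b
```

In training, the mixed feature map would typically be paired with a loss that interpolates between the labels of the two source images; that weighting is likewise not specified in the abstract and is omitted here.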
Pages: 23301-23315
Number of pages: 15
Related papers
50 in total
  • [21] Temporal Contrastive and Spatial Enhancement Coarse Grained Network for Weakly Supervised Group Activity Recognition
    Guo, Jie
    Ge, Yongxin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [22] Weakly Supervised Posture Mining for Fine-grained Classification
    Tang, Zhenchao
    Yang, Hualin
    Chen, Calvin Yu-Chian
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23735 - 23744
  • [23] Progressive learning for weakly supervised fine-grained classification
    Yan, Tiantian
    Wang, Shijie
    Wang, Zhihui
    Li, Haojie
    Luo, Zhongxuan
    SIGNAL PROCESSING, 2020, 171
  • [24] Fine-grained Image Recognition via Attention Interaction and Counterfactual Attention Network
    Huang, Lei
    An, Chen
    Wang, Xiaodong
    Bullock, Leon Bevan
    Wei, Zhiqiang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [25] A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition
    Dakshayani Himabindu D.
    Praveen Kumar S.
    Dakshayani Himabindu, D. (dakshayanihimabindu_d@vnrvjiet.in), 1600, Brno University of Technology (27): : 59 - 67
  • [26] Weakly-Supervised Learning for Fine-Grained Emotion Recognition Using Physiological Signals
    Zhang, Tianyi
    El Ali, Abdallah
    Wang, Chen
    Hanjalic, Alan
    Cesar, Pablo
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2304 - 2322
  • [27] Weakly Supervised Semantic and Attentive Data Mixing Augmentation for Fine-Grained Visual Categorization
    He, Mengqi
    Cheng, Qilong
    Qi, Guanqiu
    IEEE ACCESS, 2022, 10 : 35814 - 35823
  • [28] WEAKLY SUPERVISED DEEP CONVOLUTIONAL NETWORKS FOR FINE-GRAINED OBJECT RECOGNITION IN MULTISPECTRAL IMAGES
    Aygunes, Bulut
    Aksoy, Selim
    Cinbis, Ramazan Gokberk
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 1478 - 1481
  • [29] Progressive Co-Attention Network for Fine-Grained Visual Classification
    Zhang, Tian
    Chang, Dongliang
    Ma, Zhanyu
    Guo, Jun
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [30] Multiscale attention dynamic aware network for fine-grained visual categorization
    Ou, Jichu
    Li, Wanyi
    Huang, Jingmin
    Huang, Xiaojie
    Xie, Xuan
    ELECTRONICS LETTERS, 2023, 59 (01)