Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition

被引:2
|
作者
Sun, Jiayin [1 ,2 ,3 ]
Wang, Hong [4 ]
Dong, Qiulei [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China
关键词
Transformers; Feature extraction; Task analysis; Image recognition; Training; Visualization; Computer vision; Open-set fine-grained image recognition; hierarchical attention; long-short term memory; TEMPORAL ATTENTION; DIFFICULTY;
D O I
10.1109/TCSVT.2023.3325001
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Triggered by the success of transformers in various visual tasks, the spatial self-attention mechanism has recently attracted more and more attention in the computer vision community. However, we empirically found that a typical vision transformer with the spatial self-attention mechanism could not learn accurate attention maps for distinguishing different categories of fine-grained images. To address this problem, motivated by the temporal attention mechanism in brains, we propose a hierarchical attention network for learning fine-grained feature representations, called HAN, where the features learnt by implementing a sequence of spatial self-attention operations corresponding to multiple moments are aggregated progressively. The proposed HAN consists of four modules: a self-attention backbone module for learning a sequence of features with self-attention operations, a spatial feature self-organizing module for facilitating the model training, a hierarchical aggregation module for aggregating the re-organized features via a Long Short-Term Memory network, and a context-aware module that is implemented as the forget block of the hierarchical aggregation module for preserving/forgetting the long-term memory by utilizing contextual information. Then, we propose a HAN-based method for open-set fine-grained recognition by integrating the proposed HAN network with a linear classifier, called HAN-OSFGR. Extensive experimental results on 3 fine-grained datasets and 2 coarse-grained datasets demonstrate that the proposed HAN-OSFGR outperforms 9 state-of-the-art open-set recognition methods significantly in most cases.
引用
收藏
页码:3891 / 3904
页数:14
相关论文
共 50 条
  • [21] Learning Rich Part Hierarchies With Progressive Attention Networks for Fine-Grained Image Recognition
    Zheng, Heliang
    Fu, Jianlong
    Zha, Zheng-Jun
    Luo, Jiebo
    Mei, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 476 - 488
  • [22] Attention-Guided CutMix Data Augmentation Network for Fine-Grained Bird Recognition
    Guo, Wenming
    Wang, Yifei
    Han, Fang
    PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
  • [23] Wavelet and Adaptive Coordinate Attention Guided Fine-Grained Residual Network for Image Denoising
    Ding, Shifei
    Wang, Qidong
    Guo, Lili
    Li, Xuan
    Ding, Ling
    Wu, Xindong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 6156 - 6166
  • [24] Application of Improved DNN Algorithm Based on Feature Fusion in Fine-Grained Image Recognition
    Zhu, Jiongguang
    Zhang, Wei
    IEEE ACCESS, 2024, 12 (32140-32151) : 32140 - 32151
  • [25] Fine-grained Recognition of Chinese Food Image Based on DenseNet with Attention Mechanism
    Hao, Ran
    Gao, Weidong
    Mi, Jihang
    Zhao, Zhenwei
    TWELFTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2020), 2021, 11720
  • [26] Visual Attention Focusing on Fine-Grained Foreground and Eliminating Background Bias for Pest Image Identification
    Xu, Xinyuan
    Li, Heng
    Gao, Qi
    Zhou, Meixuan
    Meng, Tianyue
    Yin, Liping
    Chai, Xinyu
    IEEE ACCESS, 2024, 12 : 161732 - 161741
  • [27] Food and Ingredient Joint Learning for Fine-Grained Recognition
    Liu, Chengxu
    Liang, Yuanzhi
    Xue, Yao
    Qian, Xueming
    Fu, Jianlong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2480 - 2493
  • [28] Incremental Learning With Open-Set Recognition for Remote Sensing Image Scene Classification
    Liu, Weiwei
    Nie, Xiangli
    Zhang, Bo
    Sun, Xian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [29] Regional Attention Network (RAN) for Head Pose and Fine-Grained Gesture Recognition
    Behera, Ardhendu
    Wharton, Zachary
    Liu, Yonghuai
    Ghahremani, Morteza
    Kumar, Swagat
    Bessis, Nik
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (01) : 549 - 562
  • [30] Incremental Learning for Fine-Grained Image Recognition
    Cao, Liangliang
    Hsiao, Jenhao
    de Juan, Paloma
    Li, Yuncheng
    Thomee, Bart
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 363 - 366