Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition

Cited by: 2
Authors
Sun, Jiayin [1 ,2 ,3 ]
Wang, Hong [4 ]
Dong, Qiulei [1 ,2 ,3 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China
Keywords
Transformers; Feature extraction; Task analysis; Image recognition; Training; Visualization; Computer vision; Open-set fine-grained image recognition; hierarchical attention; long-short term memory; TEMPORAL ATTENTION; DIFFICULTY;
DOI
10.1109/TCSVT.2023.3325001
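Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code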
0808 ; 0809 ;
Abstract
Triggered by the success of transformers in various visual tasks, the spatial self-attention mechanism has recently attracted more and more attention in the computer vision community. However, we empirically found that a typical vision transformer with the spatial self-attention mechanism could not learn accurate attention maps for distinguishing different categories of fine-grained images. To address this problem, motivated by the temporal attention mechanism in brains, we propose a hierarchical attention network for learning fine-grained feature representations, called HAN, where the features learnt by implementing a sequence of spatial self-attention operations corresponding to multiple moments are aggregated progressively. The proposed HAN consists of four modules: a self-attention backbone module for learning a sequence of features with self-attention operations, a spatial feature self-organizing module for facilitating the model training, a hierarchical aggregation module for aggregating the re-organized features via a Long Short-Term Memory network, and a context-aware module that is implemented as the forget block of the hierarchical aggregation module for preserving/forgetting the long-term memory by utilizing contextual information. Then, we propose a HAN-based method for open-set fine-grained recognition by integrating the proposed HAN network with a linear classifier, called HAN-OSFGR. Extensive experimental results on 3 fine-grained datasets and 2 coarse-grained datasets demonstrate that the proposed HAN-OSFGR outperforms 9 state-of-the-art open-set recognition methods significantly in most cases.
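The aggregation idea in the abstract can be illustrated with a toy sketch. This is not the authors' HAN implementation: the gates below are parameter-free functions of a scalar context summary (an assumption made purely for illustration, whereas the paper's modules are learned), the function names `aggregate_features` and `classify_open_set` are hypothetical, and the max-softmax threshold used to reject unknown classes is a common open-set baseline swapped in here, since the abstract does not state the rejection rule used by HAN-OSFGR.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def softmax(logits):
    # Subtract the maximum for numerical stability.
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_features(feature_sequence):
    """Progressively aggregate per-moment feature vectors, LSTM style.

    Assumption: every gate is a fixed function of the mean of the current
    feature vector (a stand-in for learned, context-aware gates).
    """
    dim = len(feature_sequence[0])
    cell = [0.0] * dim    # long-term memory
    hidden = [0.0] * dim  # aggregated representation so far
    for feat in feature_sequence:
        context = sum(feat) / dim
        forget_gate = sigmoid(1.0 + context)  # how much memory to preserve
        input_gate = sigmoid(context)         # how much new evidence to admit
        output_gate = sigmoid(context)
        cell = [forget_gate * c + input_gate * math.tanh(v)
                for c, v in zip(cell, feat)]
        hidden = [output_gate * math.tanh(c) for c in cell]
    return hidden


def classify_open_set(feature, weights, biases, threshold=0.5):
    """Linear classifier with max-softmax rejection (assumed baseline rule).

    Returns the predicted known-class index, or -1 for "unknown".
    """
    logits = [sum(w * x for w, x in zip(row, feature)) + b
              for row, b in zip(weights, biases)]
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return best if probs[best] >= threshold else -1
```

In this sketch, `aggregate_features` would be called on the features produced at successive self-attention stages, and its output passed to `classify_open_set`; a low maximum class probability is treated as evidence that the image belongs to an unseen category.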
Pages: 3891 - 3904
Page count: 14
Related papers
50 in total
  • [31] Hierarchical Open-Set Recognition for Automatic Target Recognition
    Bennette, Walter
    Hofmann, Nathaniel
    Wilson, Nathaniel
    Witter, Tyler
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [32] The Weakly Supervised Network of Hierarchical Attention Mechanism for Fine-Grained Classification
    Long, Qian
    Wang, Gaihua
    Qu, Hongwei
    Yao, Jingxuan
    Zhu, Bolun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VII, ICIC 2024, 2024, 14868 : 257 - 265
  • [33] Hierarchical Self Attention Based Autoencoder for Open-Set Human Activity Recognition
    Tonmoy, M. Tanjid Hasan
    Mahmud, Saif
    Rahman, A. K. M. Mahbubur
    Amin, M. Ashraful
    Ali, Amin Ahsan
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 351 - 363
  • [34] Siamese transformer with hierarchical concept embedding for fine-grained image recognition
    Yilin LYU
    Liping JING
    Jiaqi WANG
    Mingzhe GUO
    Xinyue WANG
    Jian YU
    Science China (Information Sciences), 2023, 66 (03) : 188 - 203
  • [35] Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition
    Yu, Jun
    Tan, Min
    Zhang, Hongyuan
    Tao, Dacheng
    Rui, Yong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (02) : 563 - 578
  • [36] Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-grained Image Recognition
    Zheng, Heliang
    Fu, Jianlong
    Zha, Zheng-Jun
    Luo, Jiebo
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5007 - 5016
  • [37] Siamese transformer with hierarchical concept embedding for fine-grained image recognition
    Yilin Lyu
    Liping Jing
    Jiaqi Wang
    Mingzhe Guo
    Xinyue Wang
    Jian Yu
    Science China Information Sciences, 2023, 66
  • [38] Siamese transformer with hierarchical concept embedding for fine-grained image recognition
    Lyu, Yilin
    Jing, Liping
    Wang, Jiaqi
    Guo, Mingzhe
    Wang, Xinyue
    Yu, Jian
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (03)
  • [39] A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition
    Dakshayani Himabindu D.
    Praveen Kumar S.
    Dakshayani Himabindu, D. (dakshayanihimabindu_d@vnrvjiet.in), 1600, Brno University of Technology (27): : 59 - 67
  • [40] Feature Correlation Residual Network for Fine-Grained Image Recognition
    Xu, Jiazhen
    Wei, Yantao
    Deng, Wei
    IEEE ACCESS, 2020, 8 : 214322 - 214331