Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition

Cited by: 2
Authors
Sun, Jiayin [1, 2, 3]
Wang, Hong [4]
Dong, Qiulei [1, 2, 3]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China
Keywords
Transformers; Feature extraction; Task analysis; Image recognition; Training; Visualization; Computer vision; Open-set fine-grained image recognition; hierarchical attention; long-short term memory; TEMPORAL ATTENTION; DIFFICULTY;
DOI
10.1109/TCSVT.2023.3325001
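Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809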
Abstract
Triggered by the success of transformers in various visual tasks, the spatial self-attention mechanism has recently attracted more and more attention in the computer vision community. However, we empirically found that a typical vision transformer with the spatial self-attention mechanism could not learn accurate attention maps for distinguishing different categories of fine-grained images. To address this problem, motivated by the temporal attention mechanism in brains, we propose a hierarchical attention network for learning fine-grained feature representations, called HAN, where the features learnt by implementing a sequence of spatial self-attention operations corresponding to multiple moments are aggregated progressively. The proposed HAN consists of four modules: a self-attention backbone module for learning a sequence of features with self-attention operations, a spatial feature self-organizing module for facilitating the model training, a hierarchical aggregation module for aggregating the re-organized features via a Long Short-Term Memory network, and a context-aware module that is implemented as the forget block of the hierarchical aggregation module for preserving/forgetting the long-term memory by utilizing contextual information. Then, we propose a HAN-based method for open-set fine-grained recognition by integrating the proposed HAN network with a linear classifier, called HAN-OSFGR. Extensive experimental results on 3 fine-grained datasets and 2 coarse-grained datasets demonstrate that the proposed HAN-OSFGR outperforms 9 state-of-the-art open-set recognition methods significantly in most cases.
Pages: 3891-3904
Number of pages: 14
Related papers
50 in total
  • [1] Learning Scale-Consistent Attention Part Network for Fine-Grained Image Recognition
    Liu, Huabin
    Li, Jianguo
    Li, Dian
    See, John
    Lin, Weiyao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2902 - 2913
  • [2] Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition
    Yu, Jun
    Tan, Min
    Zhang, Hongyuan
    Tao, Dacheng
    Rui, Yong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (02) : 563 - 578
  • [3] Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Open-Set Noise and Utilizing Hard Examples
    Liu, Huafeng
    Zhang, Chuanyi
    Yao, Yazhou
    Wei, Xiu-Shen
    Shen, Fumin
    Tang, Zhenmin
    Zhang, Jian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 546 - 557
  • [4] Feature Correlation Residual Network for Fine-Grained Image Recognition
    Xu, Jiazhen
    Wei, Yantao
    Deng, Wei
    IEEE ACCESS, 2020, 8 : 214322 - 214331
  • [5] Fine-Grained Open-Set Deepfake Detection via Unsupervised Domain Adaptation
    Zhou, Xinye
    Han, Hu
    Shan, Shiguang
    Chen, Xilin
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 7536 - 7547
  • [6] Pay Attention to the Activations: A Modular Attention Mechanism for Fine-Grained Image Recognition
    Rodriguez, Pau
    Velazquez, Diego
    Cucurull, Guillem
    Gonfaus, Josep M.
    Roca, E. Xavier
    Gonzalez, Jordi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (02) : 502 - 514
  • [7] Hierarchical Feature Attention Learning Network for Detecting Object and Discriminative Parts in Fine-Grained Visual Classification
    Han, A. Yeong
    Yi, Kwang Moo
    Kim, Kyeong Tae
    Choi, Jae Young
    IEEE ACCESS, 2025, 13 : 19533 - 19544
  • [8] Bi-Modal Progressive Mask Attention for Fine-Grained Recognition
    Song, Kaitao
    Wei, Xiu-Shen
    Shu, Xiangbo
    Song, Ren-Jie
    Lu, Jianfeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7006 - 7018
  • [9] Discriminative Feature Mining and Enhancement Network for Low-Resolution Fine-Grained Image Recognition
    Yan, Tiantian
    Li, Haojie
    Sun, Baoli
    Wang, Zhihui
    Luo, Zhongxuan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5319 - 5330
  • [10] Conservative Novelty Synthesizing Network for Malware Recognition in an Open-Set Scenario
    Guo, Jingcai
    Guo, Song
    Ma, Shiheng
    Sun, Yuxia
    Xu, Yuanyuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (02) : 662 - 676