Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition

Cited by: 2
Authors
Sun, Jiayin [1 ,2 ,3 ]
Wang, Hong [4 ]
Dong, Qiulei [1 ,2 ,3 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China
Keywords
Transformers; Feature extraction; Task analysis; Image recognition; Training; Visualization; Computer vision; Open-set fine-grained image recognition; hierarchical attention; long-short term memory; TEMPORAL ATTENTION; DIFFICULTY;
DOI
10.1109/TCSVT.2023.3325001
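Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808 ; 0809 ;
Abstract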
Triggered by the success of transformers in various visual tasks, the spatial self-attention mechanism has recently attracted more and more attention in the computer vision community. However, we empirically found that a typical vision transformer with the spatial self-attention mechanism could not learn accurate attention maps for distinguishing different categories of fine-grained images. To address this problem, motivated by the temporal attention mechanism in brains, we propose a hierarchical attention network for learning fine-grained feature representations, called HAN, where the features learnt by implementing a sequence of spatial self-attention operations corresponding to multiple moments are aggregated progressively. The proposed HAN consists of four modules: a self-attention backbone module for learning a sequence of features with self-attention operations, a spatial feature self-organizing module for facilitating the model training, a hierarchical aggregation module for aggregating the re-organized features via a Long Short-Term Memory network, and a context-aware module that is implemented as the forget block of the hierarchical aggregation module for preserving/forgetting the long-term memory by utilizing contextual information. Then, we propose a HAN-based method for open-set fine-grained recognition by integrating the proposed HAN network with a linear classifier, called HAN-OSFGR. Extensive experimental results on 3 fine-grained datasets and 2 coarse-grained datasets demonstrate that the proposed HAN-OSFGR outperforms 9 state-of-the-art open-set recognition methods significantly in most cases.
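The aggregation idea in the abstract (a sequence of features from successive self-attention operations, folded together by a Long Short-Term Memory network and passed to a linear classifier) can be illustrated with a minimal sketch. This is not the authors' implementation. The names `LSTMAggregator` and `classify_open_set` are hypothetical, the weights are random, biases are omitted, the forget gate is a standard LSTM gate rather than the paper's context-aware module, and rejecting unknowns by thresholding the maximum softmax probability is an assumption that the abstract does not state.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def softmax(logits):
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]


class LSTMAggregator:
    """Hypothetical stand-in for the hierarchical aggregation module."""

    def __init__(self, feat_dim, hidden_dim, seed=0):
        rng = random.Random(seed)
        self.hidden_dim = hidden_dim
        in_dim = feat_dim + hidden_dim
        # One weight matrix per gate: input (i), forget (f), output (o),
        # and candidate memory (c). Biases are omitted for brevity.
        self.W = {g: rand_matrix(hidden_dim, in_dim, rng) for g in "ifoc"}

    def step(self, x, h, c):
        z = x + h  # concatenate current feature and previous hidden state
        i = [sigmoid(a) for a in matvec(self.W["i"], z)]
        # Standard forget gate; the paper replaces this with a
        # context-aware module whose details are not given in the abstract.
        f = [sigmoid(a) for a in matvec(self.W["f"], z)]
        o = [sigmoid(a) for a in matvec(self.W["o"], z)]
        g = [math.tanh(a) for a in matvec(self.W["c"], z)]
        c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
        return h_new, c_new

    def aggregate(self, feature_sequence):
        # Fold the features from successive "moments" into one vector.
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        for x in feature_sequence:
            h, c = self.step(x, h, c)
        return h


def classify_open_set(h, W_cls, threshold):
    # Assumed rejection rule: return -1 ("unknown") when the most
    # probable known class falls below the threshold.
    probs = softmax(matvec(W_cls, h))
    best = max(range(len(probs)), key=probs.__getitem__)
    return best if probs[best] >= threshold else -1


# Usage: three 4-dimensional features stand in for backbone outputs.
rng = random.Random(1)
features = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(3)]
agg = LSTMAggregator(feat_dim=4, hidden_dim=5)
W_cls = rand_matrix(3, 5, rng)  # linear classifier over 3 known classes
label = classify_open_set(agg.aggregate(features), W_cls, threshold=0.5)
```

In a real system the feature sequence would come from the self-attention backbone after the spatial feature self-organizing step, and all weights would be learned jointly rather than sampled at random.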
Pages: 3891-3904
Number of pages: 14
Related papers
50 in total
  • [21] Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition
    Zheng, Heliang
    Fu, Jianlong
    Mei, Tao
    Luo, Jiebo
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5219 - 5227
  • [22] Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Open-Set Noise and Utilizing Hard Examples
    Liu, Huafeng
    Zhang, Chuanyi
    Yao, Yazhou
    Wei, Xiu-Shen
    Shen, Fumin
    Tang, Zhenmin
    Zhang, Jian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 546 - 557
  • [23] Pay Attention to the Activations: A Modular Attention Mechanism for Fine-Grained Image Recognition
    Rodriguez, Pau
    Velazquez, Diego
    Cucurull, Guillem
    Gonfaus, Josep M.
    Roca, E. Xavier
    Gonzalez, Jordi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (02) : 502 - 514
  • [24] Open-set Fine-grained Retrieval via Prompting Vision-Language Evaluator
    Wang, Shijie
    Chang, Jianlong
    Li, Haojie
    Wang, Zhihui
    Ouyang, Wanli
    Tian, Qi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19381 - 19391
  • [25] Coreset Sampling from Open-Set for Fine-Grained Self-Supervised Learning
    Kim, Sungnyun
    Bae, Sangmin
    Yun, Young
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7537 - 7547
  • [26] Subtler mixed attention network on fine-grained image classification
    Liu, Chao
    Huang, Lei
    Wei, Zhiqiang
    Zhang, Wenfeng
    APPLIED INTELLIGENCE, 2021, 51 (11) : 7903 - 7916
  • [27] Subtler mixed attention network on fine-grained image classification
    Chao Liu
    Lei Huang
    Zhiqiang Wei
    Wenfeng Zhang
    Applied Intelligence, 2021, 51 : 7903 - 7916
  • [28] FGM-SPCL: Open-Set Recognition Network for Medical Images Based on Fine-Grained Data Mixture and Spatial Position Constraint Loss
    Zhang, Ruru
    Haihong, E.
    Yuan, Lifei
    Wang, Yanhui
    Wang, Lifei
    Song, Meina
    CHINESE JOURNAL OF ELECTRONICS, 2024, 33 (04) : 1023 - 1033
  • [29] DMRAN: A Hierarchical Fine-Grained Attention-Based Network for Recommendation
    Wang, Huizhao
    Liu, Guanfeng
    Liu, An
    Li, Zhixu
    Zheng, Kai
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3698 - 3704
  • [30] FGM-SPCL:Open-Set Recognition Network for Medical Images Based on Fine-Grained Data Mixture and Spatial Position Constraint Loss
    Ruru ZHANG
    Haihong E
    Lifei YUAN
    Yanhui WANG
    Lifei WANG
    Meina SONG
    Chinese Journal of Electronics, 2024, 33 (04) : 1023 - 1033