Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition

被引:2
|
作者
Sun, Jiayin [1 ,2 ,3 ]
Wang, Hong [4 ]
Dong, Qiulei [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China
关键词
Transformers; Feature extraction; Task analysis; Image recognition; Training; Visualization; Computer vision; Open-set fine-grained image recognition; hierarchical attention; long-short term memory; TEMPORAL ATTENTION; DIFFICULTY;
D O I
10.1109/TCSVT.2023.3325001
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Triggered by the success of transformers in various visual tasks, the spatial self-attention mechanism has recently attracted more and more attention in the computer vision community. However, we empirically found that a typical vision transformer with the spatial self-attention mechanism could not learn accurate attention maps for distinguishing different categories of fine-grained images. To address this problem, motivated by the temporal attention mechanism in brains, we propose a hierarchical attention network for learning fine-grained feature representations, called HAN, where the features learnt by implementing a sequence of spatial self-attention operations corresponding to multiple moments are aggregated progressively. The proposed HAN consists of four modules: a self-attention backbone module for learning a sequence of features with self-attention operations, a spatial feature self-organizing module for facilitating the model training, a hierarchical aggregation module for aggregating the re-organized features via a Long Short-Term Memory network, and a context-aware module that is implemented as the forget block of the hierarchical aggregation module for preserving/forgetting the long-term memory by utilizing contextual information. Then, we propose a HAN-based method for open-set fine-grained recognition by integrating the proposed HAN network with a linear classifier, called HAN-OSFGR. Extensive experimental results on 3 fine-grained datasets and 2 coarse-grained datasets demonstrate that the proposed HAN-OSFGR outperforms 9 state-of-the-art open-set recognition methods significantly in most cases.
引用
收藏
页码:3891 / 3904
页数:14
相关论文
共 50 条
  • [41] Context-Aware Fine-Grained Product Recognition on Grocery Shelves
    Budimir, Lovre Antonio
    Kalafatic, Zoran
    Subasic, Marko
    Loncaric, Sven
    IEEE ACCESS, 2025, 13 : 16824 - 16837
  • [42] Feature Consistency-Based Prototype Network for Open-Set Hyperspectral Image Classification
    Xie, Zhuojun
    Duan, Puhong
    Liu, Wang
    Kang, Xudong
    Wei, Xiaohui
    Li, Shutao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9286 - 9296
  • [43] COUPLED PATCH SIMILARITY NETWORK FOR ONE-SHOT FINE-GRAINED IMAGE RECOGNITION
    Tian, Sheng
    Tang, Hao
    Dai, Longquan
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2478 - 2482
  • [44] CMSEA: Compound Model Scaling With Efficient Attention for Fine-Grained Image Classification
    Guang, Jinzheng
    Liang, Jianru
    IEEE ACCESS, 2022, 10 : 18222 - 18232
  • [45] A Fine-Grained Image Classification Model Based on Hybrid Attention and Pyramidal Convolution
    Wang, Sifeng
    Li, Shengxiang
    Li, Anran
    Dong, Zhaoan
    Li, Guangshun
    Yan, Chao
    TSINGHUA SCIENCE AND TECHNOLOGY, 2025, 30 (03): : 1283 - 1293
  • [46] Fine-Grained Image Quality Caption With Hierarchical Semantics Degradation
    Yang, Wen
    Wu, Jinjian
    Tian, Shiwei
    Li, Leida
    Dong, Weisheng
    Shi, Guangming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3578 - 3590
  • [47] Text to Image GANs with RoBERTa and Fine-grained Attention Networks
    Siddharth, M.
    Aarthi, R.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (12) : 947 - 955
  • [48] Fine-Grained Image Analysis With Deep Learning: A Survey
    Wei, Xiu-Shen
    Song, Yi-Zhe
    Mac Aodha, Oisin
    Wu, Jianxin
    Peng, Yuxin
    Tang, Jinhui
    Yang, Jian
    Belongie, Serge
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 8927 - 8948
  • [49] SDHC: Joint Semantic-Data Guided Hierarchical Classification for Fine-Grained HRRP Target Recognition
    Liu, Yichen
    Long, Teng
    Zhang, Liang
    Wang, Yanhua
    Zhang, Xin
    Li, Yang
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (04) : 3993 - 4009
  • [50] Multiscale Progressive Complementary Fusion Network for Fine-Grained Visual Classification
    Lei, Jingsheng
    Yang, Xinqi
    Yang, Shengying
    IEEE ACCESS, 2022, 10 : 62800 - 62810