Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition

被引：2

作者：

Sun, Jiayin ^{[1
,2
,3
]}

Wang, Hong ^{[4
]}

Dong, Qiulei ^{[1
,2
,3
]}

机构：

[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China

[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100190, Peoples R China

[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

[4] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 05期

关键词：

Transformers; Feature extraction; Task analysis; Image recognition; Training; Visualization; Computer vision; Open-set fine-grained image recognition; hierarchical attention; long-short term memory; TEMPORAL ATTENTION; DIFFICULTY;

D O I：

10.1109/TCSVT.2023.3325001

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Triggered by the success of transformers in various visual tasks, the spatial self-attention mechanism has recently attracted more and more attention in the computer vision community. However, we empirically found that a typical vision transformer with the spatial self-attention mechanism could not learn accurate attention maps for distinguishing different categories of fine-grained images. To address this problem, motivated by the temporal attention mechanism in brains, we propose a hierarchical attention network for learning fine-grained feature representations, called HAN, where the features learnt by implementing a sequence of spatial self-attention operations corresponding to multiple moments are aggregated progressively. The proposed HAN consists of four modules: a self-attention backbone module for learning a sequence of features with self-attention operations, a spatial feature self-organizing module for facilitating the model training, a hierarchical aggregation module for aggregating the re-organized features via a Long Short-Term Memory network, and a context-aware module that is implemented as the forget block of the hierarchical aggregation module for preserving/forgetting the long-term memory by utilizing contextual information. Then, we propose a HAN-based method for open-set fine-grained recognition by integrating the proposed HAN network with a linear classifier, called HAN-OSFGR. Extensive experimental results on 3 fine-grained datasets and 2 coarse-grained datasets demonstrate that the proposed HAN-OSFGR outperforms 9 state-of-the-art open-set recognition methods significantly in most cases.

引用

页码：3891 / 3904

页数：14

共 50 条

[41] Context-Aware Fine-Grained Product Recognition on Grocery Shelves
Budimir, Lovre Antonio
Kalafatic, Zoran
Subasic, Marko
Loncaric, Sven
IEEE ACCESS, 2025, 13 : 16824 - 16837
[42] Feature Consistency-Based Prototype Network for Open-Set Hyperspectral Image Classification
Xie, Zhuojun
Duan, Puhong
Liu, Wang
Kang, Xudong
Wei, Xiaohui
Li, Shutao
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9286 - 9296
[43] COUPLED PATCH SIMILARITY NETWORK FOR ONE-SHOT FINE-GRAINED IMAGE RECOGNITION
Tian, Sheng
Tang, Hao
Dai, Longquan
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2478 - 2482
[44] CMSEA: Compound Model Scaling With Efficient Attention for Fine-Grained Image Classification
Guang, Jinzheng
Liang, Jianru
IEEE ACCESS, 2022, 10 : 18222 - 18232
[45] A Fine-Grained Image Classification Model Based on Hybrid Attention and Pyramidal Convolution
Wang, Sifeng
Li, Shengxiang
Li, Anran
Dong, Zhaoan
Li, Guangshun
Yan, Chao
TSINGHUA SCIENCE AND TECHNOLOGY, 2025, 30 (03): : 1283 - 1293
[46] Fine-Grained Image Quality Caption With Hierarchical Semantics Degradation
Yang, Wen
Wu, Jinjian
Tian, Shiwei
Li, Leida
Dong, Weisheng
Shi, Guangming
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3578 - 3590
[47] Text to Image GANs with RoBERTa and Fine-grained Attention Networks
Siddharth, M.
Aarthi, R.
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (12) : 947 - 955
[48] Fine-Grained Image Analysis With Deep Learning: A Survey
Wei, Xiu-Shen
Song, Yi-Zhe
Mac Aodha, Oisin
Wu, Jianxin
Peng, Yuxin
Tang, Jinhui
Yang, Jian
Belongie, Serge
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 8927 - 8948
[49] SDHC: Joint Semantic-Data Guided Hierarchical Classification for Fine-Grained HRRP Target Recognition
Liu, Yichen
Long, Teng
Zhang, Liang
Wang, Yanhua
Zhang, Xin
Li, Yang
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (04) : 3993 - 4009
[50] Multiscale Progressive Complementary Fusion Network for Fine-Grained Visual Classification
Lei, Jingsheng
Yang, Xinqi
Yang, Shengying
IEEE ACCESS, 2022, 10 : 62800 - 62810

← 1 2 3 4 5 →