Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition

Cited by: 2

Authors:

Sun, Jiayin [1,2,3]

Wang, Hong [4]

Dong, Qiulei [1,2,3]

Affiliations:

[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China

[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100190, Peoples R China

[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

[4] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024, Vol. 34, No. 5

Keywords:

Transformers; Feature extraction; Task analysis; Image recognition; Training; Visualization; Computer vision; Open-set fine-grained image recognition; hierarchical attention; long-short term memory; TEMPORAL ATTENTION; DIFFICULTY;

DOI:

10.1109/TCSVT.2023.3325001

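Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code: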

0808 ; 0809 ;

Abstract:

Triggered by the success of transformers in various visual tasks, the spatial self-attention mechanism has recently attracted more and more attention in the computer vision community. However, we empirically found that a typical vision transformer with the spatial self-attention mechanism could not learn accurate attention maps for distinguishing different categories of fine-grained images. To address this problem, motivated by the temporal attention mechanism in brains, we propose a hierarchical attention network for learning fine-grained feature representations, called HAN, where the features learnt by implementing a sequence of spatial self-attention operations corresponding to multiple moments are aggregated progressively. The proposed HAN consists of four modules: a self-attention backbone module for learning a sequence of features with self-attention operations, a spatial feature self-organizing module for facilitating the model training, a hierarchical aggregation module for aggregating the re-organized features via a Long Short-Term Memory network, and a context-aware module that is implemented as the forget block of the hierarchical aggregation module for preserving/forgetting the long-term memory by utilizing contextual information. Then, we propose a HAN-based method for open-set fine-grained recognition by integrating the proposed HAN network with a linear classifier, called HAN-OSFGR. Extensive experimental results on 3 fine-grained datasets and 2 coarse-grained datasets demonstrate that the proposed HAN-OSFGR outperforms 9 state-of-the-art open-set recognition methods significantly in most cases.
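The aggregation and open-set steps described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the weights are random, the "context" fed to the forget gate is assumed to be the mean of the step features, and rejecting unknown classes by thresholding the maximum softmax probability is a common open-set baseline assumed here, not a rule stated in the abstract. All names (`aggregate`, `classify_open_set`, `params`, `threshold`) are hypothetical.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def aggregate(features, params):
    """LSTM-style aggregation of a feature sequence into one vector.

    Assumption: the forget gate additionally sees a context vector,
    taken here as the mean of all step features.
    """
    hid = len(params["W_i"])
    h = [0.0] * hid  # short-term state
    c = [0.0] * hid  # long-term memory
    n = len(features)
    context = [sum(col) / n for col in zip(*features)]
    for x in features:
        xh = x + h  # concatenate current input and previous hidden state
        i = [sigmoid(z) for z in matvec(params["W_i"], xh)]
        o = [sigmoid(z) for z in matvec(params["W_o"], xh)]
        g = [math.tanh(z) for z in matvec(params["W_g"], xh)]
        # Context-aware forget gate: decides how much memory to keep.
        f = [sigmoid(z) for z in matvec(params["W_f"], xh + context)]
        c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
        h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
    return h

def softmax(logits):
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def classify_open_set(h, W_cls, threshold):
    """Linear classifier; returns -1 (unknown) below the confidence threshold."""
    probs = softmax(matvec(W_cls, h))
    best = max(range(len(probs)), key=probs.__getitem__)
    return best if probs[best] >= threshold else -1

rng = random.Random(0)
in_dim, hid, steps, num_classes = 4, 3, 5, 2
params = {
    "W_i": rand_matrix(hid, in_dim + hid, rng),
    "W_o": rand_matrix(hid, in_dim + hid, rng),
    "W_g": rand_matrix(hid, in_dim + hid, rng),
    "W_f": rand_matrix(hid, in_dim + hid + in_dim, rng),
}
# Stand-in for the sequence of features produced by the attention backbone.
features = [[rng.uniform(-1, 1) for _ in range(in_dim)] for _ in range(steps)]
h = aggregate(features, params)
W_cls = rand_matrix(num_classes, hid, rng)
print(classify_open_set(h, W_cls, threshold=0.9))
```

In this sketch a label of -1 stands for "unknown class"; raising `threshold` makes the classifier reject more inputs.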

Pages: 3891-3904

Page count: 14

