Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition

被引：2

作者：

Sun, Jiayin ^{[1
,2
,3
]}

Wang, Hong ^{[4
]}

Dong, Qiulei ^{[1
,2
,3
]}

机构：

[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China

[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100190, Peoples R China

[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

[4] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 05期

关键词：

Transformers; Feature extraction; Task analysis; Image recognition; Training; Visualization; Computer vision; Open-set fine-grained image recognition; hierarchical attention; long-short term memory; TEMPORAL ATTENTION; DIFFICULTY;

D O I：

10.1109/TCSVT.2023.3325001

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Triggered by the success of transformers in various visual tasks, the spatial self-attention mechanism has recently attracted more and more attention in the computer vision community. However, we empirically found that a typical vision transformer with the spatial self-attention mechanism could not learn accurate attention maps for distinguishing different categories of fine-grained images. To address this problem, motivated by the temporal attention mechanism in brains, we propose a hierarchical attention network for learning fine-grained feature representations, called HAN, where the features learnt by implementing a sequence of spatial self-attention operations corresponding to multiple moments are aggregated progressively. The proposed HAN consists of four modules: a self-attention backbone module for learning a sequence of features with self-attention operations, a spatial feature self-organizing module for facilitating the model training, a hierarchical aggregation module for aggregating the re-organized features via a Long Short-Term Memory network, and a context-aware module that is implemented as the forget block of the hierarchical aggregation module for preserving/forgetting the long-term memory by utilizing contextual information. Then, we propose a HAN-based method for open-set fine-grained recognition by integrating the proposed HAN network with a linear classifier, called HAN-OSFGR. Extensive experimental results on 3 fine-grained datasets and 2 coarse-grained datasets demonstrate that the proposed HAN-OSFGR outperforms 9 state-of-the-art open-set recognition methods significantly in most cases.

引用

页码：3891 / 3904

页数：14

共 50 条

[21] Learning Rich Part Hierarchies With Progressive Attention Networks for Fine-Grained Image Recognition
Zheng, Heliang
Fu, Jianlong
Zha, Zheng-Jun
Luo, Jiebo
Mei, Tao
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 476 - 488
[22] Attention-Guided CutMix Data Augmentation Network for Fine-Grained Bird Recognition
Guo, Wenming
Wang, Yifei
Han, Fang
PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
[23] Wavelet and Adaptive Coordinate Attention Guided Fine-Grained Residual Network for Image Denoising
Ding, Shifei
Wang, Qidong
Guo, Lili
Li, Xuan
Ding, Ling
Wu, Xindong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 6156 - 6166
[24] Application of Improved DNN Algorithm Based on Feature Fusion in Fine-Grained Image Recognition
Zhu, Jiongguang
Zhang, Wei
IEEE ACCESS, 2024, 12 (32140-32151) : 32140 - 32151
[25] Fine-grained Recognition of Chinese Food Image Based on DenseNet with Attention Mechanism
Hao, Ran
Gao, Weidong
Mi, Jihang
Zhao, Zhenwei
TWELFTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2020), 2021, 11720
[26] Visual Attention Focusing on Fine-Grained Foreground and Eliminating Background Bias for Pest Image Identification
Xu, Xinyuan
Li, Heng
Gao, Qi
Zhou, Meixuan
Meng, Tianyue
Yin, Liping
Chai, Xinyu
IEEE ACCESS, 2024, 12 : 161732 - 161741
[27] Food and Ingredient Joint Learning for Fine-Grained Recognition
Liu, Chengxu
Liang, Yuanzhi
Xue, Yao
Qian, Xueming
Fu, Jianlong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2480 - 2493
[28] Incremental Learning With Open-Set Recognition for Remote Sensing Image Scene Classification
Liu, Weiwei
Nie, Xiangli
Zhang, Bo
Sun, Xian
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[29] Regional Attention Network (RAN) for Head Pose and Fine-Grained Gesture Recognition
Behera, Ardhendu
Wharton, Zachary
Liu, Yonghuai
Ghahremani, Morteza
Kumar, Swagat
Bessis, Nik
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (01) : 549 - 562
[30] Incremental Learning for Fine-Grained Image Recognition
Cao, Liangliang
Hsiao, Jenhao
de Juan, Paloma
Li, Yuncheng
Thomee, Bart
ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 363 - 366

← 1 2 3 4 5 →