GNDAN: Graph Navigated Dual Attention Network for Zero-Shot Learning

Cited by: 25

Authors:

Chen, Shiming [1]

Hong, Ziming [1]

Xie, Guosen [2]

Peng, Qinmu [1]

You, Xinge [1]

Ding, Weiping [3]

Shao, Ling [4]

Affiliations:

[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China

[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China

[3] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China

[4] Saudi Data & Artificial Intelligence Author SDAIA, Natl Ctr Artificial Intelligence NCAI, Riyadh, Saudi Arabia

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024 / Vol. 35 / No. 04

Funding:

National Natural Science Foundation of China;

Keywords:

Semantics; Visualization; Feature extraction; Task analysis; Knowledge transfer; Navigation; Learning systems; Attribute-based region features; graph attention network (GAT); graph neural network (GNN); zero-shot learning (ZSL);

DOI:

10.1109/TNNLS.2022.3155602

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Zero-shot learning (ZSL) tackles the unseen class recognition problem by transferring semantic knowledge from seen classes to unseen ones. Typically, to guarantee desirable knowledge transfer, a direct embedding is adopted for associating the visual and semantic domains in ZSL. However, most existing ZSL methods focus on learning the embedding from implicit global features or image regions to the semantic space. Thus, they fail to: 1) exploit the appearance relationship priors between various local regions in a single image, which corresponds to the semantic information and 2) learn cooperative global and local features jointly for discriminative feature representations. In this article, we propose the novel graph navigated dual attention network (GNDAN) for ZSL to address these drawbacks. GNDAN employs a region-guided attention network (RAN) and a region-guided graph attention network (RGAT) to jointly learn a discriminative local embedding and incorporate global context for exploiting explicit global embeddings under the guidance of a graph. Specifically, RAN uses soft spatial attention to discover discriminative regions for generating local embeddings. Meanwhile, RGAT employs an attribute-based attention to obtain attribute-based region features, where each attribute focuses on the most relevant image regions. Motivated by the graph neural network (GNN), which is beneficial for structural relationship representations, RGAT further leverages a graph attention network to exploit the relationships between the attribute-based region features for explicit global embedding representations. Based on the self-calibration mechanism, the joint visual embedding learned is matched with the semantic embedding to form the final prediction. Extensive experiments on three benchmark datasets demonstrate that the proposed GNDAN achieves superior performances to the state-of-the-art methods. Our code and trained models are available at https://github.com/shiming-chen/GNDAN.
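The two ideas the abstract attributes to RGAT (attribute-based attention over image regions, followed by graph attention over the resulting attribute-based region features) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation, which is available at the repository linked above. It assumes a generic single-head graph attention layer on a fully connected attribute graph, random weights in place of learned ones, and hypothetical names and sizes (`regions`, `attributes`, `R`, `D`, `K`, `Da`, `H`).

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attribute_region_features(regions, attributes, w):
    """Each attribute attends over all image regions.

    regions:    (R, D)  region features (hypothetical shape)
    attributes: (K, Da) attribute vectors (hypothetical shape)
    w:          (Da, D) projection, random here, learned in practice
    """
    scores = attributes @ w @ regions.T   # (K, R) attribute-region relevance
    attn = softmax(scores, axis=1)        # normalise over regions
    return attn @ regions, attn           # (K, D) attribute-based region features

def graph_attention(nodes, w, a, slope=0.2):
    """Generic single-head graph attention on a fully connected graph.

    nodes: (K, D), w: (D, H), a: (2H,) attention vector
    """
    h = nodes @ w                         # (K, H) projected node features
    half = h.shape[1]
    # e[i, j] = LeakyReLU(a^T [h_i || h_j]), split into two dot products.
    e = (h @ a[:half])[:, None] + (h @ a[half:])[None, :]
    e = np.where(e > 0, e, slope * e)
    alpha = softmax(e, axis=1)            # normalise over neighbours j
    return alpha @ h, alpha               # (K, H) updated node features

# Toy sizes, chosen only for illustration.
rng = np.random.default_rng(0)
R, D, K, Da, H = 49, 16, 5, 8, 12
regions = rng.normal(size=(R, D))
attributes = rng.normal(size=(K, Da))

feats, attn = attribute_region_features(regions, attributes, rng.normal(size=(Da, D)))
out, alpha = graph_attention(feats, rng.normal(size=(D, H)), rng.normal(size=2 * H))
print(feats.shape, out.shape)
```

In this sketch each row of `attn` is a distribution over regions for one attribute, and each row of `alpha` is a distribution over the other attribute nodes. How these features are pooled into the explicit global embedding and combined with the local embedding is described in the paper itself and is not reproduced here.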

Pages: 4516 - 4529

Number of pages: 14

