Integrating topology beyond descriptions for zero-shot learning

被引:1
作者
Chen, Ziyi [1 ]
Gao, Yutong [1 ]
Lang, Congyan [1 ]
Wei, Lili [1 ]
Li, Yidong [1 ]
Liu, Hongzhe [2 ]
Liu, Fayao [3 ]
机构
[1] Beijing Jiaotong Univ, Key Lab Big Data & Artificial Intelligence Transpo, Minist Educ, Beijing 100044, Peoples R China
[2] Beijing Union Univ, Beijing Key Lab Informat Serv Engineenng, Beijing 100101, Peoples R China
[3] ASTAR, Inst Infocomm Res, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Zero-shot learning; Topology mining; Image classification; DATABASE;
D O I
10.1016/j.patcog.2023.109738
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) aims to discriminate object categories through the identification of their at-tributes and has received much attention for its capability to predict unseen categories without col-lecting training data. Recently, excellent works have been devoted to optimizing the model inference by mining the topology among categories/attributes, which proves that the topology learning is bene-ficial and important for ZSL. However, existing works focus almost exclusively on the construction of semantic topological knowledge with textual descriptions, which, though effective, still suffer from two deficiencies: first, the semantic gap between modalities makes it difficult for the category attributes to accurately describe the corresponding visual characters, resulting in the topology constructed in the se-mantic modality being distorted in the visual modality; second, it is difficult for one to enumerate all the attributes hidden in images, resulting in an incomplete topology mined only from the defined attributes. Therefore, we propose a Cross-Modality Topology Propagation Matcher (CTPM) to construct a more com-plete topology system by collaborative mining of topological knowledge in both the visual and semantic modalities. We stand at the dataset level to construct sample-based visual topological knowledge based on the global image features to preserve the integrity of visual information. Meanwhile, we exploit the matching relationship between visual and semantic modalities to make topological knowledge propagate effectively across modalities, and fully enjoy the benefits of multi-modality topological knowledge in cat-egory/attribute reasoning. We validate the effectiveness of our CTPM through extensive experiments and achieve state-of-the-art performance on four ZSL datasets.& COPY; 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 49 条
[1]  
Chen S., 2021, Neurips
[2]  
Chen SM, 2022, AAAI CONF ARTIF INTE, P330
[3]   MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning [J].
Chen, Shiming ;
Hong, Ziming ;
Xie, Guo-Sen ;
Yang, Wenhan ;
Peng, Qinmu ;
Wang, Kai ;
Zhao, Jian ;
You, Xinge .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :7602-7611
[4]   FREE: Feature Refinement for Generalized Zero-Shot Learning [J].
Chen, Shiming ;
Wang, Wenjie ;
Xia, Beihao ;
Peng, Qinmu ;
You, Xinge ;
Zheng, Feng ;
Shao, Ling .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :122-131
[5]   Hybrid routing transformer for zero-shot learning [J].
Cheng, De ;
Wang, Gerong ;
Wang, Bo ;
Zhang, Qiang ;
Han, Jungong ;
Zhang, Dingwen .
PATTERN RECOGNITION, 2023, 137
[6]   Write a Classifier: Zero-Shot Learning Using Purely Textual Descriptions [J].
Elhoseiny, Mohamed ;
Saleh, Babak ;
Elgammal, Ahmed .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :2584-2591
[7]   Multi-modal Cycle-Consistent Generalized Zero-Shot Learning [J].
Felix, Rafael ;
Kumar, B. G. Vijay ;
Reid, Ian ;
Carneiro, Gustavo .
COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 :21-37
[8]   Explainable zero-shot learning via attentive graph convolutional network and knowledge graphs [J].
Geng, Yuxia ;
Chen, Jiaoyan ;
Ye, Zhiquan ;
Yuan, Zonggang ;
Zhang, Wei ;
Chen, Huajun .
SEMANTIC WEB, 2021, 12 (05) :741-765
[9]  
Gong K, 2019, PROC CVPR IEEE, P7442, DOI [10.1109/cvpr.2019.00763, 10.1109/CVPR.2019.00763]
[10]   Contrastive Embedding for Generalized Zero-Shot Learning [J].
Han, Zongyan ;
Fu, Zhenyong ;
Chen, Shuo ;
Yang, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2371-2381