LVAR-CZSL: Learning Visual Attributes Representation for Compositional Zero-Shot Learning

被引：0

作者：

Ma, Xingjiang ^{[1
]}

Yang, Jing ^{[1
,2
]}

Lin, Jiacheng ^{[3
]}

Zheng, Zhenzhe ^{[4
]}

Li, Shaobo ^{[1
]}

Hu, Bingqi ^{[1
]}

Tang, Xianghong ^{[1
]}

机构：

[1] Guizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China

[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China

[3] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China

[4] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Visualization; Feature extraction; Dogs; Task analysis; Attention mechanisms; Zero-shot learning; Circuits and systems; Compositional zero-shot learning; visual attributes; objects and attributes; inter-class connectivity; OBJECTS;

D O I：

10.1109/TCSVT.2024.3444782

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Compositional Zero-Shot Learning (CZSL) has been applied to various scenarios, including scene understanding, visual-language representation, and domain adaptation. Despite numerous endeavours and significant advancements, the crucial issues of fuzzy conceptualization of visual attributes and insufficient inter-class connectivity, have remained insufficiently addressed. To address these issues, we propose Learning Visual Attributes Representation for Compositional Zero-Shot Learning (LVAR-CZSL), which has the ability to learn visual attributes and inter-class dependencies. LVAR-CZSL is mainly composed of two key components: the Visual Attribute Representation Module (VARM) and the Connected Learning Module (CLM). Specifically, VARM extracts detailed attributes and object visual features from global visual features, resolving the issue of fuzzy visual attribute concepts. Moreover, CLM endows LVAR-CZSL with the capability to perceive connectivity between different attributes and objects, effectively enhancing inter-class connectivity. To establish a close connection between VARM and CLM and minimize the gap between image and text features, we introduce the composition-attribute-object Joint Scoring Function (JSF). Additionally, we propose Joint Loss Function (JLF) to optimize the learning process of VARM and CLM. The experiment results on four datasets show that LVAR-CZSL achieves state-of-the-art performance. The code is available at https://github.com/mxjmxj1/LVAR-CZSL.

引用

页码：13311 / 13323

页数：13

共 50 条

[1] Zero-shot recognition with latent visual attributes learning
Xie, Yurui
He, Xiaohai
Zhang, Jing
Luo, Xiaodong
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (37-38) : 27321 - 27335
[2] Zero-shot recognition with latent visual attributes learning
Yurui Xie
Xiaohai He
Jing Zhang
Xiaodong Luo
Multimedia Tools and Applications, 2020, 79 : 27321 - 27335
[3] Learning Invariant Visual Representations for Compositional Zero-Shot Learning
Zhang, Tian
Liang, Kongming
Du, Ruoyi
Sun, Xian
Ma, Zhanyu
Guo, Jun
COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 339 - 355
[4] Learning Graph Embeddings for Open World Compositional Zero-Shot Learning
Mancini, Massimiliano
Naeem, Muhammad Ferjad
Xian, Yongqin
Akata, Zeynep
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1545 - 1560
[5] Adaptive Fusion Learning for Compositional Zero-Shot Recognition
Min, Lingtong
Fan, Ziman
Wang, Shunzhou
Dou, Feiyang
Li, Xin
Wang, Binglu
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1193 - 1204
[6] Deep Representation of Hierarchical Semantic Attributes for Zero-shot Learning
Zhang, Zhaocheng
Yang, Gang
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[7] Semantic-aware visual attributes learning for zero-shot recognition
Xie, Yurui
Song, Tiecheng
Li, Wei
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74 (74)
[8] Semantic-aware visual attributes learning for zero-shot recognition
Xie, Yurui
Song, Tiecheng
Li, Wei
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
[9] Semantic-aware visual attributes learning for zero-shot recognition
Xie, Yurui
Song, Tiecheng
Li, Wei
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
[10] Semantic-aware visual attributes learning for zero-shot recognition
Xie, Yurui
Song, Tiecheng
Li, Wei
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74

← 1 2 3 4 5 →