A Decomposable Causal View of Compositional Zero-Shot Learning

Cited by: 7

Authors:

Yang, Muli [1]

Xu, Chenghao [1]

Wu, Aming [1]

Deng, Cheng [1]

Affiliations:

[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2023 / Vol. 25

Funding:

National Natural Science Foundation of China;

Keywords:

Compositional zero-shot learning; vision and language; image recognition; causality;

DOI:

10.1109/TMM.2022.3200578

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Composing and recognizing novel concepts that are combinations of known concepts, i.e., compositional generalization, is one of the greatest powers of human intelligence. With the development of artificial intelligence, it has become increasingly appealing to build a vision system that can generalize to unknown compositions from limited known knowledge, which has so far remained a great challenge to our community. In fact, machines can be easily misled by superficial correlations in the data, disregarding the causal patterns that are crucial to generalization. In this paper, we rethink compositional generalization from a causal perspective, in the context of Compositional Zero-Shot Learning (CZSL). We develop a simple yet strong approach based on our novel Decomposable Causal view (dubbed "DECA"), approximating the causal effect with a combination of three easy-to-learn components. Our proposed DECA is evaluated on two challenging CZSL benchmarks by recognizing unknown compositions of known concepts. Despite its simple design, our approach achieves consistent improvements over state-of-the-art baselines, demonstrating its superiority towards the goal of compositional generalization.

Pages: 5892 - 5902

Page count: 11

50 records in total

[21] Correlated dual autoencoder for zero-shot learning
Jiang, Ming
Liu, Zhiyong
Li, Pengfei
Zhang, Min
Tang, Jingfan
UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2020, 82 (01) : 65 - 76
[22] Fusing spatial and frequency features for compositional zero-shot image classification
Li, Suyi
Jiang, Chenyi
Ye, Qiaolin
Wang, Shidong
Yang, Wankou
Zhang, Haofeng
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
[23] Preserving text space integrity for robust compositional zero-shot learning via mixture of pretrained experts
Hao, Zehua
Liu, Fang
Jiao, Licheng
Du, Yaoyang
Li, Shuo
Wang, Hao
Li, Pengfang
Liu, Xu
Chen, Puhua
NEUROCOMPUTING, 2025, 614
[24] AOGN-CZSL: An Attribute- and Object-Guided Network for Compositional Zero-Shot Learning
Yang, Jing
Ma, Xingjiang
Wu, Yuankai
Li, Chengjiang
Sue, Zhidong
Xu, Ji
Feng, Yixiong
INFORMATION FUSION, 2025, 120
[25] Dual triplet network for image zero-shot learning
Ji, Zhong
Wang, Hai
Pang, Yanwei
Shao, Ling
NEUROCOMPUTING, 2020, 373 : 90 - 97
[26] Simple Primitives With Feasibility- and Contextuality-Dependence for Open-World Compositional Zero-Shot Learning
Liu, Zhe
Li, Yun
Yao, Lina
Chang, Xiaojun
Fang, Wei
Wu, Xiaojun
El Saddik, Abdulmotaleb
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (01) : 543 - 560
[27] Learning object-centric complementary features for zero-shot learning
Liu, Jie
Song, Kechen
He, Yu
Dong, Hongwen
Yan, Yunhui
Meng, Qinggang
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 89
[28] Domain-Oriented Semantic Embedding for Zero-Shot Learning
Min, Shaobo
Yao, Hantao
Xie, Hongtao
Zha, Zheng-Jun
Zhang, Yongdong
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3919 - 3930
[29] Learning Multipart Attention Neural Network for Zero-Shot Classification
Meng, Min
Wei, Jie
Wu, Jigang
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 414 - 423
[30] Language-Augmented Pixel Embedding for Generalized Zero-Shot Learning
Wang, Ziyang
Gou, Yunhao
Li, Jingjing
Zhu, Lei
Shen, Heng Tao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1019 - 1030