A Decomposable Causal View of Compositional Zero-Shot Learning

Cited by: 7

Authors:

Yang, Muli [1]

Xu, Chenghao [1]

Wu, Aming [1]

Deng, Cheng [1]

Affiliations:

[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2023 / Vol. 25

Funding:

National Natural Science Foundation of China;

Keywords:

Compositional zero-shot learning; vision and language; image recognition; causality;

DOI:

10.1109/TMM.2022.3200578

Chinese Library Classification:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Composing and recognizing novel concepts that are combinations of known concepts, i.e., compositional generalization, is one of the greatest powers of human intelligence. With the development of artificial intelligence, it has become increasingly appealing to build a vision system that can generalize to unknown compositions from limited known knowledge, which remains a great challenge for our community. In fact, machines are easily misled by superficial correlations in the data and disregard the causal patterns that are crucial to generalization. In this paper, we rethink compositional generalization from a causal perspective, in the context of Compositional Zero-Shot Learning (CZSL). We develop a simple yet strong approach based on our novel Decomposable Causal view (dubbed "DECA"), approximating the causal effect with a combination of three easy-to-learn components. Our proposed DECA is evaluated on two challenging CZSL benchmarks by recognizing unknown compositions of known concepts. Despite its simple design, our approach achieves consistent improvements over state-of-the-art baselines, demonstrating its superiority for the goal of compositional generalization.

Pages: 5892-5902

Page count: 11

Total: 50 items

[1] Adaptive Fusion Learning for Compositional Zero-Shot Recognition
Min, Lingtong
Fan, Ziman
Wang, Shunzhou
Dou, Feiyang
Li, Xin
Wang, Binglu
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1193 - 1204
[2] Learning Invariant Visual Representations for Compositional Zero-Shot Learning
Zhang, Tian
Liang, Kongming
Du, Ruoyi
Sun, Xian
Ma, Zhanyu
Guo, Jun
COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 339 - 355
[3] Reference-Limited Compositional Zero-Shot Learning
Huang, Siteng
Wei, Qiyao
Wang, Donglin
PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 443 - 451
[4] Disentangling Before Composing: Learning Invariant Disentangled Features for Compositional Zero-Shot Learning
Zhang, Tian
Liang, Kongming
Du, Ruoyi
Chen, Wei
Ma, Zhanyu
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (02) : 1132 - 1147
[5] Learning Graph Embeddings for Open World Compositional Zero-Shot Learning
Mancini, Massimiliano
Naeem, Muhammad Ferjad
Xian, Yongqin
Akata, Zeynep
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1545 - 1560
[6] Swap-Reconstruction Autoencoder for Compositional Zero-Shot Learning
Guo, Ting
Liang, Jiye
Xie, Guo-Sen
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 438 - 443
[7] Knowledge Guided Transformer Network for Compositional Zero-Shot Learning
Panda, Aditya
Prasad, Dipti
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (11)
[8] Dual-Stream Contrastive Learning for Compositional Zero-Shot Recognition
Yang, Yanhua
Pan, Rui
Li, Xiangyu
Yang, Xu
Deng, Cheng
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1909 - 1919
[9] PMGNet: Disentanglement and entanglement benefit mutually for compositional zero-shot learning
Liu, Yu
Li, Jianghao
Zhang, Yanyi
Jia, Qi
Wang, Weimin
Pu, Nan
Sebe, Nicu
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
[10] LVAR-CZSL: Learning Visual Attributes Representation for Compositional Zero-Shot Learning
Ma, Xingjiang
Yang, Jing
Lin, Jiacheng
Zheng, Zhenzhe
Li, Shaobo
Hu, Bingqi
Tang, Xianghong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 13311 - 13323