A Decomposable Causal View of Compositional Zero-Shot Learning

被引:7
|
作者
Yang, Muli [1 ]
Xu, Chenghao [1 ]
Wu, Aming [1 ]
Deng, Cheng [1 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Compositional zero-shot learning; vision and language; image recognition; causality;
D O I
10.1109/TMM.2022.3200578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Composing and recognizing novel concepts that are combinations of known concepts, i.e., compositional generalization, is one of the greatest power of human intelligence. With the development of artificial intelligence, it becomes increasingly appealing to build a vision system that can generalize to unknown compositions based on restricted known knowledge, which has so far remained a great challenge to our community. In fact, machines can be easily misled by superficial correlations in the data, disregarding the causal patterns that are crucial to generalization. In this paper, we rethink compositional generalization with a causal perspective, upon the context of Compositional Zero-Shot Learning (CZSL). We develop a simple yet strong approach based on our novel Decomposable Causal view (dubbed "DECA"), by approximating the causal effect with the combination of three easy-to-learn components. Our proposed DECA(1) is evaluated on two challenging CZSL benchmarks by recognizing unknown compositions of known concepts. Despite being simple in the design, our approach achieves consistent improvements over state-of-the-art baselines, demonstrating its superiority towards the goal of compositional generalization.
引用
收藏
页码:5892 / 5902
页数:11
相关论文
共 50 条
  • [1] Adaptive Fusion Learning for Compositional Zero-Shot Recognition
    Min, Lingtong
    Fan, Ziman
    Wang, Shunzhou
    Dou, Feiyang
    Li, Xin
    Wang, Binglu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1193 - 1204
  • [2] Learning Invariant Visual Representations for Compositional Zero-Shot Learning
    Zhang, Tian
    Liang, Kongming
    Du, Ruoyi
    Sun, Xian
    Ma, Zhanyu
    Guo, Jun
    COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 339 - 355
  • [3] Reference-Limited Compositional Zero-Shot Learning
    Huang, Siteng
    Wei, Qiyao
    Wang, Donglin
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 443 - 451
  • [4] Disentangling Before Composing: Learning Invariant Disentangled Features for Compositional Zero-Shot Learning
    Zhang, Tian
    Liang, Kongming
    Du, Ruoyi
    Chen, Wei
    Ma, Zhanyu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (02) : 1132 - 1147
  • [5] Learning Graph Embeddings for Open World Compositional Zero-Shot Learning
    Mancini, Massimiliano
    Naeem, Muhammad Ferjad
    Xian, Yongqin
    Akata, Zeynep
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1545 - 1560
  • [6] Swap-Reconstruction Autoencoder for Compositional Zero-Shot Learning
    Guo, Ting
    Liang, Jiye
    Xie, Guo-Sen
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 438 - 443
  • [7] Knowledge Guided Transformer Network for Compositional Zero-Shot Learning
    Panda, Aditya
    Prasad, Dipti
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (11)
  • [8] Dual-Stream Contrastive Learning for Compositional Zero-Shot Recognition
    Yang, Yanhua
    Pan, Rui
    Li, Xiangyu
    Yang, Xu
    Deng, Cheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1909 - 1919
  • [9] PMGNet: Disentanglement and entanglement benefit mutually for compositional zero-shot learning
    Liu, Yu
    Li, Jianghao
    Zhang, Yanyi
    Jia, Qi
    Wang, Weimin
    Pu, Nan
    Sebe, Nicu
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
  • [10] LVAR-CZSL: Learning Visual Attributes Representation for Compositional Zero-Shot Learning
    Ma, Xingjiang
    Yang, Jing
    Lin, Jiacheng
    Zheng, Zhenzhe
    Li, Shaobo
    Hu, Bingqi
    Tang, Xianghong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 13311 - 13323