A Decomposable Causal View of Compositional Zero-Shot Learning

Cited by: 7
Authors
Yang, Muli [1 ]
Xu, Chenghao [1 ]
Wu, Aming [1 ]
Deng, Cheng [1 ]
Affiliations
[1] Xidian University, School of Electronic Engineering, Xi'an 710071, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Compositional zero-shot learning; vision and language; image recognition; causality
DOI
10.1109/TMM.2022.3200578
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Composing and recognizing novel concepts that are combinations of known concepts, i.e., compositional generalization, is one of the greatest powers of human intelligence. With the development of artificial intelligence, it becomes increasingly appealing to build a vision system that can generalize to unknown compositions from limited known knowledge, which has so far remained a great challenge to our community. In fact, machines can be easily misled by superficial correlations in the data, disregarding the causal patterns that are crucial to generalization. In this paper, we rethink compositional generalization from a causal perspective, in the context of Compositional Zero-Shot Learning (CZSL). We develop a simple yet strong approach based on our novel Decomposable Causal view (dubbed "DECA"), approximating the causal effect with a combination of three easy-to-learn components. Our proposed DECA is evaluated on two challenging CZSL benchmarks by recognizing unknown compositions of known concepts. Despite its simple design, our approach achieves consistent improvements over state-of-the-art baselines, demonstrating its effectiveness for compositional generalization.
Pages: 5892-5902
Page count: 11