A Decomposable Causal View of Compositional Zero-Shot Learning

被引:7
|
作者
Yang, Muli [1 ]
Xu, Chenghao [1 ]
Wu, Aming [1 ]
Deng, Cheng [1 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Compositional zero-shot learning; vision and language; image recognition; causality;
D O I
10.1109/TMM.2022.3200578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Composing and recognizing novel concepts that are combinations of known concepts, i.e., compositional generalization, is one of the greatest power of human intelligence. With the development of artificial intelligence, it becomes increasingly appealing to build a vision system that can generalize to unknown compositions based on restricted known knowledge, which has so far remained a great challenge to our community. In fact, machines can be easily misled by superficial correlations in the data, disregarding the causal patterns that are crucial to generalization. In this paper, we rethink compositional generalization with a causal perspective, upon the context of Compositional Zero-Shot Learning (CZSL). We develop a simple yet strong approach based on our novel Decomposable Causal view (dubbed "DECA"), by approximating the causal effect with the combination of three easy-to-learn components. Our proposed DECA(1) is evaluated on two challenging CZSL benchmarks by recognizing unknown compositions of known concepts. Despite being simple in the design, our approach achieves consistent improvements over state-of-the-art baselines, demonstrating its superiority towards the goal of compositional generalization.
引用
收藏
页码:5892 / 5902
页数:11
相关论文
共 50 条
  • [21] Correlated dual autoencoder for zero-shot learning
    Jiang, Ming
    Liu, Zhiyong
    Li, Pengfei
    Zhang, Min
    Tang, Jingfan
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2020, 82 (01): : 65 - 76
  • [22] Fusing spatial and frequency features for compositional zero-shot image classification
    Li, Suyi
    Jiang, Chenyi
    Ye, Qiaolin
    Wang, Shidong
    Yang, Wankou
    Zhang, Haofeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [23] Preserving text space integrity for robust compositional zero-shot learning via mixture of pretrained experts
    Hao, Zehua
    Liu, Fang
    Jiao, Licheng
    Du, Yaoyang
    Li, Shuo
    Wang, Hao
    Li, Pengfang
    Liu, Xu
    Chen, Puhua
    NEUROCOMPUTING, 2025, 614
  • [24] AOGN-CZSL: An Attribute- and Object-Guided Network for Compositional Zero-Shot Learning
    Yang, Jing
    Ma, Xingjiang
    Wu, Yuankai
    Li, Chengjiang
    Sue, Zhidong
    Xu, Ji
    Feng, Yixiong
    INFORMATION FUSION, 2025, 120
  • [25] Dual triplet network for image zero-shot learning
    Ji, Zhong
    Wang, Hai
    Pang, Yanwei
    Shao, Ling
    NEUROCOMPUTING, 2020, 373 : 90 - 97
  • [26] Simple Primitives With Feasibility- and Contextuality-Dependence for Open-World Compositional Zero-Shot Learning
    Liu, Zhe
    Li, Yun
    Yao, Lina
    Chang, Xiaojun
    Fang, Wei
    Wu, Xiaojun
    El Saddik, Abdulmotaleb
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (01) : 543 - 560
  • [27] Learning object-centric complementary features for zero-shot learning
    Liu, Jie
    Song, Kechen
    He, Yu
    Dong, Hongwen
    Yan, Yunhui
    Meng, Qinggang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 89
  • [28] Domain-Oriented Semantic Embedding for Zero-Shot Learning
    Min, Shaobo
    Yao, Hantao
    Xie, Hongtao
    Zha, Zheng-Jun
    Zhang, Yongdong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3919 - 3930
  • [29] Learning Multipart Attention Neural Network for Zero-Shot Classification
    Meng, Min
    Wei, Jie
    Wu, Jigang
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 414 - 423
  • [30] Language-Augmented Pixel Embedding for Generalized Zero-Shot Learning
    Wang, Ziyang
    Gou, Yunhao
    Li, Jingjing
    Zhu, Lei
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1019 - 1030