Isolating Features of Object and Its State for Compositional Zero-Shot Learning

被引:7
|
作者
Panda, Aditya [1 ]
Santra, Bikash [1 ]
Mukherjee, Dipti Prasad [1 ]
机构
[1] Indian Stat Inst, Elect & Commun Sci Unit, Kolkata 700108, West Bengal, India
来源
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2023年 / 7卷 / 05期
关键词
CZSL; composition; disentanglement; state-object composition; sequential learning;
D O I
10.1109/TETCI.2022.3232816
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of Compositional Zero Shot Learning (CZSL) is to recognize previously unseen compositions of known objects (e.g. apple, banana) and their states (e.g. ripe, unripe) as seen in an image. The CZSL is a challenging problem as it is difficult to isolate the visual features of object and its states from its compositions in images. The features of a state may have wide variation across different compositions. For example, the state sliced has different visual features in compositions sliced apple and sliced tomato. In this paper, we attempt to solve the problem of CZSL using a two-stage recognition approach. Each stage sequentially performs recognition task utilising two distinct modalities of compositions. The modalities are image features as well as textual features representing features of objects and states respectively. We propose a novel gradient regularized loss term for better disentanglement of object and state features from the visual features of the composition. An appropriate disentanglement of the features of visual primitives (states and objects) leads to correct identification of images of unseen state-object compositions. The proposed approach and the competing methods are evaluated on three benchmark datasets, MIT States, UT-Zappos50 k and CGQA. Our extensive experiments establish the efficacy of our proposed algorithm that outperforms other state-of-the-art approaches.
引用
收藏
页码:1571 / 1583
页数:13
相关论文
共 5 条
  • [1] Fine-Grained Attribute-Object Feature Representation in Compositional Zero-Shot Learning
    Shabbir, Nazir
    Rout, Ranjeet Kr.
    Umer, Saiyed
    Mohanta, Partha Pratim
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2023, 2023, 14301 : 157 - 165
  • [2] Knowledge Guided Transformer Network for Compositional Zero-Shot Learning
    Panda, Aditya
    Prasad, Dipti
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (11)
  • [3] On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning
    Anwaar, Muhammad Umer
    Pan, Zhihui
    Kleinsteuber, Martin
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4645 - 4654
  • [4] Compositional Zero-Shot Learning using Multi-Branch Graph Convolution and Cross-layer Knowledge Sharing
    Panda, Aditya
    Mukherjee, Dipti Prasad
    PATTERN RECOGNITION, 2024, 145
  • [5] Pseudo Sample Synthesis based on Task-irrelevant Augmentation for Generalized Zero-shot Learning
    Feng, Yaogong
    Huang, Xiaowen
    Jiang, Feng
    Sun, Yangyang
    Sang, Jitao
    2024 10TH INTERNATIONAL CONFERENCE ON BIG DATA AND INFORMATION ANALYTICS, BIGDIA 2024, 2024, : 295 - 302