Focusing on Valid Search Space in Open-World Compositional Zero-Shot Learning by Leveraging Misleading Answers

被引:0
|
作者
Kim, Soohyeong [1 ]
Lee, Sangjun [1 ]
Choi, Yong Suk [2 ]
机构
[1] Hanyang Univ, Dept Artificial Intelligence, Seoul 04763, South Korea
[2] Hanyang Univ, Dept Comp Sci, Seoul 04763, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
基金
新加坡国家研究基金会;
关键词
Compositional zero-shot learning; open-world recognition; representation learning; vision and language; zero-shot learning;
D O I
10.1109/ACCESS.2024.3491174
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal of Compositional Zero-Shot Learning (CZSL) is to recognize various compositions of state-object pairs. Because the compositions that need to be considered are only a subset of all combinations of states and objects, it is tough for models to predict unseen compositions. Previous work overlooks the problem of predicting non-sensical compositions such as flying dogs. To address this problem, we introduce a novel method for the model to distinguish between target and non-target composition space to avoid predicting absurd compositions. More specifically, in the process of predicting the states and objects, we train the model to increase the similarity with the label that matches the input image while decreasing the similarity with non-matched labels. Our method calculates the logits for the composition labels by combining the similarities of the image-states and the similarities of image-objects respectively. Then, the combined logits and directly computed composition logits are used to minimize the case of the predicting absurd composition. On three well-known datasets such as MIT-States, UT-Zappos, and C-GQA, various experimental results demonstrate our simple and novel approach significantly improves model performances. Code is available at: https://github.com/ToBeSuperior/Annotation-embedding
引用
收藏
页码:165822 / 165830
页数:9
相关论文
共 3 条
  • [1] Learning Graph Embeddings for Open World Compositional Zero-Shot Learning
    Mancini, Massimiliano
    Naeem, Muhammad Ferjad
    Xian, Yongqin
    Akata, Zeynep
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1545 - 1560
  • [2] Simple Primitives With Feasibility- and Contextuality-Dependence for Open-World Compositional Zero-Shot Learning
    Liu, Zhe
    Li, Yun
    Yao, Lina
    Chang, Xiaojun
    Fang, Wei
    Wu, Xiaojun
    El Saddik, Abdulmotaleb
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (01) : 543 - 560
  • [3] Preserving text space integrity for robust compositional zero-shot learning via mixture of pretrained experts
    Hao, Zehua
    Liu, Fang
    Jiao, Licheng
    Du, Yaoyang
    Li, Shuo
    Wang, Hao
    Li, Pengfang
    Liu, Xu
    Chen, Puhua
    NEUROCOMPUTING, 2025, 614