XPNet: Cross-Domain Prototypical Network for Zero-Shot Sketch-Based Image Retrieval

被引:1
|
作者
Li, Mingkang [1 ]
Qi, Yonggang [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
关键词
Cross-domain prototype; Zero-shot; SBIR;
D O I
10.1007/978-3-031-18907-4_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot retrieval is a topical problem for sketch-based image search. It is largely necessitated by the fact that human sketch data is scarce in nature - in most cases retrieval will have to be conducted at zero-shot level. The problem of zero-shot sketch-based image retrieval (ZS-SBIR) is however a much harder task when compared with its photoonly counterpart. In addition to addressing the zero-shot transfer problem, it will also need to tackle the inherent domain gap between sketch and photo. Most existing works on ZS-SBIR typically address these two problems separately: a triplet-like network to address the domain gap, and employing external semantic information (such as word embeddings) to assist category transfer. In this paper, we take a different stance and ask a more difficult question - can we devise a consolidated solution to accommodate both problems simultaneously, especially without the need for additional semantic information. For that, we propose a cross-domain prototype learning framework to narrow the domain gap by encouraging a confirmation of prototypes between two domains. The intuition is there exists an embedding in which points regardless of which domain it comes from, would cluster around a single and shared prototype representation for a given class. We first show that performance comparable with that of state-of-the-art can already be achieved just by doing this alone. We then further propose two means of tackling data efficiency during training: (i) an episode training protocol that enables data feeding by demand, and (ii) a hard triplet generation algorithm to address data scarcity. Extensive experiments on TU-Berlin-Extended, Sketchy-Extended and QuickDraw-Extended validate the usefulness of our approach.
引用
收藏
页码:394 / 410
页数:17
相关论文
共 50 条
  • [31] Sketch-based Image Retrieval Using Cross-domain Modeling and Deep Fusion Network
    Yu D.
    Liu Y.-J.
    Xing M.-M.
    Li Z.-M.
    Li H.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (11): : 3567 - 3577
  • [32] Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval
    Jiao, Shichao
    Han, Xie
    Xiong, Fengguang
    Yang, Xiaowen
    Han, Huiyan
    He, Ligang
    Kuang, Liqun
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16): : 13469 - 13483
  • [33] Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval
    Shichao Jiao
    Xie Han
    Fengguang Xiong
    Xiaowen Yang
    Huiyan Han
    Ligang He
    Liqun Kuang
    Neural Computing and Applications, 2022, 34 : 13469 - 13483
  • [34] Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval
    Dutta, Anjan
    Akata, Zeynep
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5084 - 5093
  • [35] Energy-Guided Feature Fusion for Zero-Shot Sketch-Based Image Retrieval
    Ren, Hao
    Zheng, Ziqiang
    Lu, Hong
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 5711 - 5720
  • [36] Stacked Adversarial Network for Zero-Shot Sketch based Image Retrieval
    Pandey, Anubha
    Mishra, Ashish
    Verma, Vinay Kumar
    Mittal, Anurag
    Murthy, Hema A.
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2529 - 2538
  • [37] OCEAN: A DUAL LEARNING APPROACH FOR GENERALIZED ZERO-SHOT SKETCH-BASED IMAGE RETRIEVAL
    Zhu, Jiawen
    Xu, Xing
    Shen, Fumin
    Lee, Roy Ka-Wei
    Wang, Zheng
    Shen, Heng Tao
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [38] Semantic-Aware Knowledge Preservation for Zero-Shot Sketch-Based Image Retrieval
    Liu, Qing
    Xie, Lingxi
    Wang, Huiyu
    Yuile, Alan L.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3661 - 3670
  • [39] Semi-transductive Learning for Generalized Zero-Shot Sketch-Based Image Retrieval
    Ge, Ce
    Wang, Jingyu
    Qi, Qi
    Sun, Haifeng
    Xu, Tong
    Liao, Jianxin
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7678 - 7686
  • [40] Energy-Guided Feature Fusion for Zero-Shot Sketch-Based Image Retrieval
    Hao Ren
    Ziqiang Zheng
    Hong Lu
    Neural Processing Letters, 2022, 54 : 5711 - 5720