XPNet: Cross-Domain Prototypical Network for Zero-Shot Sketch-Based Image Retrieval

被引:1
|
作者
Li, Mingkang [1 ]
Qi, Yonggang [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
关键词
Cross-domain prototype; Zero-shot; SBIR;
D O I
10.1007/978-3-031-18907-4_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot retrieval is a topical problem for sketch-based image search. It is largely necessitated by the fact that human sketch data is scarce in nature - in most cases retrieval will have to be conducted at zero-shot level. The problem of zero-shot sketch-based image retrieval (ZS-SBIR) is however a much harder task when compared with its photoonly counterpart. In addition to addressing the zero-shot transfer problem, it will also need to tackle the inherent domain gap between sketch and photo. Most existing works on ZS-SBIR typically address these two problems separately: a triplet-like network to address the domain gap, and employing external semantic information (such as word embeddings) to assist category transfer. In this paper, we take a different stance and ask a more difficult question - can we devise a consolidated solution to accommodate both problems simultaneously, especially without the need for additional semantic information. For that, we propose a cross-domain prototype learning framework to narrow the domain gap by encouraging a confirmation of prototypes between two domains. The intuition is there exists an embedding in which points regardless of which domain it comes from, would cluster around a single and shared prototype representation for a given class. We first show that performance comparable with that of state-of-the-art can already be achieved just by doing this alone. We then further propose two means of tackling data efficiency during training: (i) an episode training protocol that enables data feeding by demand, and (ii) a hard triplet generation algorithm to address data scarcity. Extensive experiments on TU-Berlin-Extended, Sketchy-Extended and QuickDraw-Extended validate the usefulness of our approach.
引用
收藏
页码:394 / 410
页数:17
相关论文
共 50 条
  • [1] Cross-Domain Alignment for Zero-Shot Sketch-Based Image Retrieval
    Wang, Xu
    Peng, Dezhong
    Hu, Peng
    Gong, Yunhong
    Chen, Yong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 7024 - 7035
  • [2] Cross-Domain Feature Semantic Calibration for Zero-Shot Sketch-Based Image Retrieval
    He, Xuewan
    Wang, Jielei
    Xia, Qianxin
    Lu, Guoming
    Tang, Yuan
    Lu, Hongxia
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
  • [3] Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval
    Wang, Zhipeng
    Wang, Hao
    Yan, Jiexi
    Wu, Aming
    Deng, Cheng
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1143 - 1149
  • [4] Transferable Coupled Network for Zero-Shot Sketch-Based Image Retrieval
    Wang, Hao
    Deng, Cheng
    Liu, Tongliang
    Tao, Dacheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 9181 - 9194
  • [5] Contour detection network for zero-shot sketch-based image retrieval
    Zhang, Qing
    Zhang, Jing
    Su, Xiangdong
    Bao, Feilong
    Gao, Guanglai
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (06) : 6781 - 6795
  • [6] Contour detection network for zero-shot sketch-based image retrieval
    Qing Zhang
    Jing Zhang
    Xiangdong Su
    Feilong Bao
    Guanglai Gao
    Complex & Intelligent Systems, 2023, 9 : 6781 - 6795
  • [7] Progressive Cross-Modal Semantic Network for Zero-Shot Sketch-Based Image Retrieval
    Deng, Cheng
    Xu, Xinxun
    Wang, Hao
    Yang, Muli
    Tao, Dacheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8892 - 8902
  • [8] Zero-Shot Sketch-Based Image Retrieval via Graph Convolution Network
    Zhang, Zhaolong
    Zhang, Yuejie
    Feng, Rui
    Zhang, Tao
    Fan, Weiguo
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12943 - 12950
  • [9] Generative Model for Zero-Shot Sketch-Based Image Retrieval
    Verma, Vinay Kumar
    Mishra, Aakansha
    Mishra, Ashish
    Rai, Piyush
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 704 - 713
  • [10] An efficient framework for zero-shot sketch-based image retrieval
    Tursun, Osman
    Denman, Simon
    Sridharan, Sridha
    Goan, Ethan
    Fookes, Clinton
    PATTERN RECOGNITION, 2022, 126