Cross-Domain Feature Semantic Calibration for Zero-Shot Sketch-Based Image Retrieval

被引:0
|
作者
He, Xuewan [1 ]
Wang, Jielei [1 ]
Xia, Qianxin [1 ]
Lu, Guoming [1 ]
Tang, Yuan [2 ]
Lu, Hongxia [3 ]
机构
[1] Univ Elect Sci & Technol China, Lab Intelligent Collaborat Comp, Chengdu, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Aeronaut & Astronaut, Chengdu, Peoples R China
[3] China Telecom Co Ltd, BeiJing Branch, Beijing, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年
关键词
Zero-Shot Sketch-Based Image Retrieval; Self-Attention Vision Transformer;
D O I
10.1109/ICME57554.2024.10687519
中图分类号
学科分类号
摘要
The Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) task seeks to match images with the same semantic essence as a hand-drawn sketch from a vast image repository. Given the stark contrast in information density between simpleline sketches and detailed images, this task encounters two formidable challenges: 1) Network layers focus differently on semantically relevant features across the two domains, and 2) The sparse information in sketches hampers the extraction of meaningful features. In response, we introduce the innovative Cross-Domain Feature Semantic Calibration (CD-FSC) model. This model begins by evaluating semantic correlations between domains and layers through attention map analyses in vision transformers to ensure precise semantic alignment. Subsequently, it harnesses category associations learned from the image domain to bolster semantic learning in the sketch domain. Our extensive comparative experiments across three prevalent ZS-SBIR datasets affirm that our model sets a new benchmark, outperforming current leading methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image Retrieval
    Ren, Hao
    Zheng, Ziqiang
    Wu, Yang
    Lu, Hong
    Yang, Yang
    Shan, Ying
    Yeung, Sai-Kit
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 5022 - 5035
  • [32] OCEAN: A DUAL LEARNING APPROACH FOR GENERALIZED ZERO-SHOT SKETCH-BASED IMAGE RETRIEVAL
    Zhu, Jiawen
    Xu, Xing
    Shen, Fumin
    Lee, Roy Ka-Wei
    Wang, Zheng
    Shen, Heng Tao
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [33] Semi-transductive Learning for Generalized Zero-Shot Sketch-Based Image Retrieval
    Ge, Ce
    Wang, Jingyu
    Qi, Qi
    Sun, Haifeng
    Xu, Tong
    Liao, Jianxin
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7678 - 7686
  • [34] Zero-shot sketch-based image retrieval with structure-aware asymmetric disentanglement
    Li, Jiangtong
    Ling, Zhixin
    Niu, Li
    Zhang, Liqing
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 218
  • [35] Domain-aware double attention network for zero-shot sketch-based image retrieval with similarity loss
    Ming Zhu
    Chen Zhao
    Nian Wang
    Feiyang Gu
    Yu Liu
    Xin Li
    The Visual Computer, 2024, 40 : 3091 - 3101
  • [36] Domain-aware double attention network for zero-shot sketch-based image retrieval with similarity loss
    Zhu, Ming
    Zhao, Chen
    Wang, Nian
    Gu, Feiyang
    Liu, Yu
    Li, Xin
    VISUAL COMPUTER, 2024, 40 (05): : 3091 - 3101
  • [37] A Zero-Shot Framework for Sketch Based Image Retrieval
    Yelamarthi, Sasi Kiran
    Reddy, Shiva Krishna
    Mishra, Ashish
    Mittal, Anurag
    COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 : 316 - 333
  • [38] WAD-CMSN: Wasserstein distance-based cross-modal semantic network for zero-shot sketch-based image retrieval
    Xu, Guanglong
    Hu, Zhensheng
    Cai, Jia
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2023, 21 (02)
  • [39] StyleGuide: Zero-Shot Sketch-Based Image Retrieval Using Style-Guided Image Generation
    Dutta, Titir
    Singh, Anurag
    Biswas, Soma
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2833 - 2842
  • [40] Deep sketch feature for cross-domain image retrieval
    Wang, Xinggang
    Duan, Xiong
    Bai, Xiang
    NEUROCOMPUTING, 2016, 207 : 387 - 397