Cross-Domain Feature Semantic Calibration for Zero-Shot Sketch-Based Image Retrieval

被引:0
|
作者
He, Xuewan [1 ]
Wang, Jielei [1 ]
Xia, Qianxin [1 ]
Lu, Guoming [1 ]
Tang, Yuan [2 ]
Lu, Hongxia [3 ]
机构
[1] Univ Elect Sci & Technol China, Lab Intelligent Collaborat Comp, Chengdu, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Aeronaut & Astronaut, Chengdu, Peoples R China
[3] China Telecom Co Ltd, BeiJing Branch, Beijing, Peoples R China
关键词
Zero-Shot Sketch-Based Image Retrieval; Self-Attention Vision Transformer;
D O I
10.1109/ICME57554.2024.10687519
中图分类号
学科分类号
摘要
The Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) task seeks to match images with the same semantic essence as a hand-drawn sketch from a vast image repository. Given the stark contrast in information density between simpleline sketches and detailed images, this task encounters two formidable challenges: 1) Network layers focus differently on semantically relevant features across the two domains, and 2) The sparse information in sketches hampers the extraction of meaningful features. In response, we introduce the innovative Cross-Domain Feature Semantic Calibration (CD-FSC) model. This model begins by evaluating semantic correlations between domains and layers through attention map analyses in vision transformers to ensure precise semantic alignment. Subsequently, it harnesses category associations learned from the image domain to bolster semantic learning in the sketch domain. Our extensive comparative experiments across three prevalent ZS-SBIR datasets affirm that our model sets a new benchmark, outperforming current leading methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Cross-Domain Alignment for Zero-Shot Sketch-Based Image Retrieval
    Wang, Xu
    Peng, Dezhong
    Hu, Peng
    Gong, Yunhong
    Chen, Yong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 7024 - 7035
  • [2] XPNet: Cross-Domain Prototypical Network for Zero-Shot Sketch-Based Image Retrieval
    Li, Mingkang
    Qi, Yonggang
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 394 - 410
  • [3] Progressive Cross-Modal Semantic Network for Zero-Shot Sketch-Based Image Retrieval
    Deng, Cheng
    Xu, Xinxun
    Wang, Hao
    Yang, Muli
    Tao, Dacheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8892 - 8902
  • [4] Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval
    Wang, Zhipeng
    Wang, Hao
    Yan, Jiexi
    Wu, Aming
    Deng, Cheng
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1143 - 1149
  • [5] Progressive Domain-Independent Feature Decomposition Network for Zero-Shot Sketch-Based Image Retrieval
    Xu, Xinxun
    Yang, Muli
    Yang, Yanhua
    Wang, Hao
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 984 - 990
  • [6] Generative Model for Zero-Shot Sketch-Based Image Retrieval
    Verma, Vinay Kumar
    Mishra, Aakansha
    Mishra, Ashish
    Rai, Piyush
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 704 - 713
  • [7] Semantic-Aware Knowledge Preservation for Zero-Shot Sketch-Based Image Retrieval
    Liu, Qing
    Xie, Lingxi
    Wang, Huiyu
    Yuile, Alan L.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3661 - 3670
  • [8] An efficient framework for zero-shot sketch-based image retrieval
    Tursun, Osman
    Denman, Simon
    Sridharan, Sridha
    Goan, Ethan
    Fookes, Clinton
    PATTERN RECOGNITION, 2022, 126
  • [9] Triplet Bridge for Zero-Shot Sketch-Based Image Retrieval
    Zheng, Jiahao
    Tang, Yu
    Wu, Dapeng
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [10] Attention map feature fusion network for Zero-Shot Sketch-based Image Retrieval
    Zhao, Honggang
    Liu, Mingyue
    Lin, Yinghua
    Li, Mingyong
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,