Cross-Domain Feature Semantic Calibration for Zero-Shot Sketch-Based Image Retrieval

被引：0

作者：

He, Xuewan ^{[1
]}

Wang, Jielei ^{[1
]}

Xia, Qianxin ^{[1
]}

Lu, Guoming ^{[1
]}

Tang, Yuan ^{[2
]}

Lu, Hongxia ^{[3
]}

机构：

[1] Univ Elect Sci & Technol China, Lab Intelligent Collaborat Comp, Chengdu, Peoples R China

[2] Univ Elect Sci & Technol China, Sch Aeronaut & Astronaut, Chengdu, Peoples R China

[3] China Telecom Co Ltd, BeiJing Branch, Beijing, Peoples R China

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年

关键词：

Zero-Shot Sketch-Based Image Retrieval; Self-Attention Vision Transformer;

D O I：

10.1109/ICME57554.2024.10687519

中图分类号：

学科分类号：

摘要：

The Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) task seeks to match images with the same semantic essence as a hand-drawn sketch from a vast image repository. Given the stark contrast in information density between simpleline sketches and detailed images, this task encounters two formidable challenges: 1) Network layers focus differently on semantically relevant features across the two domains, and 2) The sparse information in sketches hampers the extraction of meaningful features. In response, we introduce the innovative Cross-Domain Feature Semantic Calibration (CD-FSC) model. This model begins by evaluating semantic correlations between domains and layers through attention map analyses in vision transformers to ensure precise semantic alignment. Subsequently, it harnesses category associations learned from the image domain to bolster semantic learning in the sketch domain. Our extensive comparative experiments across three prevalent ZS-SBIR datasets affirm that our model sets a new benchmark, outperforming current leading methods.

引用

页数：6

共 50 条

[31] ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image Retrieval
Ren, Hao
Zheng, Ziqiang
Wu, Yang
Lu, Hong
Yang, Yang
Shan, Ying
Yeung, Sai-Kit
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 5022 - 5035
[32] OCEAN: A DUAL LEARNING APPROACH FOR GENERALIZED ZERO-SHOT SKETCH-BASED IMAGE RETRIEVAL
Zhu, Jiawen
Xu, Xing
Shen, Fumin
Lee, Roy Ka-Wei
Wang, Zheng
Shen, Heng Tao
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[33] Semi-transductive Learning for Generalized Zero-Shot Sketch-Based Image Retrieval
Ge, Ce
Wang, Jingyu
Qi, Qi
Sun, Haifeng
Xu, Tong
Liao, Jianxin
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7678 - 7686
[34] Zero-shot sketch-based image retrieval with structure-aware asymmetric disentanglement
Li, Jiangtong
Ling, Zhixin
Niu, Li
Zhang, Liqing
COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 218
[35] Domain-aware double attention network for zero-shot sketch-based image retrieval with similarity loss
Ming Zhu
Chen Zhao
Nian Wang
Feiyang Gu
Yu Liu
Xin Li
The Visual Computer, 2024, 40 : 3091 - 3101
[36] Domain-aware double attention network for zero-shot sketch-based image retrieval with similarity loss
Zhu, Ming
Zhao, Chen
Wang, Nian
Gu, Feiyang
Liu, Yu
Li, Xin
VISUAL COMPUTER, 2024, 40 (05): : 3091 - 3101
[37] A Zero-Shot Framework for Sketch Based Image Retrieval
Yelamarthi, Sasi Kiran
Reddy, Shiva Krishna
Mishra, Ashish
Mittal, Anurag
COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 : 316 - 333
[38] WAD-CMSN: Wasserstein distance-based cross-modal semantic network for zero-shot sketch-based image retrieval
Xu, Guanglong
Hu, Zhensheng
Cai, Jia
INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2023, 21 (02)
[39] StyleGuide: Zero-Shot Sketch-Based Image Retrieval Using Style-Guided Image Generation
Dutta, Titir
Singh, Anurag
Biswas, Soma
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2833 - 2842
[40] Deep sketch feature for cross-domain image retrieval
Wang, Xinggang
Duan, Xiong
Bai, Xiang
NEUROCOMPUTING, 2016, 207 : 387 - 397

← 1 2 3 4 5 →