Relationship-Preserving Knowledge Distillation for Zero-Shot Sketch Based Image Retrieval

被引:39
作者
Tian, Jialin [1 ,2 ]
Xu, Xing [1 ,2 ]
Wang, Zheng [1 ,2 ,3 ]
Shen, Fumin [1 ,2 ]
Liu, Xin [4 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Future Media, Chengdu, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Sichuan, Peoples R China
[3] UESTC Guangdong, Inst Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
[4] Huaqiao Univ, Dept Comp Sci, Quanzhou, Peoples R China
来源
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Knowledge Distillation; Sketch-Based Image Retrieval; Zero-shot; Learning;
D O I
10.1145/3474085.3475676
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot sketch-based image retrieval is challenging for the modal gap between distributions of sketches and images and the inconsistency of label spaces during training and testing. Previous methods mitigate the modal gap by projecting sketches and images into a joint embedding space. Most of them also bridge seen and unseen classes by leveraging semantic embeddings, e.g., word vectors and hierarchical similarities. In this paper, we propose RelationshipPreserving Knowledge Distillation (RPKD) to study generalizable embeddings from the perspective of knowledge distillation bypassing the usage of semantic embeddings. In particular, we firstly distill the instance-level knowledge to preserve inter-class relationships without semantic similarities that require extra effort to collect. We also reconcile the contrastive relationships among instances between different embedding spaces, which is complementary to instance-level relationships. Furthermore, embedding-induced supervision, which measures the similarities of an instance to partial class embedding centers from the teacher, is developed to align the student's classification confidences. Extensive experiments conducted on three benchmark ZS-SBIR datasets, i.e., Sketchy, TUBerlin, and QuickDraw, demonstrate the superiority of our proposed RPKD approach comparing to the state-of-the-art methods.
引用
收藏
页码:5473 / 5481
页数:9
相关论文
共 57 条
[1]   Multi-Cue Zero-Shot Learning with Strong Supervision [J].
Akata, Zeynep ;
Malinowski, Mateusz ;
Fritz, Mario ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :59-68
[2]  
Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911
[3]  
[Anonymous], 2015, PROC BRIT MACH VIS C
[4]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.649
[5]  
[Anonymous], 2019, BMVC, DOI DOI 10.1109/TENCON.2019.8929331
[6]  
[Anonymous], 2015, PROC CVPR IEEE
[7]   Rethinking Generative Zero-Shot Learning: An Ensemble Learning Perspective for Recognising Visual Patches [J].
Chen, Zhi ;
Wang, Sen ;
Li, Jingjing ;
Huang, Zi .
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :3413-3421
[8]   Learning a similarity metric discriminatively, with application to face verification [J].
Chopra, S ;
Hadsell, R ;
LeCun, Y .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546
[9]   Doodle to Search: Practical Zero-Shot Sketch-based Image Retrieval [J].
Dey, Sounak ;
Riba, Pau ;
Dutta, Anjan ;
Llados, Josep ;
Song, Yi-Zhe .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2174-2183
[10]   Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval [J].
Dutta, Anjan ;
Akata, Zeynep .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5084-5093