Relationship-Preserving Knowledge Distillation for Zero-Shot Sketch Based Image Retrieval

被引：39

作者：

Tian, Jialin ^{[1
,2
]}

Xu, Xing ^{[1
,2
]}

Wang, Zheng ^{[1
,2
,3
]}

Shen, Fumin ^{[1
,2
]}

Liu, Xin ^{[4
]}

机构：

[1] Univ Elect Sci & Technol China, Ctr Future Media, Chengdu, Sichuan, Peoples R China

[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Sichuan, Peoples R China

[3] UESTC Guangdong, Inst Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China

[4] Huaqiao Univ, Dept Comp Sci, Quanzhou, Peoples R China

来源：

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年

基金：

中国国家自然科学基金; 美国国家科学基金会;

关键词：

Knowledge Distillation; Sketch-Based Image Retrieval; Zero-shot; Learning;

D O I：

10.1145/3474085.3475676

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Zero-shot sketch-based image retrieval is challenging for the modal gap between distributions of sketches and images and the inconsistency of label spaces during training and testing. Previous methods mitigate the modal gap by projecting sketches and images into a joint embedding space. Most of them also bridge seen and unseen classes by leveraging semantic embeddings, e.g., word vectors and hierarchical similarities. In this paper, we propose RelationshipPreserving Knowledge Distillation (RPKD) to study generalizable embeddings from the perspective of knowledge distillation bypassing the usage of semantic embeddings. In particular, we firstly distill the instance-level knowledge to preserve inter-class relationships without semantic similarities that require extra effort to collect. We also reconcile the contrastive relationships among instances between different embedding spaces, which is complementary to instance-level relationships. Furthermore, embedding-induced supervision, which measures the similarities of an instance to partial class embedding centers from the teacher, is developed to align the student's classification confidences. Extensive experiments conducted on three benchmark ZS-SBIR datasets, i.e., Sketchy, TUBerlin, and QuickDraw, demonstrate the superiority of our proposed RPKD approach comparing to the state-of-the-art methods.

引用

页码：5473 / 5481

页数：9

共 57 条

[11]

Eitz M., 2010, IEEE transactions on visualization and computer graphics, V17, P1624, DOI DOI 10.1109/TVCG.2010.266

[12] How Do Humans Sketch Objects? [J].

Eitz, Mathias ;

Hays, James ;

Alexa, Marc .

ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04)

[13] An evaluation of descriptors for large-scale image retrieval from sketched feature lines [J].

Eitz, Mathias ;

Hildebrand, Kristian ;

Boubekeur, Tamy ;

Alexa, Marc .

COMPUTERS & GRAPHICS-UK, 2010, 34 (05) :482-498

[14]

Hinton Geoffrey, 2015, ARXIV15030251

[15] A performance evaluation of gradient field HOG descriptor for sketch based image retrieval [J].

Hu, Rui ;

Collomosse, John .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (07) :790-806

[16]

Jongejan J., 2016, The quick, draw!-ai experiment

[17] Semantic Autoencoder for Zero-Shot Learning [J].

Kodirov, Elyor ;

Xiang, Tao ;

Gong, Shaogang .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4447-4456

[18] Attribute-Based Classification for Zero-Shot Visual Object Categorization [J].

Lampert, Christoph H. ;

Nickisch, Hannes ;

Harmeling, Stefan .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) :453-465

[19]

Lampert CH, 2009, PROC CVPR IEEE, P951, DOI 10.1109/CVPRW.2009.5206594

[20] Alleviating Feature Confusion for Generative Zero-shot Learning [J].

Li, Jingjing ;

Jing, Mengmeng ;

Lu, Ke ;

Zhu, Lei ;

Yang, Yang ;

Huang, Zi .

PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :1587-1595

← 1 2 3 4 5 6 →