Good Teacher Makes Good Student: A Discriminative-Aware Knowledge Preservation Approach for Zero-Shot Sketch-Based Image Retrieval

被引:0
作者
Zhao, Haifeng [1 ,2 ,3 ]
Wu, Tianjian [1 ,3 ,4 ]
Tao, Yuting [1 ,3 ]
Zhang, Yan [1 ,3 ]
机构
[1] Jinling Inst Technol, Sch Software Engn, Nanjing, Peoples R China
[2] Jiangsu Hoperun Software Co Ltd, Nanjing, Peoples R China
[3] Informat Anal Engn Res Ctr Jiangsu Prov, Nanjing, Peoples R China
[4] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing, Peoples R China
关键词
Sketch-Based Image Retrieval (SBIR); zero-shot; knowledge preservation;
D O I
10.12720/jait.15.3.364-371
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sketch-Based Image Retrieval (SBIR) is widely used in animation, e-commerce, and security. In these real-world applications, the classes of retrieval may be very different from the training classes, making it a zero-shot SBIR problem. Most methods in the literature resort to bridging the semantic gap between the sketch and image domains by learning a common space with a pre-trained model on a large dataset as the base network, and then finetuning on the target SBIR datasets. In this process, the acquired knowledge of the pre-trained model may be lost, resulting in performance degradation. To tackle this problem, we propose a teacher-student network architecture, which consists of a teacher network using the pre-trained model and a student network whose output is guided by the teacher network. Instead of introducing supplementary semantics in the teacher network, we adopt a more powerful pre-trained model as the teacher network and further enhance its discriminative capability by adding a hard-coded margin based on the prediction probability. The student network is then fine-tuned by using the teacher network's output as the learning target. Experiments on two benchmark datasets show that the proposed approach outperforms the state-of-the-art methods by more than 10%, which verifies that the prior knowledge can be better preserved by a good teacher network, which can make the student network good too.
引用
收藏
页码:364 / 371
页数:8
相关论文
共 36 条
[1]   Towards Discriminability and Diversity: Batch Nuclear-norm Maximization under Label Insufficient Situations [J].
Cui, Shuhao ;
Wang, Shuhui ;
Zhuo, Junbao ;
Li, Liang ;
Huang, Qingming ;
Tian, Qi .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :3940-3949
[2]   Progressive Cross-Modal Semantic Network for Zero-Shot Sketch-Based Image Retrieval [J].
Deng, Cheng ;
Xu, Xinxun ;
Wang, Hao ;
Yang, Muli ;
Tao, Dacheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :8892-8902
[3]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[4]  
Dey S., 2019, PROC IEEE C COMPUTER
[5]   Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval [J].
Dutta, Anjan ;
Akata, Zeynep .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5084-5093
[6]  
Dutta T., 2020, IEEE Transactions on Multimedia
[7]   Adaptive Margin Diversity Regularizer for Handling Data Imbalance in Zero-Shot SBIR [J].
Dutta, Titir ;
Singh, Anurag ;
Biswas, Soma .
COMPUTER VISION - ECCV 2020, PT V, 2020, 12350 :349-364
[8]   How Do Humans Sketch Objects? [J].
Eitz, Mathias ;
Hays, James ;
Alexa, Marc .
ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04)
[9]  
Fellbaum C., 1998, WORDNET ELECT LEXICA, DOI DOI 10.7551/MITPRESS/7287.001.0001
[10]  
French R.M., 2006, Catastrophic Forgetting in Connectionist Networks