CROSS-MODAL GUIDANCE NETWORK FOR SKETCH-BASED 3D SHAPE RETRIEVAL

被引:15
作者
Dai, Weidong [1 ]
Liang, Shuang [1 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2020年
基金
中国国家自然科学基金;
关键词
sketch; 3D shape retrieval; cross-modal differences; guidance network; feature alignment;
D O I
10.1109/icme46284.2020.9102925
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The main challenge of sketch-based 3D shape retrieval is the large cross-modal differences between 2D sketches and 3D shapes. Most recent works employed two heterogeneous networks and a shared loss to directly map the features from different modalities to a common feature space, which failed to reduce the cross-modal differences effectively. In this paper, we propose a novel method that adopts a teacher-student strategy to learn an aligned cross-modal feature space indirectly. Specifically, our method first employs a classification network to learn the discriminative features of 3D shapes. Then, the pre-learned features are considered as a teacher to guide the feature learning of 2D sketches. In order to align the cross-modal features, 2D sketch features are transferred to the prelearned 3D feature space. Our experiments on two benchmark datasets demonstrate that our method obtains superior retrieval performance than the state-of-the-art approaches.
引用
收藏
页数:6
相关论文
共 24 条
[1]  
[Anonymous], 2006, P 12 ACM SIGKDD IN
[2]   Deep Cross-Modality Adaptation via Semantics Preserving Adversarial Learning for Sketch-Based 3D Shape Retrieval [J].
Chen, Jiaxin ;
Fang, Yi .
COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :624-640
[3]   Learning a similarity metric discriminatively, with application to face verification [J].
Chopra, S ;
Hadsell, R ;
LeCun, Y .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546
[4]  
Dai GX, 2017, AAAI CONF ARTIF INTE, P4002
[5]   Deep Correlated Holistic Metric Learning for Sketch-Based 3D Shape Retrieval [J].
Dai, Guoxian ;
Xie, Jin ;
Fang, Yi .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) :3374-3386
[6]   Ranking on Cross-Domain Manifold for Sketch-based 3D Model Retrieval [J].
Furuya, Takahiko ;
Ohbuchi, Ryutarou .
2013 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2013, :274-281
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]   Triplet-Center Loss for Multi-View 3D Object Retrieval [J].
He, Xinwei ;
Zhou, Yang ;
Zhou, Zhichao ;
Bai, Song ;
Bai, Xiang .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1945-1954
[9]  
Hinton G., 2015, Distilling the knowledge in a neural network
[10]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90