CROSS-MODAL GUIDANCE NETWORK FOR SKETCH-BASED 3D SHAPE RETRIEVAL

被引：15

作者：

Dai, Weidong ^{[1
]}

Liang, Shuang ^{[1
]}

机构：

[1] Tongji Univ, Sch Software Engn, Shanghai, Peoples R China

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2020年

基金：

中国国家自然科学基金;

关键词：

sketch; 3D shape retrieval; cross-modal differences; guidance network; feature alignment;

D O I：

10.1109/icme46284.2020.9102925

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The main challenge of sketch-based 3D shape retrieval is the large cross-modal differences between 2D sketches and 3D shapes. Most recent works employed two heterogeneous networks and a shared loss to directly map the features from different modalities to a common feature space, which failed to reduce the cross-modal differences effectively. In this paper, we propose a novel method that adopts a teacher-student strategy to learn an aligned cross-modal feature space indirectly. Specifically, our method first employs a classification network to learn the discriminative features of 3D shapes. Then, the pre-learned features are considered as a teacher to guide the feature learning of 2D sketches. In order to align the cross-modal features, 2D sketch features are transferred to the prelearned 3D feature space. Our experiments on two benchmark datasets demonstrate that our method obtains superior retrieval performance than the state-of-the-art approaches.

引用

页数：6

共 24 条

[1]

[Anonymous], 2006, P 12 ACM SIGKDD IN

[2] Deep Cross-Modality Adaptation via Semantics Preserving Adversarial Learning for Sketch-Based 3D Shape Retrieval [J].

Chen, Jiaxin ;

Fang, Yi .

COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :624-640

[3] Learning a similarity metric discriminatively, with application to face verification [J].

Chopra, S ;

Hadsell, R ;

LeCun, Y .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546

[4]

Dai GX, 2017, AAAI CONF ARTIF INTE, P4002

[5] Deep Correlated Holistic Metric Learning for Sketch-Based 3D Shape Retrieval [J].

Dai, Guoxian ;

Xie, Jin ;

Fang, Yi .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) :3374-3386

[6] Ranking on Cross-Domain Manifold for Sketch-based 3D Model Retrieval [J].

Furuya, Takahiko ;

Ohbuchi, Ryutarou .

2013 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2013, :274-281

[7] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[8] Triplet-Center Loss for Multi-View 3D Object Retrieval [J].

He, Xinwei ;

Zhou, Yang ;

Zhou, Zhichao ;

Bai, Song ;

Bai, Xiang .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1945-1954

[9]

Hinton G., 2015, Distilling the knowledge in a neural network

[10] ImageNet Classification with Deep Convolutional Neural Networks [J].

Krizhevsky, Alex ;

Sutskever, Ilya ;

Hinton, Geoffrey E. .

COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90

← 1 2 3 →