Deep Multimetric Learning for Shape-Based 3D Model Retrieval

被引：37

作者：

Xie, Jin ^{[1
,2
]}

Dai, Guoxian ^{[1
,2
]}

Fang, Yi ^{[1
,2
]}

机构：

[1] NYU, Dept Elect & Comp Engn, Multimedia & Vis Computing Lab, Abu Dhabi 129188, U Arab Emirates

[2] NYU, Tandon Sch Engn, Dept Comp Sci & Engn, New York, NY 10003 USA

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2017年 / 19卷 / 11期

关键词：

3D shape retrieval; 3D shape descriptor; deep neural network; multiple shape features; metric learning; FEATURES; DESCRIPTORS; ROBUST;

D O I：

10.1109/TMM.2017.2698200

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently, feature-learning-based 3D shape retrieval methods have been receiving more and more attention in the 3D shape analysis community. In these methods, the hand-crafted metrics or the learned linear metrics are usually used to compute the distances between shape features. Since there are complex geometric structural variations with 3D shapes, the single hand-crafted metric or learned linear metric cannot characterize the manifold, where 3D shapes lie well. In this paper, by exploring the nonlinearity of the deep neural network and the complementarity among multiple shape features, we propose a novel deep multimetric network for 3D shape retrieval. The developed multimetric network minimizes a discriminative loss function that, for each type of shape feature, the outputs of the network from the same class are encouraged to be as similar as possible and the outputs from different classes are encouraged to be as dissimilar as possible. Meanwhile, the Hilbert-Schmidt independence criterion is employed to enforce the outputs of different types of shape features to be as complementary as possible. Furthermore, the weights of the learned multiple distance metrics can be adaptively determined in our developed deep metric network. The weighted distance metric is then used as the similarity for shape retrieval. We conduct experiments with the proposed method on the four benchmark shape datasets. Experimental results demonstrate that the proposed method can obtain better performance than the learned deep single metric and outperform the state-of-the-art 3D shape retrieval methods.

引用

页码：2463 / 2474

页数：12

共 52 条

[1]

Agathos A., 2009, 3DOR, P29

[2]

[Anonymous], EUR WORKSH 3D OBJ RE, DOI DOI 10.2312/3DOR/3DOR08/009-016

[3]

[Anonymous], 2014, P EG3DOR 2014

[4]

[Anonymous], 2009, P ACM INT C IM VID R, DOI DOI 10.1145/1646396.1646430

[5] A Bayesian 3-D search engine using adaptive views clustering [J].

Ansary, Tarik Filali ;

Daoudi, Mohamed ;

Vandeborre, Jean-Philippe .

IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (01) :78-88

[6] Content-based retrieval of 3-D objects using Spin Image Signatures [J].

Assfalg, Juergen ;

Bertini, Marco ;

Del Bimbo, Alberto ;

Pala, Pietro .

IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (03) :589-599

[7]

Aubry M, 2011, IEEE I CONF COMP VIS, P1411, DOI 10.1109/ICCV.2011.6126396

[8] GIFT: A Real-time and Scalable 3D Shape Search Engine [J].

Bai, Song ;

Bai, Xiang ;

Zhou, Zhichao ;

Zhang, Zhaoxiang ;

Latecki, Longin Jan .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5023-5032

[9] Neural shape codes for 3D model retrieval [J].

Bai, Song ;

Bai, Xiang ;

Liu, Wenyu ;

Roli, Fabio .

PATTERN RECOGNITION LETTERS, 2015, 65 :15-21

[10] 3D Shape Matching via Two Layer Coding [J].

Bai, Xiang ;

Bai, Song ;

Zhu, Zhuotun ;

Latecki, Longin Jan .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (12) :2361-2373

← 1 2 3 4 5 6 →