Mesh Convolutional Restricted Boltzmann Machines for Unsupervised Learning of Features With Structure Preservation on 3-D Meshes

被引:48
作者
Han, Zhizhong [1 ]
Liu, Zhenbao [1 ]
Han, Junwei [1 ]
Vong, Chi-Man [2 ]
Bu, Shuhui [1 ]
Chen, Chun Lung Philip [3 ]
机构
[1] Northwestern Polytech Univ, Xian 710072, Peoples R China
[2] Univ Macau, Dept Comp & Informat Sci, Macau 99999, Peoples R China
[3] Univ Macau, Fac Sci & Technol, Macau 99999, Peoples R China
基金
中国国家自然科学基金;
关键词
3-D mesh; Laplace-Beltrami operator; mesh convolutional deep belief networks (MCDBNs); mesh convolutional restricted Boltzmann machines (MCRBMs); 3D SHAPE RETRIEVAL; OBJECT RETRIEVAL; MODEL RETRIEVAL; DEEP; RECOGNITION; DESCRIPTORS;
D O I
10.1109/TNNLS.2016.2582532
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discriminative features of 3-D meshes are significant to many 3-D shape analysis tasks. However, handcrafted descriptors and traditional unsupervised 3-D feature learning methods suffer from several significant weaknesses: 1) the extensive human intervention is involved; 2) the local and global structure information of 3-D meshes cannot be preserved, which is in fact an important source of discriminability; 3) the irregular vertex topology and arbitrary resolution of 3-D meshes do not allow the direct application of the popular deep learning models; 4) the orientation is ambiguous on the mesh surface; and 5) the effect of rigid and nonrigid transformations on 3-D meshes cannot be eliminated. As a remedy, we propose a deep learning model with a novel irregular model structure, called mesh convolutional restricted Boltzmann machines (MCRBMs). MCRBM aims to simultaneously learn structure-preserving local and global features from a novel raw representation, local function energy distribution. In addition, multiple MCRBMs can be stacked into a deeper model, called mesh convolutional deep belief networks (MCDBNs). MCDBN employs a novel local structure preserving convolution (LSPC) strategy to convolve the geometry and the local structure learned by the lower MCRBM to the upper MCRBM. LSPC facilitates resolving the challenging issue of the orientation ambiguity on the mesh surface in MCDBN. Experiments using the proposed MCRBM and MCDBN were conducted on three common aspects: global shape retrieval, partial shape retrieval, and shape correspondence. Results show that the features learned by the proposed methods outperform the other state-of-the-art 3-D shape features.
引用
收藏
页码:2268 / 2281
页数:14
相关论文
共 63 条
[1]   3D articulated object retrieval using a graph-based representation [J].
Agathos, Alexander ;
Pratikakis, Ioannis ;
Papadakis, Panagiotis ;
Perantonis, Stavros ;
Azariadis, Philip ;
Sapidis, Nickolas S. .
VISUAL COMPUTER, 2010, 26 (10) :1301-1319
[2]  
Anguelov Dragomir., 2004, T SAIL 2004 100, P33
[3]  
[Anonymous], 2005, P 2005 ACM S SOL PHY, DOI DOI 10.1145/1060244.1060256
[4]  
[Anonymous], P IEEE SHAP MOD INT
[5]  
[Anonymous], 2011, Proceedings of the 4th Eurographics conference on 3D Object Retrieval, EG 3DOR'11, DOI DOI 10.2312/3DOR/3DOR11/049-056
[6]  
[Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7298801
[7]  
[Anonymous], 2009, ICML
[8]  
[Anonymous], TECH REP
[9]  
[Anonymous], 2012, Advances in Neural Information Processing Systems
[10]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828