Multi-Angle Point Cloud-VAE: Unsupervised Feature Learning for 3D Point Clouds From Multiple Angles by Joint Self-Reconstruction and Half-to-Half Prediction

被引:91
作者
Han, Zhizhong [1 ,3 ]
Wang, Xiyang [1 ,2 ]
Liu, Yu-Shen [1 ,2 ]
Zwicker, Matthias [3 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing, Peoples R China
[3] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
DISTANCE; NETWORK;
D O I
10.1109/ICCV.2019.01054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised feature learning for point clouds has been vital for large-scale point cloud understanding. Recent deep learning based methods depend on learning global geometry from self-reconstruction. However, these methods are still suffering from ineffective learning of local geometry, which significantly limits the discriminability of learned features. To resolve this issue, we propose MAP-VAE to enable the learning of global and local geometry by jointly leveraging global and local self-supervision. To enable effective local self-supervision, we introduce multi-angle analysis for point clouds. In a multi-angle scenario, we first split a point cloud into a front half and a back half from each angle, and then, train MAP-VAE to learn to predict a back half sequence from the corresponding front half sequence. MAP-VAE performs this half-to-half prediction using RNN to simultaneously learn each local geometry and the spatial relationship among them. In addition, MAP-VAE also learns global geometry via self-reconstruction, where we employ a variational constraint to facilitate novel shape generation. The outperforming results in four shape analysis tasks show that MAP-VAE can learn more discriminative global or local features than the state-of-the-art methods.
引用
收藏
页码:10441 / 10450
页数:10
相关论文
共 40 条
[1]  
Achlioptas P, 2018, PR MACH LEARN RES, V80
[2]  
[Anonymous], 2015, P IEEE C COMPUTER VI, DOI 10.1109/CVPR.2015.7298801
[3]   The Heat Method for Distance Computation [J].
Crane, Keenan ;
Weischedel, Clarisse ;
Wardetzky, Max .
COMMUNICATIONS OF THE ACM, 2017, 60 (11) :90-99
[4]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[5]   ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes [J].
Dai, Angela ;
Chang, Angel X. ;
Savva, Manolis ;
Halber, Maciej ;
Funkhouser, Thomas ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2432-2443
[6]   Accelerated exhaustive eye glints localization method for infrared video oculography [J].
Ding, Zihan ;
Luo, Jiayi ;
Deng, Hongping .
33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, :620-627
[7]   A Point Set Generation Network for 3D Object Reconstruction from a Single Image [J].
Fan, Haoqiang ;
Su, Hao ;
Guibas, Leonidas .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2463-2471
[8]   Multiresolution Tree Networks for 3D Point Cloud Processing [J].
Gadelha, Matheus ;
Wang, Rui ;
Maji, Subhransu .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :105-122
[9]   Learning a Predictable and Generative Vector Representation for Objects [J].
Girdhar, Rohit ;
Fouhey, David F. ;
Rodriguez, Mikel ;
Gupta, Abhinav .
COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 :484-499
[10]   Generative Adversarial Networks [J].
Goodfellow, Ian ;
Pouget-Abadie, Jean ;
Mirza, Mehdi ;
Xu, Bing ;
Warde-Farley, David ;
Ozair, Sherjil ;
Courville, Aaron ;
Bengio, Yoshua .
COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144