FoldingNet: Point Cloud Auto-encoder via Deep Grid Deformation

被引:920
作者
Yang, Yaoqing [1 ]
Feng, Chen [2 ]
Shen, Yiru [3 ]
Tian, Dong [2 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] MERL, Cambridge, MA USA
[3] Clemson Univ, Clemson, SC 29631 USA
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent deep networks that directly handle points in a point set, e.g., PointNet, have been state-of-the-art for supervised learning tasks on point clouds such as classification and segmentation. In this work, a novel end-to-end deep auto-encoder is proposed to address unsupervised learning challenges on point clouds. On the encoder side, a graph-based enhancement is enforced to promote local structures on top of PointNet. Then, a novel folding-based decoder deforms a canonical 2D grid onto the underlying 3D object surface of a point cloud, achieving low reconstruction errors even for objects with delicate structures. The proposed decoder only uses about 7% parameters of a decoder with fully-connected neural networks, yet leads to a more discriminative representation that achieves higher linear SVM classification accuracy than the benchmark. In addition, the proposed decoder structure is shown, in theory, to be a generic architecture that is able to reconstruct an arbitrary point cloud from a 2D grid. Our code is available at http://www.merl.com/research/license#FoldingNet
引用
收藏
页码:206 / 215
页数:10
相关论文
共 60 条
[1]  
[Anonymous], 2017, ADV NEURAL INFORM PR
[2]  
[Anonymous], ADV NEURAL INFORM PR
[3]  
[Anonymous], 2016, P 30 INT C NEUR INF
[4]  
[Anonymous], 2016, arXiv preprint arXiv:1609.02907
[5]  
[Anonymous], 2016, P ADV NEUR INF PROC
[6]  
[Anonymous], 2017, P IEEE C COMP VIS PA
[7]  
[Anonymous], COMPUTERS GRAPHICS
[8]  
[Anonymous], EUR WORKSH 3D OBJ RE
[9]  
[Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7298801
[10]  
[Anonymous], 2017, P IEEE C COMP VIS PA