AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation

被引:108
作者
Mittal, Paritosh [1 ]
Cheng, Yen-Chi [1 ]
Singh, Maneesh [2 ]
Tulsiani, Shubham [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Ver Analytics, Jersey City, NJ USA
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年
关键词
D O I
10.1109/CVPR52688.2022.00040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Powerful priors allow us to perform inference with insufficient information. In this paper, we propose an autoregressive prior for 3D shapes to solve multimodal 3D tasks such as shape completion, reconstruction, and generation. We model the distribution over 3D shapes as a nonsequential autoregressive distribution over a discretized, low-dimensional, symbolic grid-like latent representation of 3D shapes. This enables us to represent distributions over 3D shapes conditioned on information from an arbitrary set of spatially anchored query locations and thus perform shape completion in such arbitrary settings (e.g. generating a complete chair given only a view of the back leg). We also show that the learned autoregressive prior can be leveraged for conditional tasks such as single-view reconstruction and language-based generation. This is achieved by learning task-specific `naive' conditionals which can be approximated by light-weight models trained on minimal paired data. We validate the effectiveness of the proposed method using both quantitative and qualitative evaluation and show that the proposed method outperforms the specialized stateof-the-art methods trained for individual tasks. The project page with code and video visualizations can be found at https://yccyenchicheng.github.io/AutoSDF/.
引用
收藏
页码:306 / 315
页数:10
相关论文
共 49 条
[1]  
Achlioptas P, 2018, PR MACH LEARN RES, V80
[2]   Shapeglot: Learning Language for Shape Differentiation [J].
Achlioptas, Panos ;
Fan, Judy ;
Hawkins, Robert ;
Goodman, Noah ;
Guihas, Leonidas .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8937-8946
[3]  
Brown Tom B, 2020, Advances in Neural Information Processing Systems, V1, P2
[4]  
Chang Angel X., 2015, Tech. Rep. arXiv:1512.03012 [cs.GR], V5, P7
[5]  
Chen Kevin, 2018, ACCV
[6]  
Chen M., 2020, ICML
[7]  
Chen X., 2020, INT C LEARN REPR ICL
[8]  
Chen X, 2018, PR MACH LEARN RES, V80
[9]  
Chibane J, 2020, ADV NEUR IN, V33
[10]   3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction [J].
Choy, Christopher B. ;
Xu, Danfei ;
Gwak, Jun Young ;
Chen, Kevin ;
Savarese, Silvio .
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :628-644