GSIR: Generalizable 3D Shape Interpretation and Reconstruction

被引:9
作者
Wang, Jianren [1 ]
Fang, Zhaoyuan [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
来源
COMPUTER VISION - ECCV 2020, PT XIII | 2020年 / 12358卷
关键词
Shape interpretation; 3D reconstruction;
D O I
10.1007/978-3-030-58601-0_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D shape interpretation and reconstruction are closely related to each other but have long been studied separately and often end up with priors that are highly biased towards the training classes. In this paper, we present an algorithm, Generalizable 3D Shape Interpretation and Reconstruction (GSIR), designed to jointly learn these two tasks to capture generic, class-agnostic shape priors for a better understanding of 3D geometry. We propose to recover 3D shape structures as cuboids from partial reconstruction and use the predicted structures to further guide full 3D reconstruction. The unified framework is trained simultaneously offline to learn a generic notion and can be fine-tuned online for specific objects without any annotations. Extensive experiments on both synthetic and real data demonstrate that introducing 3D shape interpretation improves the performance of single image 3D reconstruction and vice versa, achieving the state-of-the-art performance on both tasks for objects in both seen and unseen categories.
引用
收藏
页码:498 / 514
页数:17
相关论文
共 65 条
[1]  
Akhter I, 2015, PROC CVPR IEEE, P1446, DOI 10.1109/CVPR.2015.7298751
[2]   Structure-Aware Shape Synthesis [J].
Balashova, Elena ;
Singh, Vivek ;
Wang, Jiangping ;
Teixeira, Brian ;
Chen, Terrence ;
Funkhouser, Thomas .
2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, :140-149
[3]  
Barrow H. G., 1977, P IM UND WORKSH SCI, P659
[4]   RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING [J].
BIEDERMAN, I .
PSYCHOLOGICAL REVIEW, 1987, 94 (02) :115-147
[5]   Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image [J].
Bogo, Federica ;
Kanazawa, Angjoo ;
Lassner, Christoph ;
Gehler, Peter ;
Romero, Javier ;
Black, Michael J. .
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :561-578
[6]  
Chan CRLE, 2019, Arxiv, DOI arXiv:1808.07371
[7]   Binocular shape constancy from novel views: The role of a priori constraints [J].
Chan, Moses W. ;
Stevenson, Adam K. ;
Li, Yunfeng ;
Pizlo, Zygmunt .
PERCEPTION & PSYCHOPHYSICS, 2006, 68 (07) :1124-1139
[8]   Probabilistic Reasoning for Assembly-Based 3D Modeling [J].
Chaudhuri, Siddhartha ;
Kalogerakis, Evangelos ;
Guibas, Leonidas ;
Koltun, Vladlen .
ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (04)
[9]  
Chen WF, 2016, ADV NEUR IN, V29
[10]   Learning Implicit Fields for Generative Shape Modeling [J].
Chen, Zhiqin ;
Zhang, Hao .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5932-5941