Learning Category-Specific Mesh Reconstruction from Image Collections

被引:349
作者
Kanazawa, Angjoo [1 ]
Tulsiani, Shubham [1 ]
Efros, Alexei A. [1 ]
Malik, Jitendra [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
COMPUTER VISION - ECCV 2018, PT 15 | 2018年 / 11219卷
关键词
D O I
10.1007/978-3-030-01267-0_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a learning framework for recovering the 3D shape, camera, and texture of an object from a single image. The shape is represented as a deformable 3D mesh model of an object category where a shape is parameterized by a learned mean shape and per-instance predicted deformation. Our approach allows leveraging an annotated image collection for training, where the deformable model and the 3D prediction mechanism are learned without relying on ground-truth 3D or multi-view supervision. Our representation enables us to go beyond existing 3D prediction approaches by incorporating texture inference as prediction of an image in a canonical appearance space. Additionally, we show that semantic keypoints can be easily associated with the predicted shapes. We present qualitative and quantitative results of our approach on CUB and PASCAL3D datasets and show that we can learn to predict diverse shapes and textures across objects using only annotated image collections. The project website can be found at https://akanazawa.github.io/cmr/.
引用
收藏
页码:386 / 402
页数:17
相关论文
共 40 条
[1]   SCAPE: Shape Completion and Animation of People [J].
Anguelov, D ;
Srinivasan, P ;
Koller, D ;
Thrun, S ;
Rodgers, J ;
Davis, J .
ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03) :408-416
[2]  
[Anonymous], 2016, ECCV
[3]  
[Anonymous], 1917, on Growth and Form
[4]  
[Anonymous], 2017, ADV NEURAL INF PROCE
[5]  
[Anonymous], 2017, CVPR
[6]  
[Anonymous], 2017, ICCV
[7]  
[Anonymous], 2018, CVPR
[8]  
[Anonymous], 2004, P 2004 EUR ACM SIGGR
[9]  
[Anonymous], 2018, IEEE C COMP 6 VIS PA
[10]   Hierarchical Surface Prediction for 3D Object Reconstruction [J].
Bane, Christian ;
Tulsiani, Shubham ;
Malik, Jitendra .
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, :412-420