Soft Rasterizer: A Differentiable Renderer for Image-based 3D Reasoning

被引：459

作者：

Liu, Shichen ^{[1
,2
]}

Li, Tianye ^{[1
,2
]}

Chen, Weikai ^{[1
]}

Li, Hao ^{[1
,2
,3
]}

机构：

[1] USC Inst Creat Technol, Los Angeles, CA 90015 USA

[2] Univ Southern Calif, Los Angeles, CA 90089 USA

[3] Pinscreen, Redwood City, CA USA

来源：

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年

关键词：

APPEARANCE;

D O I：

10.1109/ICCV.2019.00780

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Rendering bridges the gap between 2D vision and 3D scenes by simulating the physical process of image formation. By inverting such renderer, one can think of a learning approach to infer 3D information from 2D images. However, standard graphics renderers involve a fundamental discretization step called rasterization, which prevents the rendering process to be differentiable, hence able to be learned. Unlike the state-of-the-art differentiable renderers [30, 20], which only approximate the rendering gradient in the back propagation, we propose a truly differentiable rendering framework that is able to (1) directly render colorized mesh using differentiable functions and (2) back-propagate efficient supervision signals to mesh vertices and their attributes from various forms of image representations, including silhouette, shading and color images. The key to our framework is a novel formulation that views rendering as an aggregation function that fuses the probabilistic contributions of all mesh triangles with respect to the rendered pixels. Such formulation enables our framework to flow gradients to the occluded and far-range vertices, which cannot be achieved by the previous state-of-the-arts. We show that by using the proposed renderer, one can achieve significant improvement in 3D unsupervised singleview reconstruction both qualitatively and quantitatively. Experiments also demonstrate that our approach is able to handle the challenging tasks in image-based shape fitting, which remain nontrivial to existing differentiable renderers. Code is available at https://github.com/ShichenLiu/SoftRas.

引用

页码：7707 / 7716

页数：10

共 51 条

[21]

Insafutdinov E., 2018, NIPS, P2802

[22] End-to-end Recovery of Human Shape and Pose [J].

Kanazawa, Angjoo ;

Black, Michael J. ;

Jacobs, David W. ;

Malik, Jitendra .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7122-7131

[23]

Kanazawa Angjoo, 2018, ARXIV180307549

[24] Neural 3D Mesh Renderer [J].

Kato, Hiroharu ;

Ushiku, Yoshitaka ;

Harada, Tatsuya .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3907-3916

[25] PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization [J].

Kendall, Alex ;

Grimes, Matthew ;

Cipolla, Roberto .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2938-2946

[26]

Kingma DP, 2014, ARXIV

[27] 3D-RCNN: Instance-level 3D Object Reconstruction via Render-and-Compare [J].

Kundu, Abhijit ;

Li, Yin ;

Rehg, James M. .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3559-3568

[28] Image-based reconstruction of spatial appearance and geometric detail [J].

Lensch, HPA ;

Kautz, J ;

Goesele, M ;

Heidrich, W ;

Seidel, HP .

ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (02) :234-257

[29] Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields [J].

Liu, Fayao ;

Shen, Chunhua ;

Lin, Guosheng ;

Reid, Ian .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (10) :2024-2039

[30] Joint Face Alignment and 3D Face Reconstruction [J].

Liu, Feng ;

Zeng, Dan ;

Zhao, Qijun ;

Liu, Xiaoming .

COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :545-560

← 1 2 3 4 5 6 →