Soft Rasterizer: A Differentiable Renderer for Image-based 3D Reasoning

被引:401
作者
Liu, Shichen [1 ,2 ]
Li, Tianye [1 ,2 ]
Chen, Weikai [1 ]
Li, Hao [1 ,2 ,3 ]
机构
[1] USC Inst Creat Technol, Los Angeles, CA 90015 USA
[2] Univ Southern Calif, Los Angeles, CA 90089 USA
[3] Pinscreen, Redwood City, CA USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
APPEARANCE;
D O I
10.1109/ICCV.2019.00780
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rendering bridges the gap between 2D vision and 3D scenes by simulating the physical process of image formation. By inverting such renderer, one can think of a learning approach to infer 3D information from 2D images. However, standard graphics renderers involve a fundamental discretization step called rasterization, which prevents the rendering process to be differentiable, hence able to be learned. Unlike the state-of-the-art differentiable renderers [30, 20], which only approximate the rendering gradient in the back propagation, we propose a truly differentiable rendering framework that is able to (1) directly render colorized mesh using differentiable functions and (2) back-propagate efficient supervision signals to mesh vertices and their attributes from various forms of image representations, including silhouette, shading and color images. The key to our framework is a novel formulation that views rendering as an aggregation function that fuses the probabilistic contributions of all mesh triangles with respect to the rendered pixels. Such formulation enables our framework to flow gradients to the occluded and far-range vertices, which cannot be achieved by the previous state-of-the-arts. We show that by using the proposed renderer, one can achieve significant improvement in 3D unsupervised singleview reconstruction both qualitatively and quantitatively. Experiments also demonstrate that our approach is able to handle the challenging tasks in image-based shape fitting, which remain nontrivial to existing differentiable renderers. Code is available at https://github.com/ShichenLiu/SoftRas.
引用
收藏
页码:7707 / 7716
页数:10
相关论文
共 51 条
  • [21] Insafutdinov E., 2018, NIPS, P2802
  • [22] End-to-end Recovery of Human Shape and Pose
    Kanazawa, Angjoo
    Black, Michael J.
    Jacobs, David W.
    Malik, Jitendra
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7122 - 7131
  • [23] Kanazawa Angjoo, 2018, ARXIV180307549
  • [24] Neural 3D Mesh Renderer
    Kato, Hiroharu
    Ushiku, Yoshitaka
    Harada, Tatsuya
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3907 - 3916
  • [25] PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization
    Kendall, Alex
    Grimes, Matthew
    Cipolla, Roberto
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2938 - 2946
  • [26] Kingma DP, 2014, ARXIV
  • [27] 3D-RCNN: Instance-level 3D Object Reconstruction via Render-and-Compare
    Kundu, Abhijit
    Li, Yin
    Rehg, James M.
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3559 - 3568
  • [28] Image-based reconstruction of spatial appearance and geometric detail
    Lensch, HPA
    Kautz, J
    Goesele, M
    Heidrich, W
    Seidel, HP
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (02): : 234 - 257
  • [29] Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields
    Liu, Fayao
    Shen, Chunhua
    Lin, Guosheng
    Reid, Ian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (10) : 2024 - 2039
  • [30] Joint Face Alignment and 3D Face Reconstruction
    Liu, Feng
    Zeng, Dan
    Zhao, Qijun
    Liu, Xiaoming
    [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 545 - 560