AutoInt: Automatic Integration for Fast Neural Volume Rendering

被引:120
作者
Lindell, David B. [1 ]
Martel, Julien N. P. [1 ]
Wetzstein, Gordon [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
NETWORKS;
D O I
10.1109/CVPR46437.2021.01432
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Numerical integration is a foundational technique in scientific computing and is at the core of many computer vision applications. Among these applications, neural volume rendering has recently been proposed as a new paradigm for view synthesis, achieving photorealistic image quality. However, a fundamental obstacle to making these methods practical is the extreme computational and memory requirements caused by the required volume integrations along the rendered rays during training and inference. Millions of rays, each requiring hundreds of forward passes through a neural network are needed to approximate those integrations with Monte Carlo sampling. Here, we propose automatic integration, a new framework for learning efficient, closed-form solutions to integrals using coordinate-based neural networks. For training, we instantiate the computational graph corresponding to the derivative of the coordinate-based network. The graph is fitted to the signal to integrate. After optimization, we reassemble the graph to obtain a network that represents the antiderivative. By the fundamental theorem of calculus, this enables the calculation of any definite integral in two evaluations of the network. Applying this approach to neural rendering, we improve a tradeoff between rendering speed and image quality: improving render times by greater than 10x with a tradeoff of reduced image quality.
引用
收藏
页码:14551 / 14560
页数:10
相关论文
共 62 条
  • [1] Abadi Martin, 2016, ARXIV160304467
  • [2] Attal Benjamin, 2020, ECCV, P441
  • [3] SAL: Sign Agnostic Learning of Shapes from Raw Data
    Atzmon, Matan
    Lipman, Yaron
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2562 - 2571
  • [4] Bradbury J., 2019, Advances in Neural Information Processing, VVolume 32, DOI DOI 10.48550/ARXIV.1912.01703
  • [5] Immersive Light Field Video with a Layered Mesh Representation
    Broxton, Michael
    Flynn, John
    Overbeck, Ryan
    Erickson, Daniel
    Hedman, Peter
    Duvall, Matthew
    Dourgarian, Jason
    Busch, Jay
    Whalen, Matt
    Debevec, Paul
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04):
  • [6] Free-viewpoint video of human actors
    Carranza, J
    Theobalt, C
    Magnor, MA
    Seidel, HP
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03): : 569 - 577
  • [7] Chabra R, 2020, EUR C COMP VIS, P608, DOI 10.1007
  • [8] pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis
    Chan, Eric R.
    Monteiro, Marco
    Kellnhofer, Petr
    Wu, Jiajun
    Wetzstein, Gordon
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5795 - 5805
  • [9] Chandrasekhar S., 2013, RAD TRANSFER
  • [10] Learning Implicit Fields for Generative Shape Modeling
    Chen, Zhiqin
    Zhang, Hao
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5932 - 5941