AutoInt: Automatic Integration for Fast Neural Volume Rendering

被引：120

作者：

Lindell, David B. ^{[1
]}

Martel, Julien N. P. ^{[1
]}

Wetzstein, Gordon ^{[1
]}

机构：

[1] Stanford Univ, Stanford, CA 94305 USA

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

关键词：

NETWORKS;

D O I：

10.1109/CVPR46437.2021.01432

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Numerical integration is a foundational technique in scientific computing and is at the core of many computer vision applications. Among these applications, neural volume rendering has recently been proposed as a new paradigm for view synthesis, achieving photorealistic image quality. However, a fundamental obstacle to making these methods practical is the extreme computational and memory requirements caused by the required volume integrations along the rendered rays during training and inference. Millions of rays, each requiring hundreds of forward passes through a neural network are needed to approximate those integrations with Monte Carlo sampling. Here, we propose automatic integration, a new framework for learning efficient, closed-form solutions to integrals using coordinate-based neural networks. For training, we instantiate the computational graph corresponding to the derivative of the coordinate-based network. The graph is fitted to the signal to integrate. After optimization, we reassemble the graph to obtain a network that represents the antiderivative. By the fundamental theorem of calculus, this enables the calculation of any definite integral in two evaluations of the network. Applying this approach to neural rendering, we improve a tradeoff between rendering speed and image quality: improving render times by greater than 10x with a tradeoff of reduced image quality.

引用

页码：14551 / 14560

页数：10

共 62 条

[1] Abadi Martin, 2016, ARXIV160304467
[2] Attal Benjamin, 2020, ECCV, P441
[3] SAL: Sign Agnostic Learning of Shapes from Raw Data
Atzmon, Matan
Lipman, Yaron
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2562 - 2571
[4] Bradbury J., 2019, Advances in Neural Information Processing, VVolume 32, DOI DOI 10.48550/ARXIV.1912.01703
[5] Immersive Light Field Video with a Layered Mesh Representation
Broxton, Michael
Flynn, John
Overbeck, Ryan
Erickson, Daniel
Hedman, Peter
Duvall, Matthew
Dourgarian, Jason
Busch, Jay
Whalen, Matt
Debevec, Paul
[J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04):
[6] Free-viewpoint video of human actors
Carranza, J
Theobalt, C
Magnor, MA
Seidel, HP
[J]. ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03): : 569 - 577
[7] Chabra R, 2020, EUR C COMP VIS, P608, DOI 10.1007
[8] pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis
Chan, Eric R.
Monteiro, Marco
Kellnhofer, Petr
Wu, Jiajun
Wetzstein, Gordon
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5795 - 5805
[9] Chandrasekhar S., 2013, RAD TRANSFER
[10] Learning Implicit Fields for Generative Shape Modeling
Chen, Zhiqin
Zhang, Hao
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5932 - 5941

← 1 2 3 4 5 6 7 →