CoNeRF: Controllable Neural Radiance Fields

被引:50
作者
Kania, Kacper [1 ,2 ]
Yi, Kwang Moo [1 ]
Kowalski, Marek [6 ]
Trzciniski, Tomasz [2 ,3 ,4 ]
Tagliasacchi, Andrea [5 ,7 ]
机构
[1] Univ British Columbia, Vancouver, BC, Canada
[2] Warsaw Univ Technol, Warsaw, Poland
[3] Tooploox, Wroclaw, Poland
[4] Jagiellonian Univ, Krakow, Poland
[5] Simon Fraser Univ, Burnaby, BC, Canada
[6] Microsoft Res, Redmond, WA USA
[7] Google Res, Mountain View, CA USA
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/CVPR52688.2022.01807
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We extend neural 3D representations to allow for intuitive and interpretable user control beyond novel view rendering (i.e. camera control). We allow the user to annotate which part of the scene one wishes to control with just a small number of mask annotations in the training images. Our key idea is to treat the attributes as latent variables that are regressed by the neural network given the scene encoding. This leads to a few-shot learning framework, where attributes are discovered automatically by the framework, when annotations are not provided. We apply our method to various scenes with different types of controllable attributes (e.g. expression control on human faces, or state control in movement of inanimate objects). Overall, we demonstrate, to the best of our knowledge, for the first time novel view and novel attribute re-rendering of scenes from a single video.
引用
收藏
页码:18602 / 18611
页数:10
相关论文
共 55 条
[1]   Interactive digital photomontage [J].
Agarwala, A ;
Dontcheva, M ;
Agrawala, M ;
Drucker, S ;
Colburn, A ;
Curless, B ;
Salesin, D ;
Cohen, M .
ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03) :294-302
[2]  
Alldieck Thiemo, 2021, C COMP VIS PATT REC
[3]  
[Anonymous], 2017, INT C COMP VIS
[4]  
[Anonymous], SUZANNE 3D MODEL
[5]  
[Anonymous], BUNNY 3D MODEL
[6]  
[Anonymous], TEAPOT 3D MODEL
[7]  
[Anonymous], 2018, CoRR, abs/1812.02230
[8]  
Belharbi Soufiane, 2021, IEEE WINT C APPL COM
[9]   NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis [J].
Ben Mildenhall ;
Srinivasan, Pratul P. ;
Tancik, Matthew ;
Barron, Jonathan T. ;
Ramamoorthi, Ravi ;
Ng, Ren .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :405-421
[10]   Economic forecasts, anchoring bias, and stock returns [J].
Birz, Gene ;
Dutta, Sandip ;
Yu, Han .
FINANCIAL MANAGEMENT, 2022, 51 (01) :169-191