GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields

被引:490
作者
Niemeyer, Michael [1 ,2 ]
Geiger, Andreas [1 ,2 ]
机构
[1] Max Planck Inst Intelligent Syst, Tubingen, Germany
[2] Univ Tubingen, Tubingen, Germany
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
基金
欧洲研究理事会;
关键词
D O I
10.1109/CVPR46437.2021.01129
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep generative models allow for photorealistic image synthesis at high resolutions. But for many applications, this is not enough: content creation also needs to be controllable. While several recent works investigate how to disentangle underlying factors of variation in the data, most of them operate in 2D and hence ignore that our world is three-dimensional. Further, only few works consider the compositional nature of scenes. Our key hypothesis is that incorporating a compositional 3D scene representation into the generative model leads to more controllable image synthesis. Representing scenes as compositional generative neural feature fields allows us to disentangle one or multiple objects from the background as well as individual objects' shapes and appearances while learning from unstructured and unposed image collections without any additional supervision. Combining this scene representation with a neural rendering pipeline yields a fast and realistic image synthesis model. As evidenced by our experiments, our model is able to disentangle individual objects and allows for translating and rotating them in the scene as well as changing the camera pose.
引用
收藏
页码:11448 / 11459
页数:12
相关论文
共 99 条
[1]  
Abdal Rameen, ARXIVORG 200802401
[2]  
Alhaija H. A., 2018, P AS C COMP VIS ACCV
[3]  
Anciukevicius Titas, 2020, 200400642 ARXIVORG
[4]  
[Anonymous], 2016, ARXIV160705387
[5]  
Arandjelovic R., 2019, ARXIV190511369
[6]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[7]   The effects of remittances, foreign direct investment, and foreign aid on economic growth: An empirical analysis [J].
Bird, Graham ;
Choi, Yongseok .
REVIEW OF DEVELOPMENT ECONOMICS, 2020, 24 (01) :1-30
[8]  
Brock A., 2019, P INT C LEARN REPR
[9]  
Buhler JD, 2003, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2004, P5
[10]   Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction [J].
Chabra, Rohan ;
Lenssen, Jan E. ;
Ilg, Eddy ;
Schmidt, Tanner ;
Straub, Julian ;
Lovegrove, Steven ;
Newcombe, Richard .
COMPUTER VISION - ECCV 2020, PT XXIX, 2020, 12374 :608-625