SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields

被引:60
作者
Mirzaei, Ashkan [1 ,2 ]
Aumentado-Armstrong, Tristan [1 ,2 ,4 ]
Derpanis, Konstantinos G. [1 ,3 ,4 ]
Kelly, Jonathan [2 ]
Brubaker, Marcus A. [1 ,3 ,4 ]
Gilitschenski, Igor [2 ]
Levinshtein, Alex [1 ]
机构
[1] Samsung AI Ctr Toronto, Toronto, ON, Canada
[2] Univ Toronto, Toronto, ON, Canada
[3] York Univ, N York, ON, Canada
[4] Vector Inst AI, Toronto, ON, Canada
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
关键词
D O I
10.1109/CVPR52729.2023.01980
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Radiance Fields (NeRFs) have emerged as a popular approach for novel view synthesis. While NeRFs are quickly being adapted for a wider set of applications, intuitively editing NeRF scenes is still an open challenge. One important editing task is the removal of unwanted objects from a 3D scene, such that the replaced region is visually plausible and consistent with its context. We refer to this task as 3D inpainting. In 3D, solutions must be both consistent across multiple views and geometrically valid. In this paper, we propose a novel 3D inpainting method that addresses these challenges. Given a small set of posed images and sparse annotations in a single input image, our framework first rapidly obtains a 3D segmentation mask for a target object. Using the mask, a perceptual optimization-based approach is then introduced that leverages learned 2D image inpainters, distilling their information into 3D space, while ensuring view consistency. We also address the lack of a diverse benchmark for evaluating 3D scene inpainting methods by introducing a dataset comprised of challenging real-world scenes. In particular, our dataset contains views of the same scene with and without a target object, enabling more principled benchmarking of the 3D inpainting task. We first demonstrate the superiority of our approach on multiview segmentation, comparing to NeRF-based methods and 2D segmentation approaches. We then evaluate on the task of 3D inpainting, establishing state-of-the-art performance against other NeRF manipulation algorithms, as well as a strong 2D image inpainter baseline.
引用
收藏
页码:20669 / 20679
页数:11
相关论文
共 77 条
[21]  
Hu Tao, 2022, CVPR
[22]  
Iizuka Satoshi, 2017, TOG
[23]  
Jain J., 2023, WACV
[24]   A comprehensive review of past and present image inpainting methods [J].
Jam, Jireh ;
Kendrick, Connah ;
Walker, Kevin ;
Drouard, Vincent ;
Hsu, Jison Gee-Sern ;
Yap, Moi Hoon .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 203
[25]  
Jampani Varun, 2021, ICCV
[26]   CoNeRF: Controllable Neural Radiance Fields [J].
Kania, Kacper ;
Yi, Kwang Moo ;
Kowalski, Marek ;
Trzciniski, Tomasz ;
Tagliasacchi, Andrea .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :18602-18611
[27]   Comparative Causal Analysis of Network Log Data in Two Large ISPs [J].
Kobayashi, Satoru ;
Shima, Keiichi ;
Cho, Kenjiro ;
Akashi, Osamu ;
Fukuda, Kensuke .
PROCEEDINGS OF THE IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2022, 2022,
[28]  
Li Wenbo, 2022, CVPR
[29]  
Lin Chen-Hsuan, 2021, ICCV, P8
[30]  
Lindell David B., 2022, CVPR