Neural Segmentation Field in 3D Scene

Times cited: 0

Authors:

Huang, Tsung-Wei [1]

Tu, Peihan [1, 2]

Su, Guan-Ming [1]

Affiliations:

[1] Dolby Labs Inc, Sunnyvale, CA 94085 USA

[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA

Source:

FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF | 2023

Keywords:

neural fields; segmentation;

DOI:

10.1109/IEEECONF59524.2023.10476870

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Neural Radiance Field (NeRF) represents a 3D scene implicitly as one or more neural networks that take a 3D position and viewing direction as input and predict the corresponding color texture and volume density. With the learned representation, it can render the color texture of arbitrary views of the 3D scene by querying the corresponding 3D positions and viewing directions for all pixels in those views. However, in addition to color texture, users sometimes also care about semantic information in the 3D scene, e.g., object segmentation or semantic segmentation. Therefore, we propose the neural segmentation field, an implicit representation that models the segmentation of a 3D scene as a neural network on top of a pre-trained 3D scene representation such as NeRF. Specifically, given a pre-trained NeRF and a set of 2D segmentation maps with known camera parameters, we learn a neural segmentation field that can be used to render 2D segmentation maps for arbitrary viewpoints. Experimental results on the Replica dataset show that our model achieves high segmentation quality (accuracy > 0.986 and mean intersection over union (mIoU) > 0.91) with a small model size (< 0.4 MB) for scenes with more than 25 semantic classes.
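The abstract does not state how the segmentation field is rendered or exactly how the metrics are computed. As a rough illustration only, the sketch below assumes per-sample class probabilities are blended along a ray with the standard NeRF volume-rendering weights, and that accuracy and mIoU follow their usual definitions. All function names and the toy inputs are hypothetical and are not taken from the paper.

```python
import math


def render_weights(densities, deltas):
    """Standard NeRF quadrature weights: w_i = T_i * (1 - exp(-sigma_i * delta_i))."""
    weights = []
    transmittance = 1.0  # fraction of the ray not yet absorbed
    for sigma, delta in zip(densities, deltas):
        alpha = 1.0 - math.exp(-sigma * delta)  # opacity of this sample
        weights.append(transmittance * alpha)
        transmittance *= 1.0 - alpha
    return weights


def render_pixel_label(densities, deltas, class_probs):
    """Blend per-sample class probabilities (assumed field output) into one pixel label."""
    weights = render_weights(densities, deltas)
    num_classes = len(class_probs[0])
    scores = [
        sum(w * probs[c] for w, probs in zip(weights, class_probs))
        for c in range(num_classes)
    ]
    label = max(range(num_classes), key=scores.__getitem__)
    return label, scores


def accuracy_and_miou(pred, target, num_classes):
    """Pixel accuracy and mean IoU over the classes that occur in pred or target."""
    correct = sum(p == t for p, t in zip(pred, target))
    ious = []
    for c in range(num_classes):
        inter = sum(p == c and t == c for p, t in zip(pred, target))
        union = sum(p == c or t == c for p, t in zip(pred, target))
        if union > 0:  # skip classes absent from both maps
            ious.append(inter / union)
    return correct / len(target), sum(ious) / len(ious)


if __name__ == "__main__":
    # Toy ray: empty space first, then two increasingly dense samples.
    label, scores = render_pixel_label(
        densities=[0.0, 5.0, 50.0],
        deltas=[0.1, 0.1, 0.1],
        class_probs=[[0.9, 0.1], [0.2, 0.8], [0.1, 0.9]],
    )
    print(label, scores)
    print(accuracy_and_miou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2))
```

In this sketch the densities would come from the frozen, pre-trained NeRF, so only the class probabilities depend on the segmentation network. That is one plausible reading of "on top of a pre-trained 3D scene representation", not a confirmed detail of the method.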


Pages: 1141-1145

Page count: 5
