NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video

被引:190
作者
Sun, Jiaming [1 ,2 ,3 ]
Xie, Yiming [1 ,3 ]
Chen, Linghao [1 ,3 ]
Zhou, Xiaowei [1 ,3 ]
Bao, Hujun [1 ,3 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
[3] State Key Lab CAD & CG & ZJU SenseTime Joint Lab, Hangzhou, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.01534
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes for each video fragment sequentially by a neural network. A learning-based TSDF fusion module based on gated recurrent units is used to guide the network to fuse features from previous fragments. This design allows the network to capture local smoothness prior and global shape prior of 3D surfaces when sequentially reconstructing the surfaces, resulting in accurate, coherent, and real-time surface reconstruction. The experiments on ScanNet and 7-Scenes datasets show that our system outperforms state-of-the-art methods in terms of both accuracy and speed. To the best of our knowledge, this is the first learning-based system that is able to reconstruct dense coherent 3D geometry in real-time. Code is available at the project page: https://zju3dv.github.io/neuralrecon/.
引用
收藏
页码:15593 / 15602
页数:10
相关论文
共 56 条
[1]   Large-Scale Data for Multiple-View Stereopsis [J].
Aanaes, Henrik ;
Jensen, Rasmus Ramsbol ;
Vogiatzis, George ;
Tola, Engin ;
Dahl, Anders Bjorholm .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 120 (02) :153-168
[2]  
[Anonymous], 2018, ECCV, DOI DOI 10.1007/978-3-030-01237-3_47
[3]  
[Anonymous], 2015, 3DV
[4]  
[Anonymous], 2014, CVPR
[5]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00459
[6]  
[Anonymous], 2017, IEEE INT ENTERP DIST, DOI DOI 10.1109/EDOC.2017.11
[7]  
[Anonymous], 2017, CVPR, DOI DOI 10.1109/CVPR.2017.596
[8]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00025
[9]  
Brox Thomas, 2018, ECCV
[10]  
Campos C., 2020, ORB-SLAM3: An Accurate Open- Source Library for Visual, Visual-Inertial and Multi- Map SLAM