DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions

被引:0
作者
Laidlow, Tristan [1 ]
Czarnowski, Jan [1 ]
Leutenegger, Stefan [1 ]
机构
[1] Imperial Coll London, Dyson Robot Lab, London, England
来源
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2019年
关键词
D O I
10.1109/icra.2019.8793527
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While the keypoint-based maps created by sparse monocular Simultaneous Localisation and Mapping (SLAM) systems are useful for camera tracking, dense 3D reconstructions may be desired for many robotic tasks. Solutions involving depth cameras are limited in range and to indoor spaces, and dense reconstruction systems based on minimising the photometric error between frames are typically poorly constrained and suffer from scale ambiguity. To address these issues, we propose a 3D reconstruction system that leverages the output of a Convolutional Neural Network (CNN) to produce fully dense depth maps for keyframes that include metric scale. Our system, DeepFusion, is capable of producing real-time dense reconstructions on a GPU. It fuses the output of a semi-dense multiview stereo algorithm with the depth and gradient predictions of a CNN in a probabilistic fashion, using learned uncertainties produced by the network. While the network only needs to be run once per keyframe, we are able to optimise for the depth map with each new frame so as to constantly make use of new geometric constraints. Based on its performance on synthetic and real world datasets, we demonstrate that DeepFusion is capable of performing at least as well as other comparable systems.
引用
收藏
页码:4068 / 4074
页数:7
相关论文
共 50 条
[31]   Domain-Adaptive Single-View 3D Reconstruction [J].
Pinheiro, Pedro O. ;
Rostamzadeh, Negar ;
Ahn, Sungjin .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7637-7646
[32]   Dynamic Domain Adaptation for Single-view 3D Reconstruction [J].
Yang, Cong ;
Xie, Housen ;
Tian, Haihong ;
Yu, Yuanlong .
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, :3563-3570
[33]   Single-view 3D reconstruction via dual attention [J].
Li, Chenghuan ;
Xiao, Meihua ;
Li, Zehuan ;
Chen, Fangping ;
Wang, Dingli .
PEERJ COMPUTER SCIENCE, 2024, 10
[34]   HairStep: Transfer Synthetic to Real Using Strand and Depth Maps for Single-View 3D Hair Modeling [J].
Zheng, Yujian ;
Jin, Zirong ;
Li, Moran ;
Huang, Haibin ;
Ma, Chongyang ;
Cui, Shuguang ;
Han, Xiaoguang .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :12726-12735
[35]   Real-time dense 3D object reconstruction using RGB-D sensor [J].
Ruchay, Alexey ;
Dorofeev, Konstantin ;
Kalschikov, Vsevolod .
APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIII, 2020, 11510
[36]   Robust, Real-Time 3D Face Tracking from a Monocular View [J].
Liao, Wei-Kai ;
Fidaleo, Douglas ;
Medioni, Gerard .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2010,
[37]   Robust, Real-Time 3D Face Tracking from a Monocular View [J].
Wei-Kai Liao ;
Douglas Fidaleo ;
Gerard Medioni .
EURASIP Journal on Image and Video Processing, 2010
[38]   RTG-SLAM: Real-time 3D Reconstruction at Scale Using Gaussian Splatting [J].
Peng, Zhexi ;
Shao, Tianjia ;
Liu, Yong ;
Zhou, Jingke ;
Yang, Yin ;
Wang, Jingdong ;
Zhou, Kun .
PROCEEDINGS OF SIGGRAPH 2024 CONFERENCE PAPERS, 2024,
[39]   Camera-Agnostic Monocular SLAM and Semi-dense 3D Reconstruction [J].
Ruenz, Martin ;
Neuhaus, Frank ;
Winkens, Christian ;
Paulus, Dietrich .
PATTERN RECOGNITION, GCPR 2016, 2016, 9796 :285-296
[40]   NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video [J].
Sun, Jiaming ;
Xie, Yiming ;
Chen, Linghao ;
Zhou, Xiaowei ;
Bao, Hujun .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15593-15602