DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions

被引:0
作者
Laidlow, Tristan [1 ]
Czarnowski, Jan [1 ]
Leutenegger, Stefan [1 ]
机构
[1] Imperial Coll London, Dyson Robot Lab, London, England
来源
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2019年
关键词
D O I
10.1109/icra.2019.8793527
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While the keypoint-based maps created by sparse monocular Simultaneous Localisation and Mapping (SLAM) systems are useful for camera tracking, dense 3D reconstructions may be desired for many robotic tasks. Solutions involving depth cameras are limited in range and to indoor spaces, and dense reconstruction systems based on minimising the photometric error between frames are typically poorly constrained and suffer from scale ambiguity. To address these issues, we propose a 3D reconstruction system that leverages the output of a Convolutional Neural Network (CNN) to produce fully dense depth maps for keyframes that include metric scale. Our system, DeepFusion, is capable of producing real-time dense reconstructions on a GPU. It fuses the output of a semi-dense multiview stereo algorithm with the depth and gradient predictions of a CNN in a probabilistic fashion, using learned uncertainties produced by the network. While the network only needs to be run once per keyframe, we are able to optimise for the depth map with each new frame so as to constantly make use of new geometric constraints. Based on its performance on synthetic and real world datasets, we demonstrate that DeepFusion is capable of performing at least as well as other comparable systems.
引用
收藏
页码:4068 / 4074
页数:7
相关论文
共 50 条
[1]   Photometric single-view dense 3D reconstruction in endoscopy [J].
Batlle, Victor M. ;
Montiel, J. M. M. ;
Tardos, Juan D. .
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, :4904-4910
[2]   CNN-SLAM: Real-time dense monocular SLAM with learned depth prediction [J].
Tateno, Keisuke ;
Tombari, Federico ;
Laina, Iro ;
Navab, Nassir .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6565-6574
[3]   MDF-SLAM: Monocular Dense 3D Reconstruction Based on Depth Estimation [J].
Zhu, Zuojun ;
Xu, Xiangrong ;
Li, Yonggang ;
You, Tianya ;
Wang, Xiaoyi ;
Wang, Zhixiong ;
Wang, Haiyan ;
Xu, Shanshan ;
Rodic, Aleksandar ;
Petrovic, Petar B. .
2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, :787-792
[4]   Single-View 3D Shape Reconstruction with Learned Gradient Descent [J].
Zhang, Guanglun ;
Yang, Lu .
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2020, PT III, 2020, 12307 :173-184
[5]   Real-time 3D reconstruction using SLAM for building construction [J].
Qian, T. ;
Zhang, C. .
SUSTAINABLE BUILDINGS AND STRUCTURES: BUILDING A SUSTAINABLE TOMORROW, 2020, :313-318
[6]   Real-Time Dense Monocular SLAM With Online Adapted Depth Prediction Network [J].
Luo, Hongcheng ;
Gao, Yang ;
Wu, Yuhao ;
Liao, Chunyuan ;
Yang, Xin ;
Cheng, Kwang-Ting .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (02) :470-483
[7]   Single-View 3D Reconstruction of Curves [J].
Fakih, Ali ;
Wilser, Nicola ;
Maillot, Yvan ;
Cordier, Frederic .
ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT II, 2024, 14496 :3-14
[8]   MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation [J].
Zhang, Hanwei ;
Uchiyama, Hideaki ;
Ono, Shintaro ;
Kawasaki, Hiroshi .
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, :4865-4872
[9]   Adaptive SLAM with Synthetic Stereo Dataset Generation for Real-time Dense 3D Reconstruction [J].
Billy, Antoine ;
Pouteau, Sebastien ;
Desbarats, Pascal ;
Chaumette, Serge ;
Domenger, Jean-Philippe .
PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, :840-848
[10]   StereoScan: Dense 3d Reconstruction in Real-time [J].
Geiger, Andreas ;
Ziegler, Julius ;
Stiller, Christoph .
2011 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2011, :963-968