CNN-SLAM: Real-time dense monocular SLAM with learned depth prediction

被引:434
作者
Tateno, Keisuke [1 ,2 ]
Tombari, Federico [1 ]
Laina, Iro [1 ]
Navab, Nassir [1 ,3 ]
机构
[1] CAMP TU Munich, Munich, Germany
[2] Canon Inc, Tokyo, Japan
[3] Johns Hopkins Univ, Baltimore, MD USA
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
关键词
D O I
10.1109/CVPR.2017.695
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the recent advances in depth prediction from Convolutional Neural Networks (CNNs), this paper investigates how predicted depth maps from a deep neural network can be deployed for accurate and dense monocular reconstruction. We propose a method where CNN-predicted dense depth maps are naturally fused together with depth measurements obtained from direct monocular SLAM. Our fusion scheme privileges depth prediction in image locations where monocular SLAM approaches tend to fail, e.g. along low-textured regions, and vice-versa. We demonstrate the use of depth prediction for estimating the absolute scale of the reconstruction, hence overcoming one of the major limitations of monocular SLAM. Finally, we propose a framework to efficiently fuse semantic labels, obtained from a single frame, with dense SLAM, yielding semantically coherent scene reconstruction from a single view. Evaluation results on two benchmark datasets show the robustness and accuracy of our approach.
引用
收藏
页码:6565 / 6574
页数:10
相关论文
共 30 条
[1]  
Delage E., 2006, P INT C COMP VIS PAT
[2]  
Eigen D., 2014, P C NEUR INF PROC SY
[3]   Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture [J].
Eigen, David ;
Fergus, Rob .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2650-2658
[4]   LSD-SLAM: Large-Scale Direct Monocular SLAM [J].
Engel, Jakob ;
Schoeps, Thomas ;
Cremers, Daniel .
COMPUTER VISION - ECCV 2014, PT II, 2014, 8690 :834-849
[5]   Semi-Dense Visual Odometry for a Monocular Camera [J].
Engel, Jakob ;
Sturm, Juergen ;
Cremers, Daniel .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1449-1456
[6]   Real-time monocular object SLAM [J].
Galvez-Lopez, Dorian ;
Salas, Marta ;
Tardos, Juan D. ;
Montiel, J. M. M. .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 :435-449
[7]  
Greene W. N., 2016, 2016 IEEE INT C ROB
[8]  
Handa A, 2014, IEEE INT CONF ROBOT, P1524, DOI 10.1109/ICRA.2014.6907054
[9]  
He K., 2016, P IEEE COMPUTER SOC, P770
[10]  
Hoiem D., 2005, COMPUTER VISION PATT