A comprehensive overview of dynamic visual SLAM and deep learning: concepts, methods and challenges

Cited by: 16
Authors
Beghdadi, Ayman [1]
Mallem, Malik [1]
Affiliations
[1] Paris Saclay Univ, Univ Evry, IBISC Lab, F-91025 Evry, France
Keywords
Survey; SLAM (simultaneous localization and mapping); Visual SLAM; Deep learning; Environmental perception; Mobile robot; INERTIAL ODOMETRY; SIMULTANEOUS LOCALIZATION; CAMERA CALIBRATION; MONOCULAR SLAM; VERSATILE; VISION; ROBUST; SCALE;
DOI
10.1007/s00138-022-01306-w
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Visual SLAM (vSLAM) is a research topic that has been developing rapidly in recent years, especially with the renewed interest in machine learning and, more particularly, deep-learning-based approaches. Most current research aims to improve accuracy and robustness in complex and dynamic environments. This very active topic has reached a significant level of maturity. This paper presents a relatively detailed and easily understood survey of vSLAM in the context of deep learning. This study attempts to meet this challenge by better organizing the literature, explaining the basic concepts and tools, and presenting the current trends. The contributions of this study can be summarized in three parts. The first is to present the state of the art incrementally, following the classical processing stages of vSLAM-based systems. The second is to give our short- and medium-term view of the development of this very active and evolving field. Finally, we share our opinions on this subject and its interactions with new trends and, more particularly, the deep learning paradigm. We believe that this contribution will be an overview and, more importantly, a critical and detailed vision that serves as a roadmap for the field of vSLAM, both in terms of models and concepts and in terms of associated technologies.
Pages: 28
Related papers
136 in total
[101] Stenborg E, 2018, IEEE INT CONF ROBOT, P6484, DOI 10.1109/ICRA.2018.8463150
[102] OpenVSLAM: A Versatile Visual SLAM Framework [J]. Sumikura, Shinya; Shibuya, Mikiya; Sakurada, Ken. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019: 2292-2295
[103] PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume [J]. Sun, Deqing; Yang, Xiaodong; Liu, Ming-Yu; Kautz, Jan. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 8934-8943
[104] Robust Stereo Visual Inertial Odometry for Fast Autonomous Flight [J]. Sun, Ke; Mohta, Kartik; Pfrommer, Bernd; Watterson, Michael; Liu, Sikang; Mulgaonkar, Yash; Taylor, Camillo J.; Kumar, Vijay. IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3(2): 965-972
[105] MemNet: A Persistent Memory Network for Image Restoration [J]. Tai, Ying; Yang, Jian; Liu, Xiaoming; Xu, Chunyan. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 4549-4557
[106] Taketomi T., 2017, IPSJ Transactions on Computer Vision and Applications, V9, P16, DOI 10.1186/s41074-017-0027-2
[107] Tan W, 2013, INT SYM MIX AUGMENT, P209, DOI 10.1109/ISMAR.2013.6671781
[108] GCNv2: Efficient Correspondence Prediction for Real-Time SLAM [J]. Tang, Jiexiong; Ericson, Ludvig; Folkesson, John; Jensfelt, Patric. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4(4): 3505-3512
[109] CNN-SLAM: Real-time dense monocular SLAM with learned depth prediction [J]. Tateno, Keisuke; Tombari, Federico; Laina, Iro; Navab, Nassir. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 6565-6574
[110] Teed Zachary, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12347), P402, DOI 10.1007/978-3-030-58536-5_24