Deep Learning for Visual Localization and Mapping: A Survey

被引:16
作者
Chen, Changhao [1 ]
Wang, Bing [2 ]
Lu, Chris Xiaoxuan [3 ]
Trigoni, Niki [4 ]
Markham, Andrew [4 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[2] Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hong Kong, Peoples R China
[3] Univ Edinburgh, Sch Informat, Edinburgh EH8 9AB, Midlothian, Scotland
[4] Univ Oxford, Dept Comp Sci, Oxford OX1 3QD, England
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Deep learning; global localization; visual odometry (VO); visual simultaneous localization and mapping (SLAM); visual-inertial odometry (VIO); ODOMETRY; ROBUST; DEPTH; SCALE; SLAM; REPRESENTATION; VERSATILE;
D O I
10.1109/TNNLS.2023.3309809
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep-learning-based localization and mapping approaches have recently emerged as a new research direction and receive significant attention from both industry and academia. Instead of creating hand-designed algorithms based on physical models or geometric theories, deep learning solutions provide an alternative to solve the problem in a data-driven way. Benefiting from the ever-increasing volumes of data and computational power on devices, these learning methods are fast evolving into a new area that shows potential to track self-motion and estimate environmental models accurately and robustly for mobile agents. In this work, we provide a comprehensive survey and propose a taxonomy for the localization and mapping methods using deep learning. This survey aims to discuss two basic questions: whether deep learning is promising for localization and mapping, and how deep learning should be applied to solve this problem. To this end, a series of localization and mapping topics are investigated, from the learning-based visual odometry and global relocalization to mapping, and simultaneous localization and mapping (SLAM). It is our hope that this survey organically weaves together the recent works in this vein from robotics, computer vision, and machine learning communities and serves as a guideline for future researchers to apply deep learning to tackle the problem of visual localization and mapping.
引用
收藏
页码:17000 / 17020
页数:21
相关论文
共 254 条
[1]  
Almalioglu Y, 2019, IEEE INT CONF ROBOT, P5474, DOI [10.1109/icra.2019.8793512, 10.1109/ICRA.2019.8793512]
[2]  
[Anonymous], 2018, PROC BRIT MACH VIS C
[3]  
[Anonymous], 2006, Probabilistic Robotics. Intelligent Robotics and Autonomous Agents
[4]  
Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
[5]  
Lindell DB, 2021, Arxiv, DOI arXiv:2012.01714
[6]   Simultaneous localization and mapping (SLAM): Part II [J].
Bailey, Tim ;
Durrant-Whyte, Hugh .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2006, 13 (03) :108-117
[7]   RelocNet: Continuous Metric Learning Relocalisation Using Neural Nets [J].
Balntas, Vassileios ;
Li, Shuda ;
Prisacariu, Victor .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :782-799
[8]  
Balntas Vassileios, 2016, PROCEDINGS BRIT MACH, P1, DOI DOI 10.5244/C.30.119
[9]   Hierarchical Surface Prediction for 3D Object Reconstruction [J].
Bane, Christian ;
Tulsiani, Shubham ;
Malik, Jitendra .
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, :412-420
[10]   Vector-based navigation using grid-like representations in artificial agents [J].
Banino, Andrea ;
Barry, Caswell ;
Uria, Benigno ;
Blundell, Charles ;
Lillicrap, Timothy ;
Mirowski, Piotr ;
Pritzel, Alexander ;
Chadwick, Martin J. ;
Degris, Thomas ;
Modayil, Joseph ;
Wayne, Greg ;
Soyer, Hubert ;
Viola, Fabio ;
Zhang, Brian ;
Goroshin, Ross ;
Rabinowitz, Neil ;
Pascanu, Razvan ;
Beattie, Charlie ;
Petersen, Stig ;
Sadik, Amir ;
Gaffney, Stephen ;
King, Helen ;
Kavukcuoglu, Koray ;
Hassabis, Demis ;
Hadsell, Raia ;
Kumaran, Dharshan .
NATURE, 2018, 557 (7705) :429-+