Unlocking robotic perception: comparison of deep learning methods for simultaneous localization and mapping and visual simultaneous localization and mapping in robot

被引:0
作者
Hoang, Minh Long [1 ]
机构
[1] Univ Parma, Dept Engn & Architecture, I-43124 Parma, Italy
关键词
Deep learning; Simultaneous localization and mapping; Visual SLAM; Robot; NEURAL-NETWORKS; SLAM; ATTENTION; RECOGNITION; FUSION; IMAGE;
D O I
10.1007/s41315-025-00419-5
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Simultaneous Localization and Mapping (SLAM) and Visual SLAM are crucial technologies in robotics, allowing autonomous systems to navigate and comprehend their environment. Deep learning (DL) has become a powerful tool in driving progress in these areas, providing solutions that improve accuracy, efficiency, and resilience. This article thoroughly analyzes different deep learning techniques designed explicitly for SLAM and Visual SLAM applications in robotic systems. This work provides a detailed overview of DL roles in SLAM and VSLAM and emphasizes the differences between these two fields. Five powerful DL methods are investigated: Convolutional Neural Networks in extracting features and understanding meaning, Recurrent Neural Network in modeling temporal relationships, Deep Reinforcement Learning in developing exploration strategies, Graph Neural Network in modeling spatial relationships, and Attention Mechanisms in selectively processing information. In this research, we will examine the advantages and disadvantages of each approach in relation to robotic applications, taking into account issues such as real-time performance, resource restrictions, and adaptability to various situations. This article seeks to guide researchers and practitioners in selecting suitable deep learning algorithms to improve the capabilities of SLAM and Visual SLAM in robotic systems by combining ideas from recent research and actual implementations. The popular types of each concerned DL will be synthesized with the discussion of pros and cons.
引用
收藏
页数:33
相关论文
共 208 条
  • [71] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [72] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
  • [73] Graph Convolutional Networks for Hyperspectral Image Classification
    Hong, Danfeng
    Gao, Lianru
    Yao, Jing
    Zhang, Bing
    Plaza, Antonio
    Chanussot, Jocelyn
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (07): : 5966 - 5978
  • [74] Hsu WH., 2007, ACM Multimedia, DOI [10.1145/1291233.1291446, DOI 10.1145/1291233.1291446]
  • [75] IEEE, 2014, Fetch. ROBOTS: Your guide to the world of robotics
  • [76] CNN-Based Fault Detection of Scan Matching for Accurate SLAM in Dynamic Environments
    Jeong, Hyein
    Lee, Heoncheol
    [J]. SENSORS, 2023, 23 (06)
  • [77] Visual-SLAM Classical Framework and Key Techniques: A Review
    Jia, Guanwei
    Li, Xiaoying
    Zhang, Dongming
    Xu, Weiqing
    Lv, Haojie
    Shi, Yan
    Cai, Maolin
    [J]. SENSORS, 2022, 22 (12)
  • [78] Lvio-Fusion: A Self-adaptive Multi-sensor Fusion SLAM Framework Using Actor-critic Method
    Jia, Yupeng
    Luo, Haiyong
    Zhao, Fang
    Jiang, Guanlin
    Li, Yuhang
    Yan, Jiaquan
    Jiang, Zhuqing
    Wang, Zitian
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 286 - 293
  • [79] Jiao ZH, 2024, Arxiv, DOI [arXiv:2404.14822, 10.48550/arxiv.2404.14822, DOI 10.48550/ARXIV.2404.14822]
  • [80] Proximal policy optimization based dynamic path planning algorithm for mobile robots
    Jin, Xin
    Wang, Zhengxiao
    [J]. ELECTRONICS LETTERS, 2022, 58 (01) : 13 - 15