Unlocking robotic perception: comparison of deep learning methods for simultaneous localization and mapping and visual simultaneous localization and mapping in robot

被引:0
作者
Hoang, Minh Long [1 ]
机构
[1] Univ Parma, Dept Engn & Architecture, I-43124 Parma, Italy
关键词
Deep learning; Simultaneous localization and mapping; Visual SLAM; Robot; NEURAL-NETWORKS; SLAM; ATTENTION; RECOGNITION; FUSION; IMAGE;
D O I
10.1007/s41315-025-00419-5
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Simultaneous Localization and Mapping (SLAM) and Visual SLAM are crucial technologies in robotics, allowing autonomous systems to navigate and comprehend their environment. Deep learning (DL) has become a powerful tool in driving progress in these areas, providing solutions that improve accuracy, efficiency, and resilience. This article thoroughly analyzes different deep learning techniques designed explicitly for SLAM and Visual SLAM applications in robotic systems. This work provides a detailed overview of DL roles in SLAM and VSLAM and emphasizes the differences between these two fields. Five powerful DL methods are investigated: Convolutional Neural Networks in extracting features and understanding meaning, Recurrent Neural Network in modeling temporal relationships, Deep Reinforcement Learning in developing exploration strategies, Graph Neural Network in modeling spatial relationships, and Attention Mechanisms in selectively processing information. In this research, we will examine the advantages and disadvantages of each approach in relation to robotic applications, taking into account issues such as real-time performance, resource restrictions, and adaptability to various situations. This article seeks to guide researchers and practitioners in selecting suitable deep learning algorithms to improve the capabilities of SLAM and Visual SLAM in robotic systems by combining ideas from recent research and actual implementations. The popular types of each concerned DL will be synthesized with the discussion of pros and cons.
引用
收藏
页数:33
相关论文
共 208 条
  • [1] A Review of Recurrent Neural Network Based Camera Localization for Indoor Environments
    Alam, Muhammad Shamsul
    Mohamed, Farhan Bin
    Selamat, Ali
    Hossain, Akm Bellal
    [J]. IEEE ACCESS, 2023, 11 : 43985 - 44009
  • [2] Attention-based recurrent neural network for multistep-ahead prediction of process performance
    Aliabadi, Majid Moradi
    Emami, Hajar
    Dong, Ming
    Huang, Yinlun
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2020, 140 (140)
  • [3] Alomar K, 2024, Arxiv, DOI [arXiv:2407.06162, 10.48550/arxiv.2407.06162]
  • [4] Alsabban MS, 2021, APPEEC 2021: 2021 13TH IEEE PES ASIA PACIFIC POWER &AMP
  • [5] Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) Power Forecasting
    Alsabban, Maha S.
    Salem, Nema
    Malik, Hebatullah M.
    [J]. APPEEC 2021: 2021 13TH IEEE PES ASIA PACIFIC POWER & ENERGY ENGINEERING CONFERENCE (APPEEC), 2021,
  • [6] Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
    Alzubaidi, Laith
    Zhang, Jinglan
    Humaidi, Amjad J.
    Al-Dujaili, Ayad
    Duan, Ye
    Al-Shamma, Omran
    Santamaria, J.
    Fadhel, Mohammed A.
    Al-Amidie, Muthana
    Farhan, Laith
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [7] Amarjyoti S, 2017, Arxiv, DOI [arXiv:1701.08878, DOI 10.48550/ARXIV.1701.08878]
  • [8] Semantic segmentation-aided visual odometry for urban autonomous driving
    An, Lifeng
    Zhang, Xinyu
    Gao, Hongbo
    Liu, Yuchao
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2017, 14 (05): : 1 - 111
  • [9] Visual-LiDAR SLAM Based on Unsupervised Multi-channel Deep Neural Networks
    An, Yi
    Shi, Jin
    Gu, Dongbing
    Liu, Qiang
    [J]. COGNITIVE COMPUTATION, 2022, 14 (04) : 1496 - 1508
  • [10] Andrew A. M., 1998, Reinforcement Learning:983Richard S. Sutton, Andrew G. Barto. Reinforcement learning: an introduction, Vxviii