Unlocking robotic perception: comparison of deep learning methods for simultaneous localization and mapping and visual simultaneous localization and mapping in robot

被引:0
作者
Hoang, Minh Long [1 ]
机构
[1] Univ Parma, Dept Engn & Architecture, I-43124 Parma, Italy
关键词
Deep learning; Simultaneous localization and mapping; Visual SLAM; Robot; NEURAL-NETWORKS; SLAM; ATTENTION; RECOGNITION; FUSION; IMAGE;
D O I
10.1007/s41315-025-00419-5
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Simultaneous Localization and Mapping (SLAM) and Visual SLAM are crucial technologies in robotics, allowing autonomous systems to navigate and comprehend their environment. Deep learning (DL) has become a powerful tool in driving progress in these areas, providing solutions that improve accuracy, efficiency, and resilience. This article thoroughly analyzes different deep learning techniques designed explicitly for SLAM and Visual SLAM applications in robotic systems. This work provides a detailed overview of DL roles in SLAM and VSLAM and emphasizes the differences between these two fields. Five powerful DL methods are investigated: Convolutional Neural Networks in extracting features and understanding meaning, Recurrent Neural Network in modeling temporal relationships, Deep Reinforcement Learning in developing exploration strategies, Graph Neural Network in modeling spatial relationships, and Attention Mechanisms in selectively processing information. In this research, we will examine the advantages and disadvantages of each approach in relation to robotic applications, taking into account issues such as real-time performance, resource restrictions, and adaptability to various situations. This article seeks to guide researchers and practitioners in selecting suitable deep learning algorithms to improve the capabilities of SLAM and Visual SLAM in robotic systems by combining ideas from recent research and actual implementations. The popular types of each concerned DL will be synthesized with the discussion of pros and cons.
引用
收藏
页数:33
相关论文
共 208 条
  • [81] Enhancement of Perivascular Spaces Using Densely Connected Deep Convolutional Neural Network
    Jung, Euijin
    Chikontwe, Philip
    Zong, Xiaopeng
    Lin, Weili
    Shen, Dinggang
    Park, Sang Hyun
    [J]. IEEE ACCESS, 2019, 7 : 18382 - 18391
  • [82] End-to-End Learning of Geometry and Context for Deep Stereo Regression
    Kendall, Alex
    Martirosyan, Hayk
    Dasgupta, Saumitro
    Henry, Peter
    Kennedy, Ryan
    Bachrach, Abraham
    Bry, Adam
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 66 - 75
  • [83] PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization
    Kendall, Alex
    Grimes, Matthew
    Cipolla, Roberto
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2938 - 2946
  • [84] Khaled M., 2019, Int. Arch. Photogram. Remote Sens. Spatial Inf. Sci., VXLII-2/W13, P857, DOI [10.5194/isprs-archives-xlii-2-w13-857-2019, DOI 10.5194/ISPRS-ARCHIVES-XLII-2-W13-857-2019]
  • [85] Context Representation and Fusion: Advancements and Opportunities
    Khattak, Asad Masood
    Akbar, Noman
    Aazam, Mohammad
    Ali, Taqdir
    Khan, Adil Mehmood
    Jeon, Seokhee
    Hwang, Myunggwon
    Lee, Sungyoung
    [J]. SENSORS, 2014, 14 (06) : 9628 - 9668
  • [86] A review of graph neural networks: concepts, architectures, techniques, challenges, datasets, applications, and future directions
    Khemani, Bharti
    Patil, Shruti
    Kotecha, Ketan
    Tanwar, Sudeep
    [J]. JOURNAL OF BIG DATA, 2024, 11 (01)
  • [87] Pose Estimation Utilizing a Gated Recurrent Unit Network for Visual Localization
    Kim, Sungkwan
    Kim, Inhwan
    Vecchietti, Luiz Felipe
    Har, Dongsoo
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (24): : 1 - 19
  • [88] Kipf T, 2018, Arxiv, DOI [arXiv:1802.04687, DOI 10.48550/ARXIV.1802.04687]
  • [89] Kozma R., 2022, Artificial intelligence in the age of neural networks and brain computing
  • [90] Socially compliant mobile robot navigation via inverse reinforcement learning
    Kretzschmar, Henrik
    Spies, Markus
    Sprunk, Christoph
    Burgard, Wolfram
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2016, 35 (11) : 1352 - 1370