Visual SLAM Integration With Semantic Segmentation and Deep Learning: A Review

被引:31
|
作者
Pu, Huayan [1 ]
Luo, Jun [1 ]
Wang, Gang [1 ]
Huang, Tao [1 ]
Liu, Hongliang [1 ]
Luo, Jun [1 ]
机构
[1] Chongqing Univ, Coll Mech & Vehicle Engn, State Key Lab Mech Transmiss, Chongqing 400044, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Deep learning; robots; semantic segmentation; simultaneous localization and mapping (SLAM); INERTIAL ODOMETRY; SIMULTANEOUS LOCALIZATION; K-MEANS; TRACKING; ROBUST; RECONSTRUCTION; VERSATILE; VISION; ONLINE;
D O I
10.1109/JSEN.2023.3306371
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Simultaneous localization and mapping (SLAM) technology is essential for robots to navigate unfamiliar environments. It utilizes the sensors the robot carries to answer the question "Where am I?" Of the available sensors, cameras are commonly used. Compared to other sensors like light detection and ranging (LiDARs), the method based on cameras, known as visual SLAM, has been extensively explored by researchers due to the affordability and rich image data cameras provide. Although conventional visual SLAM algorithms have been able to accurately build a map in static environments, dynamic environments present a significant challenge for visual SLAM in practical robotics scenarios. While efforts have been made to address this issue, such as adding semantic segmentation to conventional algorithms, a comprehensive literature review is still lacking. This article discusses the challenges and approaches of visual SLAM with a focus on dynamic objects and their impact on feature extraction and mapping accuracy. First, two classical approaches of conventional visual SLAM are reviewed; then, this article explores the application of deep learning in the front-end and back-end of visual SLAM. Next, visual SLAM in dynamic environments is analyzed and summarized, and insights into future developments are elaborated upon. This article provides effective inspiration for researchers on how to combine deep learning and semantic segmentation with visual SLAM to promote its development.
引用
收藏
页码:22119 / 22138
页数:20
相关论文
共 50 条
  • [1] An Overview on Visual SLAM: From Tradition to Semantic
    Chen, Weifeng
    Shang, Guangtao
    Ji, Aihong
    Zhou, Chengjun
    Wang, Xiyang
    Xu, Chonghui
    Li, Zhenxiong
    Hu, Kai
    REMOTE SENSING, 2022, 14 (13)
  • [2] DFS-SLAM: A Visual SLAM Algorithm for Deep Fusion of Semantic Information
    Jiao, Songming
    Li, Yan
    Shan, Zhengwen
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (12): : 11794 - 11801
  • [3] A comprehensive overview of dynamic visual SLAM and deep learning: concepts, methods and challenges
    Beghdadi, Ayman
    Mallem, Malik
    MACHINE VISION AND APPLICATIONS, 2022, 33 (04)
  • [4] Overview of deep learning application on visual SLAM
    Li, Shaopeng
    Zhang, Daqiao
    Xian, Yong
    Li, Bangjie
    Zhang, Tao
    Zhong, Chengliang
    DISPLAYS, 2022, 74
  • [5] DMS-SLAM: semantic visual SLAM based on deep mask segmentation in dynamic environments
    Gao, Shuyuan
    Zhang, Minhui
    Gao, Xicheng
    Zhang, Dawei
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (04)
  • [6] Review of Deep Learning-Based Semantic Segmentation
    Zhang Xiangfu
    Jian, Liu
    Shi Zhangsong
    Wu Zhonghong
    Zhi, Wang
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (15)
  • [7] Deep Learning for Visual SLAM in Transportation Robotics: A review
    Duan, Chao
    Junginger, Steffen
    Huang, Jiahao
    Jin, Kairong
    Thurow, Kerstin
    TRANSPORTATION SAFETY AND ENVIRONMENT, 2019, 1 (03) : 177 - 184
  • [8] Review of Semantic Segmentation of Point Cloud Based on Deep Learning
    Zhang Jiaying
    Zhao Xiaoli
    Chen Zheng
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (04)
  • [9] Deep Learning Techniques for Visual SLAM: A Survey
    Mokssit, Saad
    Licea, Daniel Bonilla
    Guermah, Bassma
    Ghogho, Mounir
    IEEE ACCESS, 2023, 11 : 20026 - 20050
  • [10] A Review of Deep Learning-Based Semantic Segmentation for Point Cloud
    Zhang, Jiaying
    Zhao, Xiaoli
    Chen, Zheng
    Lu, Zhejun
    IEEE ACCESS, 2019, 7 : 179118 - 179133