A dynamic SLAM system with YOLOv7 segmentation and geometric constraints for indoor environments

被引:0
作者
Shen, Yewei [1 ]
Zhang, Xinguang [1 ]
机构
[1] Shanghai Univ Engn Sci, Sch Mech & Automot Engn, Shanghai, Peoples R China
关键词
visual SLAM; YOLOv7; semantic segmentation; optical flow tracking; SIMULTANEOUS LOCALIZATION;
D O I
10.1017/S0263574725101823
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
With the rapid advancements in robotics and autonomous driving, SLAM (simultaneous localization and mapping) has become a crucial technology for real-time localization and map creation, seeing widespread application across various domains. However, SLAM's performance in dynamic environments is often compromised due to the presence of moving objects, which can introduce errors and inconsistencies in localization and mapping. To overcome these challenges, this paper presents a visual SLAM system that employs dynamic feature point rejection. The system leverages a lightweight YOLOv7 model for detecting dynamic objects and performing semantic segmentation. Additionally, it incorporates optical flow tracking and multiview geometry techniques to identify and eliminate dynamic feature points. This approach effectively mitigates the impact of dynamic objects on the SLAM process, while maintaining the integrity of static feature points, ultimately enhancing the system's robustness and accuracy in dynamic environments. Finally, we evaluate our method on the TUM RGB-D dataset and in real-world scenarios. The experimental results demonstrate that our approach significantly reduces both the root mean square error (RMSE) and standard deviation (Std) compared to the ORB-SLAM2 algorithm.
引用
收藏
页数:19
相关论文
共 43 条
[1]   A review of visual SLAM for robotics: evolution, properties, and future applications [J].
Al-Tawil, Basheer ;
Hempel, Thorsten ;
Abdelrahman, Ahmed ;
Al-Hamadi, Ayoub .
FRONTIERS IN ROBOTICS AND AI, 2024, 11
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]   DynaSLAM: Tracking, Mapping, and Inpainting in Dynamic Scenes [J].
Bescos, Berta ;
Facil, Jose M. ;
Civera, Javier ;
Neira, Jose .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :4076-4083
[4]   YOLACT Real-time Instance Segmentation [J].
Bolya, Daniel ;
Zhou, Chong ;
Xiao, Fanyi ;
Lee, Yong Jae .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9156-9165
[5]   Simultaneous Localization and Mapping: A Survey of Current Trends in Autonomous Driving [J].
Bresson, Guillaume ;
Alsayed, Zayed ;
Yu, Li ;
Glaser, Sebastien .
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2017, 2 (03) :194-220
[6]   ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial, and Multimap SLAM [J].
Campos, Carlos ;
Elvira, Richard ;
Gomez Rodriguez, Juan J. ;
Montiel, Jose M. M. ;
Tardos, Juan D. .
IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (06) :1874-1890
[7]  
Chen ZJ, 2018, 2018 13TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), P413, DOI 10.1109/WCICA.2018.8630714
[8]   SG-SLAM: A Real-Time RGB-D Visual SLAM Toward Dynamic Scenes With Semantic and Geometric Information [J].
Cheng, Shuhong ;
Sun, Changhe ;
Zhang, Shijun ;
Zhang, Dianfan .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[9]   RGB-D SLAM in Dynamic Environments Using Point Correlations [J].
Dai, Weichen ;
Zhang, Yu ;
Li, Ping ;
Fang, Zheng ;
Scherer, Sebastian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (01) :373-389
[10]   LSD-SLAM: Large-Scale Direct Monocular SLAM [J].
Engel, Jakob ;
Schoeps, Thomas ;
Cremers, Daniel .
COMPUTER VISION - ECCV 2014, PT II, 2014, 8690 :834-849