PR-SLAM: Parallel Real-Time Dynamic SLAM Method Based on Semantic Segmentation

被引:10
作者
Zhang, Hongyu [1 ]
Peng, Jiansheng [1 ,2 ]
Yang, Qing [1 ]
机构
[1] Guangxi Univ Sci & Technol, Coll Automat, Liuzhou 545000, Peoples R China
[2] Hechi Univ, Dept Artificial Intelligence & Mfg, Hechi 547000, Peoples R China
基金
中国国家自然科学基金;
关键词
Simultaneous localization and mapping; Real-time systems; Indoor environment; Geometric modeling; Autonomous systems; Semantic segmentation; Autonomous robots; Algorithm design and analysis; Semantic SLAM; semantic segmentation; real-time; indoor dynamic scene; autonomous exploration; robustness and accuracy; dynamic probability update strategy; geometric approach; ROBUST;
D O I
10.1109/ACCESS.2024.3373308
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
SLAM (Simultaneous Localization and Mapping) is the core technology enabling autonomous exploration by mobile robots in unknown environments. While there have been numerous impressive SLAM systems developed, many of them are primarily based on the assumption of static environments, limiting their applicability in real-world settings. In order to enhance the robustness and accuracy of systems in dynamic real-world scenarios, we have introduced a parallelized real-time SLAM system called PR-SLAM, building upon the foundation of ORB-SLAM3. This algorithm introduces a dynamic probability update strategy within the semantic segmentation thread, completely decoupling the semantic segmentation thread from the tracking thread. Theoretically, the processing time per frame is solely dependent on the runtime of the tracking thread. Furthermore, we employ a geometric approach based on reprojection error to compensate for semantic gaps generated during semantic segmentation model inference. We have also designed a semantic optimization thread based on the dynamic probability of map points to optimize camera poses during semantic gaps. Finally, to reduce semantic gaps, we have performed lightweight modifications to SOLOV2. Comparative experiments were conducted against the state-of-the-art SLAM systems using the TUM dataset. The results indicate significant improvements in both accuracy and real-time performance for PR-SLAM. When compared to ORB-SLAM3, PR-SLAM achieved a remarkable 97.83% improvement in absolute trajectory accuracy and demonstrated an impressive 86.71% increase in runtime speed compared to DynaSLAM.
引用
收藏
页码:36498 / 36514
页数:17
相关论文
共 36 条
[1]  
Ahmed A., 2014, P IWAC, P1
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]   DynaSLAM: Tracking, Mapping, and Inpainting in Dynamic Scenes [J].
Bescos, Berta ;
Facil, Jose M. ;
Civera, Javier ;
Neira, Jose .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :4076-4083
[4]   ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial, and Multimap SLAM [J].
Campos, Carlos ;
Elvira, Richard ;
Gomez Rodriguez, Juan J. ;
Montiel, Jose M. M. ;
Tardos, Juan D. .
IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (06) :1874-1890
[5]   A review of visual SLAM methods for autonomous driving vehicles [J].
Cheng, Jun ;
Zhang, Liyan ;
Chen, Qihong ;
Hu, Xinrong ;
Cai, Jingcao .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
[6]   SG-SLAM: A Real-Time RGB-D Visual SLAM Toward Dynamic Scenes With Semantic and Geometric Information [J].
Cheng, Shuhong ;
Sun, Changhe ;
Zhang, Shijun ;
Zhang, Dianfan .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[7]  
Demim F., 2018, P INT C SIGN IM VIS, P1
[8]  
Demim F, 2017, INT CONF SYST CONTRO, P6, DOI 10.1109/ICoSC.2017.7958664
[9]   LSD-SLAM: Large-Scale Direct Monocular SLAM [J].
Engel, Jakob ;
Schoeps, Thomas ;
Cremers, Daniel .
COMPUTER VISION - ECCV 2014, PT II, 2014, 8690 :834-849
[10]   Blitz-SLAM: A semantic SLAM in dynamic environments [J].
Fan, Yingchun ;
Zhang, Qichi ;
Tang, Yuliang ;
Liu, Shaofen ;
Han, Hong .
PATTERN RECOGNITION, 2022, 121