RGB-D Based Visual SLAM Algorithm for Indoor Crowd Environment

Cited by: 4
Authors
Li, Jianfeng [1 ,2 ,3 ]
Dai, Juan [1 ,2 ,3 ]
Su, Zhong [1 ,2 ,3 ]
Zhu, Cui [4 ]
Affiliations
[1] Beijing Informat Sci & Technol Univ, Beijing Key Lab High Dynam Nav Technol, Beijing 100192, Peoples R China
[2] Minist Educ, Key Lab Modern Measurement & Control Technol, Beijing 100192, Peoples R China
[3] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100192, Peoples R China
[4] Beijing Informat Sci & Technol Univ, Sch Informat & Commun Engn, Beijing 100101, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Visual SLAM; Indoor environment; Object detection; Dynamic environment;
DOI
10.1007/s10846-023-02046-3
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Most current research on dynamic visual Simultaneous Localization and Mapping (SLAM) systems focuses on scenes where static objects occupy most of the environment. In densely populated indoor environments, however, crowd movement can cause a loss of feature information, which reduces the system's robustness and accuracy. This paper proposes a visual SLAM algorithm for dense crowd environments that combines the ORB-SLAM2 framework with RGB-D cameras. First, we introduce a dedicated target detection network thread and improve the performance of the target detection network, enhancing its detection coverage in crowded environments and yielding a 41.5% increase in average accuracy. We also found that some feature points inside the detection box that do not belong to humans were mistakenly deleted, so we propose an algorithm based on standard deviation fitting to filter the features effectively. Finally, we evaluate our system on the TUM and Bonn RGB-D dynamic datasets and compare it with ORB-SLAM2 and other state-of-the-art visual dynamic SLAM methods. The results indicate that, compared to ORB-SLAM2, our system's pose estimation error is reduced by at least 93.60% in highly dynamic environments and by at least 97.11% on the Bonn RGB-D dynamic dataset. Our method performs comparably to other recent visual dynamic SLAM methods.
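The abstract does not say how the standard-deviation-based filter works, so the following is only an illustrative sketch of one way such a filter could look, not the authors' algorithm. It assumes that each feature point has an RGB-D depth value, that the person dominates the detection box so the median in-box depth approximates the person's depth, and that points whose depth differs from that median by more than `k` standard deviations are background and can be kept. The function name `split_keypoints`, the parameter `k`, and the rule for invalid depths are all hypothetical.

```python
from statistics import median, pstdev


def split_keypoints(keypoints, depths, box, k=1.0):
    """Split keypoint indices into (static, dynamic) lists.

    keypoints: list of (u, v) pixel coordinates
    depths:    list of depth values in metres, 0 meaning "no valid depth"
    box:       (x_min, y_min, x_max, y_max) of one person detection
    k:         hypothetical threshold in units of standard deviation
    """
    x0, y0, x1, y1 = box
    in_box = [i for i, (u, v) in enumerate(keypoints)
              if x0 <= u <= x1 and y0 <= v <= y1]
    in_box_set = set(in_box)

    # Points outside the detection box are kept as static.
    static = [i for i in range(len(keypoints)) if i not in in_box_set]
    dynamic = []

    valid = [depths[i] for i in in_box if depths[i] > 0]
    if not valid:
        # No usable depth: conservatively treat every in-box point as dynamic.
        return static, in_box

    centre = median(valid)   # assumed to approximate the person's depth
    spread = pstdev(valid)   # population standard deviation of in-box depths

    for i in in_box:
        d = depths[i]
        if d > 0 and abs(d - centre) > k * spread:
            static.append(i)    # far from the person's depth: likely background
        else:
            dynamic.append(i)   # near the person's depth, or depth is invalid
    return sorted(static), dynamic


# Example: one point outside the box, four on the person, one background
# point seen through the box, and one point with missing depth.
keypoints = [(10, 10), (60, 60), (100, 100), (100, 150),
             (120, 200), (140, 240), (55, 55)]
depths = [2.0, 1.5, 1.5, 1.6, 1.5, 4.0, 0.0]
static_idx, dynamic_idx = split_keypoints(keypoints, depths, (50, 50, 150, 250))
```

In this sketch the background point at 4.0 m is retained even though it lies inside the detection box, which is the kind of mistaken deletion the abstract says the method is meant to avoid.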
Pages: 14
Related papers
19 in total
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]   DynaSLAM: Tracking, Mapping, and Inpainting in Dynamic Scenes [J].
Bescos, Berta ;
Facil, Jose M. ;
Civera, Javier ;
Neira, Jose .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :4076-4083
[3]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[4]   ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial, and Multimap SLAM [J].
Campos, Carlos ;
Elvira, Richard ;
Gomez Rodriguez, Juan J. ;
Montiel, Jose M. M. ;
Tardos, Juan D. .
IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (06) :1874-1890
[5]   SOF-SLAM: A Semantic Visual SLAM for Dynamic Environments [J].
Cui, Linyan ;
Ma, Chaowei .
IEEE ACCESS, 2019, 7 :166528-166539
[6]  
Dendorfer P., 2020, arXiv
[7]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[8]   RGB-D SLAM in Dynamic Environments Using Static Point Weighting [J].
Li, Shile ;
Lee, Dongheui .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (04) :2263-2270
[9]   RDS-SLAM: Real-Time Dynamic SLAM Using Semantic Segmentation Methods [J].
Liu, Yubao ;
Jun, Miura .
IEEE ACCESS, 2021, 9 :23772-23785
[10]   ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras [J].
Mur-Artal, Raul ;
Tardos, Juan D. .
IEEE TRANSACTIONS ON ROBOTICS, 2017, 33 (05) :1255-1262