MSSD-SLAM: Multifeature Semantic RGB-D Inertial SLAM With Structural Regularity for Dynamic Environments

被引:4
作者
Wang, Yanan [1 ]
Tian, Yaobin [1 ]
Chen, Jiawei [1 ]
Chen, Cheng [1 ]
Xu, Kun [1 ]
Ding, Xilun [1 ]
机构
[1] Beihang Univ BUAA, Robot Inst, Sch Mech Engn & Automat, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Simultaneous localization and mapping; Semantics; Heuristic algorithms; Vehicle dynamics; Dynamics; Visualization; Feature extraction; Robustness; Object detection; Accuracy; Dynamic environment; line and plane features; RGB-D inertial simultaneous localization and mapping (SLAM); structural regularity; MONOCULAR SLAM; VERSATILE; ODOMETRY; ACCURATE; TRACKING; ROBUST;
D O I
10.1109/TIM.2024.3509541
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Traditional visual simultaneous localization and mapping (SLAM) methods excel in static, texture-rich environments, but may struggle in dynamic, textureless environments. Recent advances in dynamic SLAM incorporate deep learning techniques to eliminate dynamic objects. However, these methods may suffer from unrecognizable dynamic objects, and removing dynamic feature points exacerbates feature scarcity. To address the limitations, we propose MSSD-SLAM, a multifeature semantic RGB-D inertial SLAM system that incorporates point, line, and plane features to enhance robustness and enrich static map fidelity. By embedding structural constraints on 3-D spatial features constructed from multiframe observations, it guarantees geometric consistency and accurate camera pose estimation, supporting subsequent dynamic object detection. A novel Dynamic Filter has been developed that efficiently handles both semantically recognizable and unrecognizable objects by amalgamating semantic segmentation, structural constraints, and inertial measurement unit (IMU) measurements. Short-term dynamic objects are detected by the consistency of multisource information, while long-term dynamic objects, which may remain static temporarily, are identified through covisible projection among multiple frames. Validation on the TUM dataset and real-world scenarios demonstrates that the localization accuracy of MSSD-SLAM outperforms ORB-SLAM3 and Dynamic-VINS by 76% and 64%, respectively, indicating that our algorithm exhibits superior accuracy and robustness in dynamic indoor scenes compared with state-of-the-art algorithms.
引用
收藏
页数:17
相关论文
共 47 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]   DynaSLAM II: Tightly-Coupled Multi-Object Tracking and SLAM [J].
Bescos, Berta ;
Campos, Carlos ;
Tardos, Juan D. ;
Neira, Jose .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) :5191-5198
[3]   DynaSLAM: Tracking, Mapping, and Inpainting in Dynamic Scenes [J].
Bescos, Berta ;
Facil, Jose M. ;
Civera, Javier ;
Neira, Jose .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :4076-4083
[4]   ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial, and Multimap SLAM [J].
Campos, Carlos ;
Elvira, Richard ;
Gomez Rodriguez, Juan J. ;
Montiel, Jose M. M. ;
Tardos, Juan D. .
IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (06) :1874-1890
[5]   YOLOv4-tiny-based robust RGB-D SLAM approach with point and surface feature fusion in complex indoor environments [J].
Chang, Zhanyuan ;
Wu, Honglin ;
Li, Chuanjiang .
JOURNAL OF FIELD ROBOTICS, 2023, 40 (03) :521-534
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]   SG-SLAM: A Real-Time RGB-D Visual SLAM Toward Dynamic Scenes With Semantic and Geometric Information [J].
Cheng, Shuhong ;
Sun, Changhe ;
Zhang, Shijun ;
Zhang, Dianfan .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[8]   Accurate Dynamic SLAM Using CRF-Based Long-Term Consistency [J].
Du, Zheng-Jun ;
Huang, Shi-Sheng ;
Mu, Tai-Jiang ;
Zhao, Qunhe ;
Martin, Ralph R. ;
Xu, Kun .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (04) :1745-1757
[9]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[10]  
Feng C, 2014, IEEE INT CONF ROBOT, P6218, DOI 10.1109/ICRA.2014.6907776