DVDS: A deep visual dynamic slam system

被引：4

作者：

Xie, Tao ^{[1
]}

Sun, Qihao ^{[1
]}

Sun, Tao ^{[1
]}

Zhang, Jinhang ^{[1
]}

Dai, Kun ^{[1
]}

Zhao, Lijun ^{[1
]}

Wang, Ke ^{[1
]}

Li, Ruifeng ^{[1
]}

机构：

[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150001, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2025年 / 260卷

基金：

中国国家自然科学基金;

关键词：

simultaneous localization and mapping; Transformer; Deep learning; VERSATILE;

D O I：

10.1016/j.eswa.2024.125438

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Simultaneous localization and mapping (SLAM) utilizing visual sensors represent an extensively investigated research area, holding significant potential for advancements in robotics and autonomous vehicular systems. Recently, dense SLAM systems underpinned by learning-based methodologies have showcased superior accuracy and robustness compared to conventional techniques. Nevertheless, contemporary learning-based SLAM systems exhibit notable discrepancies in pose estimation, particularly within dynamic environments. In addition, the constrained receptive field of convolutional features in these methods impedes their efficacy when confronted with homogeneous, texture-less images, rendering them vulnerable to noise perturbations. We develop a novel deep visual dynamic slam (DVDS) system that exploits solely static pixels within images to retrieve the camera poses. Specifically, we formulate a dynamic object exclusion mechanism that excises dynamic constituents within the scene before the optical flow computation, thus optimizing the precision of the estimation. In addition, we unveil an efficient dispersive transformer (DisFormer) that facilitates per-pixel features in assimilating long-range information from surrounding features, culminating in constructing more precise 4D correlation volumes. Building on the DisFormer, we suggest a Disformer-based gated recurrent unit (GRU) to generate a refined flow field coupled with a confidence map, which is subsequently employed by the dense bundle adjustment layer to iteratively rectify the residuals of inverse depths and associated camera poses. The global receptive field provided by the DisFormer promotes information integration from a wider contextual window, thus improving the robustness of our SLAM system. Comprehensive experiments underscore that our proposed DVDS system manifests superior efficacy compared with state-of-the-art works across both static and dynamic scenes.

引用

页数：15

共 50 条

[11] BY-SLAM: Dynamic Visual SLAM System Based on BEBLID and Semantic Information Extraction
Zhu, Daixian
Liu, Peixuan
Qiu, Qiang
Wei, Jiaxin
Gong, Ruolin
SENSORS, 2024, 24 (14)
[12] DXSLAM: A Robust and Efficient Visual SLAM System with Deep Features
Li, Dongjiang
Shi, Xuesong
Long, Qiwei
Liu, Shenghui
Yang, Wei
Wang, Fangshi
Wei, Qi
Qiao, Fei
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 4958 - 4965
[13] OTE-SLAM: An Object Tracking Enhanced Visual SLAM System for Dynamic Environments
Chang, Yimeng
Hu, Jun
Xu, Shiyou
SENSORS, 2023, 23 (18)
[14] DMS-SLAM: A General Visual SLAM System for Dynamic Scenes with Multiple Sensors
Liu, Guihua
Zeng, Weilin
Feng, Bo
Xu, Feng
SENSORS, 2019, 19 (17)
[15] A Dynamic Visual SLAM System Incorporating Object Tracking for UAVs
Li, Minglei
Li, Jia
Cao, Yanan
Chen, Guangyong
DRONES, 2024, 8 (06)
[16] A Novel Visual SLAM System for Autonomous Vehicles in Dynamic Environments
Zeng, Xinyu
He, Ying
Yu, F. Richard
Zhou, Guang
2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
[17] A comprehensive overview of dynamic visual SLAM and deep learning: concepts, methods and challenges
Ayman Beghdadi
Malik Mallem
Machine Vision and Applications, 2022, 33
[18] A comprehensive overview of dynamic visual SLAM and deep learning: concepts, methods and challenges
Beghdadi, Ayman
Mallem, Malik
MACHINE VISION AND APPLICATIONS, 2022, 33 (04)
[19] Visual SLAM method for dynamic environment based on deep learning image features
Liu D.
Yu T.
Cong M.
Du Y.
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2024, 52 (06): : 156 - 163
[20] A robust visual SLAM system in dynamic man-made environments
LIU JiaCheng
MENG ZiYang
YOU Zheng
Science China(Technological Sciences), 2020, 63 (09) : 1628 - 1636

← 1 2 3 4 5 →