DVDS: A deep visual dynamic slam system

被引：4

作者：

Xie, Tao ^{[1
]}

Sun, Qihao ^{[1
]}

Sun, Tao ^{[1
]}

Zhang, Jinhang ^{[1
]}

Dai, Kun ^{[1
]}

Zhao, Lijun ^{[1
]}

Wang, Ke ^{[1
]}

Li, Ruifeng ^{[1
]}

机构：

[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150001, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2025年 / 260卷

基金：

中国国家自然科学基金;

关键词：

simultaneous localization and mapping; Transformer; Deep learning; VERSATILE;

D O I：

10.1016/j.eswa.2024.125438

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Simultaneous localization and mapping (SLAM) utilizing visual sensors represent an extensively investigated research area, holding significant potential for advancements in robotics and autonomous vehicular systems. Recently, dense SLAM systems underpinned by learning-based methodologies have showcased superior accuracy and robustness compared to conventional techniques. Nevertheless, contemporary learning-based SLAM systems exhibit notable discrepancies in pose estimation, particularly within dynamic environments. In addition, the constrained receptive field of convolutional features in these methods impedes their efficacy when confronted with homogeneous, texture-less images, rendering them vulnerable to noise perturbations. We develop a novel deep visual dynamic slam (DVDS) system that exploits solely static pixels within images to retrieve the camera poses. Specifically, we formulate a dynamic object exclusion mechanism that excises dynamic constituents within the scene before the optical flow computation, thus optimizing the precision of the estimation. In addition, we unveil an efficient dispersive transformer (DisFormer) that facilitates per-pixel features in assimilating long-range information from surrounding features, culminating in constructing more precise 4D correlation volumes. Building on the DisFormer, we suggest a Disformer-based gated recurrent unit (GRU) to generate a refined flow field coupled with a confidence map, which is subsequently employed by the dense bundle adjustment layer to iteratively rectify the residuals of inverse depths and associated camera poses. The global receptive field provided by the DisFormer promotes information integration from a wider contextual window, thus improving the robustness of our SLAM system. Comprehensive experiments underscore that our proposed DVDS system manifests superior efficacy compared with state-of-the-art works across both static and dynamic scenes.

引用

页数：15

共 50 条

[41] Survey of Visual SLAM Based on Deep Learning
Huang Z.
Shao C.
Jiqiren/Robot, 2023, 45 (06): : 756 - 768
[42] PPS-SLAM: Dynamic Visual SLAM with a Precise Pruning Strategy
Peng, Jiansheng
Qian, Wei
Zhang, Hongyu
CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (02): : 2849 - 2868
[43] Overview of deep learning application on visual SLAM
Li, Shaopeng
Zhang, Daqiao
Xian, Yong
Li, Bangjie
Zhang, Tao
Zhong, Chengliang
DISPLAYS, 2022, 74
[44] Survey of Deep Learning Technology in Visual SLAM
Meng, Haijun
Lu, Huimin
20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024, 2024, : 37 - 42
[45] A Visual-Inertial Dynamic Object Tracking SLAM Tightly Coupled System
Zhang, Hanxuan
Wang, Dingyi
Huo, Ju
IEEE SENSORS JOURNAL, 2023, 23 (17) : 19905 - 19917
[46] DNIV-SLAM: Neural Implicit Visual SLAM in Dynamic Environments
Yang, Feng
Wang, Yanbo
Tan, Liwen
Li, Mingrui
Shan, Hongjian
Pan, Peng
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 33 - 47
[47] ATY-SLAM: A Visual Semantic SLAM for Dynamic Indoor Environments
Qi, Hao
Hu, Zhuhua
Xiang, Yunfeng
Cai, Dupeng
Zhao, Yaochi
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 3 - 14
[48] DS-SLAM: A Semantic Visual SLAM towards Dynamic Environments
Yu, Chao
Liu, Zuxin
Liu, Xin-Jun
Xie, Fugui
Yang, Yi
Wei, Qi
Fei, Qiao
2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 1168 - 1174
[49] OFM-SLAM: A Visual Semantic SLAM for Dynamic Indoor Environments
Zhao, Xiong
Zuo, Tao
Hu, Xinyu
MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
[50] A real-time semantic visual SLAM for dynamic environment based on deep learning and dynamic probabilistic propagation
Liang Chen
Zhi Ling
Yu Gao
Rongchuan Sun
Sheng Jin
Complex & Intelligent Systems, 2023, 9 : 5653 - 5677

← 1 2 3 4 5 →