DVDS: A deep visual dynamic SLAM system

Cited by: 4
Authors
Xie, Tao [1]
Sun, Qihao [1]
Sun, Tao [1]
Zhang, Jinhang [1]
Dai, Kun [1]
Zhao, Lijun [1]
Wang, Ke [1]
Li, Ruifeng [1]
Affiliations
[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150001, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
simultaneous localization and mapping; Transformer; Deep learning; VERSATILE;
DOI
10.1016/j.eswa.2024.125438
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Simultaneous localization and mapping (SLAM) with visual sensors is an extensively investigated research area with significant potential for robotics and autonomous vehicles. Recently, dense SLAM systems built on learning-based methods have shown superior accuracy and robustness compared with conventional techniques. Nevertheless, current learning-based SLAM systems exhibit notable errors in pose estimation, particularly in dynamic environments. In addition, the limited receptive field of convolutional features in these methods reduces their effectiveness on homogeneous, textureless images and leaves them vulnerable to noise. We develop a novel deep visual dynamic SLAM (DVDS) system that uses only the static pixels in images to recover camera poses. Specifically, we formulate a dynamic object exclusion mechanism that removes dynamic elements of the scene before the optical flow computation, thereby improving the precision of the estimation. In addition, we introduce an efficient dispersive transformer (DisFormer) that lets per-pixel features absorb long-range information from surrounding features, yielding more precise 4D correlation volumes. Building on the DisFormer, we propose a DisFormer-based gated recurrent unit (GRU) that generates a refined flow field together with a confidence map, which the dense bundle adjustment layer then uses to iteratively correct the residuals of inverse depths and the associated camera poses. The global receptive field provided by the DisFormer integrates information from a wider contextual window, improving the robustness of our SLAM system. Comprehensive experiments show that the proposed DVDS system outperforms state-of-the-art methods in both static and dynamic scenes.
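To make the term "4D correlation volume" concrete, the following is a minimal sketch of the generic all-pairs construction used in dense optical-flow pipelines, combined with a static-pixel mask. It is not the DVDS implementation: the function name `correlation_volume`, the `static_mask` argument, the scaling by the square root of the feature dimension, and the choice to zero out the slices of dynamic pixels are all assumptions made for illustration, and the abstract does not specify how the exclusion mechanism or the DisFormer features are actually computed.

```python
import math


def correlation_volume(feat1, feat2, static_mask=None):
    """All-pairs correlation between two feature maps (illustrative only).

    feat1, feat2: nested lists of shape H x W x D (per-pixel feature vectors).
    static_mask:  optional H x W nested list of booleans for feat1;
                  False marks a pixel assumed to belong to a dynamic object.
    Returns corr with corr[i][j][k][l] = <feat1[i][j], feat2[k][l]> / sqrt(D).
    """
    h1, w1 = len(feat1), len(feat1[0])
    h2, w2 = len(feat2), len(feat2[0])
    scale = 1.0 / math.sqrt(len(feat1[0][0]))
    corr = [[[[0.0] * w2 for _ in range(h2)] for _ in range(w1)]
            for _ in range(h1)]
    for i in range(h1):
        for j in range(w1):
            if static_mask is not None and not static_mask[i][j]:
                continue  # assumed dynamic pixel: its slice stays at zero
            for k in range(h2):
                for l in range(w2):
                    dot = sum(a * b for a, b in zip(feat1[i][j], feat2[k][l]))
                    corr[i][j][k][l] = dot * scale
    return corr


# Toy 1 x 2 feature maps with 2-dimensional features (hypothetical values).
feat1 = [[[1.0, 0.0], [0.0, 1.0]]]
feat2 = [[[1.0, 0.0], [1.0, 1.0]]]
mask = [[True, False]]  # second pixel of frame 1 treated as dynamic
corr = correlation_volume(feat1, feat2, mask)
```

In this toy case the static pixel correlates with every pixel of the second frame, while the masked pixel contributes nothing, which mirrors the idea of estimating poses from static pixels only.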
Pages: 15