DVDS: A deep visual dynamic slam system

被引:4
|
作者
Xie, Tao [1 ]
Sun, Qihao [1 ]
Sun, Tao [1 ]
Zhang, Jinhang [1 ]
Dai, Kun [1 ]
Zhao, Lijun [1 ]
Wang, Ke [1 ]
Li, Ruifeng [1 ]
机构
[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
simultaneous localization and mapping; Transformer; Deep learning; VERSATILE;
D O I
10.1016/j.eswa.2024.125438
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Simultaneous localization and mapping (SLAM) utilizing visual sensors represent an extensively investigated research area, holding significant potential for advancements in robotics and autonomous vehicular systems. Recently, dense SLAM systems underpinned by learning-based methodologies have showcased superior accuracy and robustness compared to conventional techniques. Nevertheless, contemporary learning-based SLAM systems exhibit notable discrepancies in pose estimation, particularly within dynamic environments. In addition, the constrained receptive field of convolutional features in these methods impedes their efficacy when confronted with homogeneous, texture-less images, rendering them vulnerable to noise perturbations. We develop a novel deep visual dynamic slam (DVDS) system that exploits solely static pixels within images to retrieve the camera poses. Specifically, we formulate a dynamic object exclusion mechanism that excises dynamic constituents within the scene before the optical flow computation, thus optimizing the precision of the estimation. In addition, we unveil an efficient dispersive transformer (DisFormer) that facilitates per-pixel features in assimilating long-range information from surrounding features, culminating in constructing more precise 4D correlation volumes. Building on the DisFormer, we suggest a Disformer-based gated recurrent unit (GRU) to generate a refined flow field coupled with a confidence map, which is subsequently employed by the dense bundle adjustment layer to iteratively rectify the residuals of inverse depths and associated camera poses. The global receptive field provided by the DisFormer promotes information integration from a wider contextual window, thus improving the robustness of our SLAM system. Comprehensive experiments underscore that our proposed DVDS system manifests superior efficacy compared with state-of-the-art works across both static and dynamic scenes.
引用
收藏
页数:15
相关论文
共 50 条
  • [11] BY-SLAM: Dynamic Visual SLAM System Based on BEBLID and Semantic Information Extraction
    Zhu, Daixian
    Liu, Peixuan
    Qiu, Qiang
    Wei, Jiaxin
    Gong, Ruolin
    SENSORS, 2024, 24 (14)
  • [12] DXSLAM: A Robust and Efficient Visual SLAM System with Deep Features
    Li, Dongjiang
    Shi, Xuesong
    Long, Qiwei
    Liu, Shenghui
    Yang, Wei
    Wang, Fangshi
    Wei, Qi
    Qiao, Fei
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 4958 - 4965
  • [13] OTE-SLAM: An Object Tracking Enhanced Visual SLAM System for Dynamic Environments
    Chang, Yimeng
    Hu, Jun
    Xu, Shiyou
    SENSORS, 2023, 23 (18)
  • [14] DMS-SLAM: A General Visual SLAM System for Dynamic Scenes with Multiple Sensors
    Liu, Guihua
    Zeng, Weilin
    Feng, Bo
    Xu, Feng
    SENSORS, 2019, 19 (17)
  • [15] A Dynamic Visual SLAM System Incorporating Object Tracking for UAVs
    Li, Minglei
    Li, Jia
    Cao, Yanan
    Chen, Guangyong
    DRONES, 2024, 8 (06)
  • [16] A Novel Visual SLAM System for Autonomous Vehicles in Dynamic Environments
    Zeng, Xinyu
    He, Ying
    Yu, F. Richard
    Zhou, Guang
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [17] A comprehensive overview of dynamic visual SLAM and deep learning: concepts, methods and challenges
    Ayman Beghdadi
    Malik Mallem
    Machine Vision and Applications, 2022, 33
  • [18] A comprehensive overview of dynamic visual SLAM and deep learning: concepts, methods and challenges
    Beghdadi, Ayman
    Mallem, Malik
    MACHINE VISION AND APPLICATIONS, 2022, 33 (04)
  • [19] Visual SLAM method for dynamic environment based on deep learning image features
    Liu D.
    Yu T.
    Cong M.
    Du Y.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2024, 52 (06): : 156 - 163
  • [20] A robust visual SLAM system in dynamic man-made environments
    LIU JiaCheng
    MENG ZiYang
    YOU Zheng
    Science China(Technological Sciences), 2020, 63 (09) : 1628 - 1636