RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM With Neural Radiance Fields

被引:8
作者
Jiang, Haochen [1 ]
Xu, Yueming [2 ]
Li, Kejie [3 ]
Feng, Jianfeng [2 ]
Zhang, Li [1 ]
机构
[1] Fudan Univ, Sch Data Sci, Shanghai 200433, Peoples R China
[2] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, Shanghai 200433, Peoples R China
[3] ByteDance, Seattle, WA USA
基金
上海市自然科学基金; 国家重点研发计划; 中国国家自然科学基金;
关键词
Simultaneous localization and mapping; Dynamics; Pose estimation; Cameras; Robustness; Optimization; Geometry; Deep learning methods; dynamic scene; NeRF; pose estimation; RGB-D SLAM; TRACKING;
D O I
10.1109/LRA.2024.3427554
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Leveraging neural implicit representation to conduct dense RGB-D SLAM has been studied in recent years. However, this approach relies on a static environment assumption and does not work robustly within a dynamic environment due to the inconsistent observation of geometry and photometry. To address the challenges presented in dynamic environments, we propose a novel dynamic SLAM framework with neural radiance field. Specifically, we introduce a motion mask generation method to filter out the invalid sampled rays. This design effectively fuses the optical flow mask and semantic mask to enhance the precision of motion mask. To further improve the accuracy of pose estimation, we have designed a divide-and-conquer pose optimization algorithm that distinguishes between keyframes and non-keyframes. The proposed edge warp loss can effectively enhance the geometry constraints between adjacent frames. Extensive experiments are conducted on the two challenging datasets, and the results show that RoDyn-SLAM achieves state-of-the-art performance among recent neural RGB-D methods in both accuracy and robustness. Our implementation of the Rodyn-SLAM will be open-sourced to benefit the community.
引用
收藏
页码:7509 / 7516
页数:8
相关论文
共 45 条
[1]   Neural RGB-D Surface Reconstruction [J].
Azinovic, Dejan ;
Martin-Brualla, Ricardo ;
Goldman, Dan B. ;
Niessner, Matthias ;
Thies, Justus .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :6280-6291
[2]   DynaSLAM II: Tightly-Coupled Multi-Object Tracking and SLAM [J].
Bescos, Berta ;
Campos, Carlos ;
Tardos, Juan D. ;
Neira, Jose .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) :5191-5198
[3]   DynaSLAM: Tracking, Mapping, and Inpainting in Dynamic Scenes [J].
Bescos, Berta ;
Facil, Jose M. ;
Civera, Javier ;
Neira, Jose .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :4076-4083
[4]   NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior [J].
Bian, Wenjing ;
Wang, Zirui ;
Li, Kejie ;
Bian, Jia-Wang ;
Prisacariu, Victor Adrian .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :4160-4169
[5]   ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial, and Multimap SLAM [J].
Campos, Carlos ;
Elvira, Richard ;
Gomez Rodriguez, Juan J. ;
Montiel, Jose M. M. ;
Tardos, Juan D. .
IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (06) :1874-1890
[6]   Flow Supervised Neural Radiance Fields for Static-Dynamic Decomposition [J].
Chen, Quei-An ;
Tsukada, Akihiro .
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, :10641-10647
[7]   Improving monocular visual SLAM in dynamic environments: an optical-flow-based approach [J].
Cheng, Jiyu ;
Sun, Yuxiang ;
Meng, Max Q-H .
ADVANCED ROBOTICS, 2019, 33 (12) :576-589
[8]  
Felzenszwalb P.F., 2012, Theory Comput, V8, P415, DOI DOI 10.4086/TOC.2012.V008A019
[9]   Dynamic View Synthesis from Dynamic Monocular Video [J].
Gao, Chen ;
Saraf, Ayush ;
Kopf, Johannes ;
Huang, Jia-Bin .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :5692-5701
[10]   CLOSED-FORM SOLUTION OF ABSOLUTE ORIENTATION USING UNIT QUATERNIONS [J].
HORN, BKP .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1987, 4 (04) :629-642