YOLO-SLAM: A semantic SLAM system towards dynamic environment with geometric constraint

被引:153
作者
Wu, Wenxin [1 ]
Guo, Liang [1 ]
Gao, Hongli [1 ]
You, Zhichao [1 ]
Liu, Yuekai [1 ]
Chen, Zhiqiang [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Mech Engn, Chengdu 610031, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Visual SLAM; Dynamic environment; Object detection; Geometric constraint; LOCALIZATION; FRAMEWORK;
D O I
10.1007/s00521-021-06764-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Simultaneous localization and mapping (SLAM), as one of the core prerequisite technologies for intelligent mobile robots, has attracted much attention in recent years. However, the traditional SLAM systems rely on the static environment assumption, which becomes unstable for the dynamic environment and further limits the real-world practical applications. To deal with the problem, this paper presents a dynamic-environment-robust visual SLAM system named YOLO-SLAM. In YOLO-SLAM, a lightweight object detection network named Darknet19-YOLOv3 is designed, which adopts a low-latency backbone to accelerate and generate essential semantic information for the SLAM system. Then, a new geometric constraint method is proposed to filter dynamic features in the detecting areas, where dynamic features can be distinguished by utilizing the depth difference with Random Sample Consensus (RANSAC). YOLO-SLAM composes the object detection approach and the geometric constraint method in a tightly coupled manner, which is able to effectively reduce the impact of dynamic objects. Experiments are conducted on the challenging dynamic sequences of TUM dataset and Bonn dataset to evaluate the performance of YOLO-SLAM. The results demonstrate that the RMSE index of absolute trajectory error can be significantly reduced to 98.13% compared with ORB-SLAM2 and 51.28% compared with DS-SLAM, indicating that YOLO-SLAM is able to effectively improve stability and accuracy in the highly dynamic environment.
引用
收藏
页码:6011 / 6026
页数:16
相关论文
共 41 条
[1]   Lightweight Visual Odometry for Autonomous Mobile Robots [J].
Aladem, Mohamed ;
Rawashdeh, Samir A. .
SENSORS, 2018, 18 (09)
[2]   DynaSLAM: Tracking, Mapping, and Inpainting in Dynamic Scenes [J].
Bescos, Berta ;
Facil, Jose M. ;
Civera, Javier ;
Neira, Jose .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :4076-4083
[3]   Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age [J].
Cadena, Cesar ;
Carlone, Luca ;
Carrillo, Henry ;
Latif, Yasir ;
Scaramuzza, Davide ;
Neira, Jose ;
Reid, Ian ;
Leonard, John J. .
IEEE TRANSACTIONS ON ROBOTICS, 2016, 32 (06) :1309-1332
[4]   Robust Visual Localization in Dynamic Environments Based on Sparse Motion Removal [J].
Cheng, Jiyu ;
Wang, Chaoqun ;
Meng, Max Q-H .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2020, 17 (02) :658-669
[5]   3-D Mapping With an RGB-D Camera [J].
Endres, Felix ;
Hess, Juergen ;
Sturm, Juergen ;
Cremers, Daniel ;
Burgard, Wolfram .
IEEE TRANSACTIONS ON ROBOTICS, 2014, 30 (01) :177-187
[6]   LSD-SLAM: Large-Scale Direct Monocular SLAM [J].
Engel, Jakob ;
Schoeps, Thomas ;
Cremers, Daniel .
COMPUTER VISION - ECCV 2014, PT II, 2014, 8690 :834-849
[7]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[8]   Visual simultaneous localization and mapping: a survey [J].
Fuentes-Pacheco, Jorge ;
Ruiz-Ascencio, Jose ;
Manuel Rendon-Mancha, Juan .
ARTIFICIAL INTELLIGENCE REVIEW, 2015, 43 (01) :55-81
[9]   An evaluation of ROS-compatible stereo visual SLAM methods on a nVidia Jetson TX2 [J].
Giubilato, Riccardo ;
Chiodini, Sebastiano ;
Pertile, Marco ;
Debei, Stefano .
MEASUREMENT, 2019, 140 :161-170
[10]   Machinery health indicator construction based on convolutional neural networks considering trend burr [J].
Guo, Liang ;
Lei, Yaguo ;
Li, Naipeng ;
Yan, Tao ;
Li, Ningbo .
NEUROCOMPUTING, 2018, 292 :142-150