Improving RGB-D SLAM in dynamic environments using semantic aided segmentation

被引:10
作者
Kenye, Lhilo [1 ,2 ]
Kala, Rahul [1 ]
机构
[1] Indian Inst Informat Technol, Ctr Intelligent Robot, Allahabad, Prayagraj, India
[2] NavAjna Technol Pvt Ltd, Hyderabad, India
关键词
simultaneous localization and mapping; object recognition; dynamic SLAM; background detection; dynamic object filtering; computer vision; SIMULTANEOUS LOCALIZATION; MOTION REMOVAL; VISUAL SLAM;
D O I
10.1017/S0263574721001521
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Most conventional simultaneous localization and mapping (SLAM) approaches assume the working environment to be static. In a highly dynamic environment, this assumption divulges the impediments of a SLAM algorithm that lack modules that distinctively attend to dynamic objects despite the inclusion of optimization techniques. This work exploits such environments and reduces the effects of dynamic objects in a SLAM algorithm by separating features belonging to dynamic objects and static background using a generated binary mask image. While the features belonging to the static region are used for performing SLAM, the features belonging to non-static segments are reused instead of being eliminated. The approach employs deep neural network or DNN-based object detection module to obtain bounding boxes and then generates a lower resolution binary mask image using depth-first search algorithm over the detected semantics, characterizing the segmentation of the foreground from the static background. In addition, the features belonging to dynamic objects are tracked into consecutive frames to obtain better masking consistency. The proposed approach is tested on both publicly available dataset as well as self-collected dataset, which includes both indoor and outdoor environments. The experimental results show that the removal of features belonging to dynamic objects for a SLAM algorithm can significantly improve the overall output in a dynamic scene.
引用
收藏
页码:2065 / 2090
页数:26
相关论文
共 41 条
[11]  
Churchill W., 2012, 2012 15 INT IEEE C I
[12]   Experience-based navigation for long-term localisation [J].
Churchill, Winston ;
Newman, Paul .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (14) :1645-1661
[13]  
Churchill W, 2012, IEEE INT CONF ROBOT, P4525, DOI 10.1109/ICRA.2012.6224596
[14]  
Engel J, 2015, IEEE INT C INT ROBOT, P1935, DOI 10.1109/IROS.2015.7353631
[15]   LSD-SLAM: Large-Scale Direct Monocular SLAM [J].
Engel, Jakob ;
Schoeps, Thomas ;
Cremers, Daniel .
COMPUTER VISION - ECCV 2014, PT II, 2014, 8690 :834-849
[16]   Efficient graph-based image segmentation [J].
Felzenszwalb, PF ;
Huttenlocher, DP .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 59 (02) :167-181
[17]  
Huletski A., 2015, 2015 ART INT NAT LAN
[18]  
Jonathan, 2017, P IEEE C COMPUTER VI
[19]  
Kitt B., 2010, 2010 IEEERSJ INT C I
[20]   RTAB-Map as an open-source lidar and visual simultaneous localization and mapping library for large-scale and long-term online operation [J].
Labbe, Mathieu ;
Michaud, Francois .
JOURNAL OF FIELD ROBOTICS, 2019, 36 (02) :416-446