Detect-SLAM: Making Object Detection and SLAM Mutually Beneficial

被引:227
作者
Zhong, Fangwei [1 ]
Wang, Sheng [2 ]
Zhang, Ziqi [1 ]
Zhou, Chen [1 ]
Wang, Yizhou [1 ]
机构
[1] Peking Univ, Cooperat Medianet Innovat Ctr, Schl EECS, Natl Engn Lab Video Technol,Key Lab Machine Perce, Beijing 100871, Peoples R China
[2] Peking Univ, Shenzhen Grad Sch, Schl ECE, Shenzhen 518055, Peoples R China
来源
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018) | 2018年
关键词
SIMULTANEOUS LOCALIZATION;
D O I
10.1109/WACV.2018.00115
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although significant progress has been made in SLAM and object detection in recent years, there are still a series of challenges for both tasks, e.g., SLAM in dynamic environments and detecting objects in complex environments. To address these challenges, we present a novel robotic vision system, which integrates SLAM with a deep neural network-based object detector to make the two functions mutually beneficial. The proposed system facilitates a robot to accomplish tasks reliably and efficiently in an unknown and dynamic environment. Experimental results show that compare to the state-of-the-art robotic vision systems, the proposed system has three advantages: i) it greatly improves the accuracy and robustness of SLAM in dynamic environments by removing unreliable features from moving objects leveraging the object detector, ii) it builds an instance-level semantic map of the environment in an online fashion using the synergy of the two functions for further semantic applications; and iii) it improves the object detector so that it can detect/recognize objects effectively under more challenging conditions such as unusual viewpoints, poor lighting condition, and motion blur, by leveraging the object map.
引用
收藏
页码:1001 / 1010
页数:10
相关论文
共 37 条
[1]  
Bogun I., 2015, ARXIV150901602
[2]  
Bowman Sean L., 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P1722, DOI 10.1109/ICRA.2017.7989203
[3]  
Chhaya F, 2016, IEEE INT CONF ROBOT, P5758, DOI 10.1109/ICRA.2016.7487799
[4]   3-D Mapping With an RGB-D Camera [J].
Endres, Felix ;
Hess, Juergen ;
Sturm, Juergen ;
Cremers, Daniel ;
Burgard, Wolfram .
IEEE TRANSACTIONS ON ROBOTICS, 2014, 30 (01) :177-187
[5]  
Engel J., 2016, ARXIV160702565
[6]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[7]  
Frost DP, 2016, IEEE INT CONF ROBOT, P4770, DOI 10.1109/ICRA.2016.7487680
[8]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[9]   Mobile robot mapping in populated environments [J].
Hähnel, D ;
Schulz, D ;
Burgard, W .
ADVANCED ROBOTICS, 2003, 17 (07) :579-597
[10]  
Joseph RK, 2016, CRIT POL ECON S ASIA, P1