Visual Navigation Using Heterogeneous Landmarks and Unsupervised Geometric Constraints

被引:81
作者
Lu, Yan [1 ]
Song, Dezhen [1 ]
机构
[1] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
基金
美国国家科学基金会;
关键词
Heterogeneous landmarks; simultaneous localization and mapping (SLAM); visual navigation; MONOCULAR-VISION; MULTIVIEW STEREO; SHAPE; GRAPH; SLAM; MAP;
D O I
10.1109/TRO.2015.2424032
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We present a heterogeneous landmark-based visual navigation approach for a monocular mobile robot. We utilize heterogeneous visual features, such as points, line segments, lines, planes, and vanishing points, and their inner geometric constraints managed by a novel multilayer feature graph (MFG). Our method extends the local bundle adjustment-based visual simultaneous localization and mapping (SLAM) framework by explicitly exploiting the heterogeneous features and their inner geometric relationships in an unsupervised manner. As the result, our heterogeneous landmark-based visual navigation algorithm takes a video stream as input, initializes and iteratively updates MFG based on extracted key frames, and refines robot localization and MFG landmarks through the process. We present pseudocode for the algorithm and analyze its complexity. We have evaluated our method and compared it with state-of-the-art point landmark-based visual SLAM methods using multiple indoor and outdoor datasets. In particular, on the KITTI dataset, our method reduces the translational error by 52.5% under urban sequences where rectilinear structures dominate the scene.
引用
收藏
页码:736 / 749
页数:14
相关论文
共 63 条
[11]  
Davison AJ, 2003, NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, P1403
[12]   Edge landmarks in monocular SLAM [J].
Eade, Ethan ;
Drummond, Tom .
IMAGE AND VISION COMPUTING, 2009, 27 (05) :588-596
[13]   3-D Mapping With an RGB-D Camera [J].
Endres, Felix ;
Hess, Juergen ;
Sturm, Juergen ;
Cremers, Daniel ;
Burgard, Wolfram .
IEEE TRANSACTIONS ON ROBOTICS, 2014, 30 (01) :177-187
[14]   Visually augmented navigation in an unstructured environment using a delayed state history [J].
Eustice, R ;
Pizarro, O ;
Singh, H .
2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, :25-32
[15]   Line Matching Leveraged By Point Correspondences [J].
Fan, Bin ;
Wu, Fuchao ;
Hu, Zhanyi .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :390-397
[16]   Growing semantically meaningful models for visual SLAM [J].
Flint, Alex ;
Mei, Christopher ;
Reid, Ian ;
Murray, David .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :467-474
[17]   Attentional Landmarks and Active Gaze Control for Visual SLAM [J].
Frintrop, Simone ;
Jensfelt, Patric .
IEEE TRANSACTIONS ON ROBOTICS, 2008, 24 (05) :1054-1065
[18]   Data processing algorithms for generating textured 3D building facade meshes from laser scans and camera images [J].
Frueh, C ;
Jain, S ;
Zakhor, A .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 61 (02) :159-184
[19]   Piecewise Planar and Non-Planar Stereo for Urban Scene Reconstruction [J].
Gallup, David ;
Frahm, Jan-Michael ;
Pollefeys, Marc .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1418-1425
[20]   Discovering Higher Level Structure in Visual SLAM [J].
Gee, Andrew P. ;
Chekhlov, Denis ;
Calway, Andrew ;
Mayol-Cuevas, Walterio .
IEEE TRANSACTIONS ON ROBOTICS, 2008, 24 (05) :980-990