Linear RGB-D SLAM for Structured Environments

被引:21
作者
Joo, Kyungdon [1 ]
Kim, Pyojin [2 ]
Hebert, Martial [3 ,4 ]
Kweon, In So [5 ]
Kim, Hyoun Jin [6 ]
机构
[1] Ulsan Natl Inst Sci & Technol, Artificial Intelligence Grad Sch, Dept Comp Sci & Engn, Ulsan 44919, South Korea
[2] Sookmyung Womens Univ, Dept Mech Syst Engn, Seoul 04310, South Korea
[3] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[4] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
[5] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon 34141, South Korea
[6] Seoul Natl Univ, Sch Mech & Aerosp Engn, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
Linear SLAM; manhattan world; atlanta world; RGB-D image; Bayesian filtering; scene understanding; ODOMETRY; WORLD;
D O I
10.1109/TPAMI.2021.3106820
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new linear RGB-D simultaneous localization and mapping (SLAM) formulation by utilizing planar features of the structured environments. The key idea is to understand a given structured scene and exploit its structural regularities such as the Manhattan world. This understanding allows us to decouple the camera rotation by tracking structural regularities, which makes SLAM problems free from being highly nonlinear. Additionally, it provides a simple yet effective cue for representing planar features, which leads to a linear SLAM formulation. Given an accurate camera rotation, we jointly estimate the camera translation and planar landmarks in the global planar map using a linear Kalman filter. Our linear SLAM method, called L-SLAM, can understand not only the Manhattan world but the more general scenario of the Atlanta world, which consists of a vertical direction and a set of horizontal directions orthogonal to the vertical direction. To this end, we introduce a novel tracking-by-detection scheme that infers the underlying scene structure by Atlanta representation. With efficient Atlanta representation, we formulate a unified linear SLAM framework for structured environments. We evaluate L-SLAM on a synthetic dataset and RGB-D benchmarks, demonstrating comparable performance to other state-of-the-art SLAM methods without using expensive nonlinear optimization. We assess the accuracy of L-SLAM on a practical application of augmented reality.
引用
收藏
页码:8403 / 8419
页数:17
相关论文
共 64 条
[1]  
[Anonymous], 2010, TRIGGP201001 U BONN
[2]   Consistency of the EKF-SLAM algorithm [J].
Bailey, Tim ;
Nieto, Juan ;
Guivant, Jose ;
Stevens, Michael ;
Nebot, Eduardo .
2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, :3562-+
[3]  
Camposeco F, 2015, IEEE INT CONF ROBOT, P5219, DOI 10.1109/ICRA.2015.7139926
[4]  
Carlone L, 2015, IEEE INT CONF ROBOT, P4597, DOI 10.1109/ICRA.2015.7139836
[5]   Matterport3D: Learning from RGB-D Data in Indoor Environments [J].
Chang, Angel ;
Dai, Angela ;
Funkhouser, Thomas ;
Halber, Maciej ;
Niessner, Matthias ;
Savva, Manolis ;
Song, Shuran ;
Zeng, Andy ;
Zhang, Yinda .
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, :667-676
[6]   MEAN SHIFT, MODE SEEKING, AND CLUSTERING [J].
CHENG, YZ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (08) :790-799
[7]  
Coughlan J. M., 1999, P IEEE INT C COMP VI, P941, DOI [DOI 10.1109/ICCV.1999.790349, 10.1109/ICCV.1999.790349]
[8]   BundleFusion: Real-Time Globally Consistent 3D Reconstruction Using On-the-Fly Surface Reintegration [J].
Dai, Angela ;
Niessner, Matthias ;
Zollhofer, Michael ;
Izadi, Shahram ;
Theobalt, Christian .
ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (03)
[9]   MonoSLAM: Real-time single camera SLAM [J].
Davison, Andrew J. ;
Reid, Ian D. ;
Molton, Nicholas D. ;
Stasse, Olivier .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (06) :1052-1067
[10]   LSD-SLAM: Large-Scale Direct Monocular SLAM [J].
Engel, Jakob ;
Schoeps, Thomas ;
Cremers, Daniel .
COMPUTER VISION - ECCV 2014, PT II, 2014, 8690 :834-849