Visual Camera Re-Localization From RGB and RGB-D Images Using DSAC

Cited by: 124
Authors
Brachmann, Eric [1]
Rother, Carsten [2]
Affiliations
[1] Niantic, San Francisco, CA 94104 USA
[2] Heidelberg Univ, Visual Learning Lab, D-69117 Heidelberg, Germany
Funding
European Research Council;
Keywords
Cameras; Training; Three-dimensional displays; Visualization; Optimization; Neural networks; Solid modeling; Camera re-localization; pose estimation; differentiable RANSAC; DSAC; differentiable argmax; differentiable PnP;
DOI
10.1109/TPAMI.2021.3070754
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We describe a learning-based system that estimates the camera position and orientation from a single input image relative to a known environment. The system is flexible w.r.t. the amount of information available at test and at training time, catering to different applications. Input images can be RGB-D or RGB, and a 3D model of the environment can be utilized for training but is not necessary. In the minimal case, our system requires only RGB images and ground truth poses at training time, and it requires only a single RGB image at test time. The framework consists of a deep neural network and fully differentiable pose optimization. The neural network predicts so-called scene coordinates, i.e., dense correspondences between the input image and 3D scene space of the environment. The pose optimization implements robust fitting of pose parameters using differentiable RANSAC (DSAC) to facilitate end-to-end training. The system, an extension of DSAC++ and referred to as DSAC*, achieves state-of-the-art accuracy on various public datasets for RGB-based re-localization, and competitive accuracy for RGB-D-based re-localization.
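The abstract gives no implementation detail, so the following is only a minimal NumPy sketch of the general idea for the RGB-D case: predicted scene coordinates are paired with camera-space 3D points, pose hypotheses are fitted from random minimal sets, each hypothesis is scored with a soft inlier count, and a softmax over the scores yields a selection distribution instead of a hard argmax. The function names, the Kabsch solver, and the parameters `tau`, `beta`, and `alpha` are illustrative assumptions, not the authors' code or values from the paper.

```python
import numpy as np


def kabsch(cam_pts, scene_pts):
    """Least-squares rigid transform (R, t) with scene ~= R @ cam + t."""
    cam_mean = cam_pts.mean(axis=0)
    scene_mean = scene_pts.mean(axis=0)
    # 3x3 cross-covariance between the centered point sets.
    H = (cam_pts - cam_mean).T @ (scene_pts - scene_mean)
    U, _, Vt = np.linalg.svd(H)
    # Guard against a reflection so that det(R) = +1.
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = scene_mean - R @ cam_mean
    return R, t


def soft_hypothesis_selection(cam_pts, scene_pts, n_hyp=64,
                              tau=0.1, beta=50.0, alpha=0.1, rng=None):
    """Sample pose hypotheses and return them with scores and probabilities.

    tau   : inlier threshold on the 3D residual (assumed value)
    beta  : sharpness of the sigmoid used for soft inlier counting (assumed)
    alpha : scale applied to scores before the softmax (assumed)
    """
    rng = np.random.default_rng(rng)
    n = len(cam_pts)
    poses, scores = [], []
    for _ in range(n_hyp):
        # Minimal set: three 3D-3D correspondences define a rigid pose.
        idx = rng.choice(n, size=3, replace=False)
        R, t = kabsch(cam_pts[idx], scene_pts[idx])
        residuals = np.linalg.norm(cam_pts @ R.T + t - scene_pts, axis=1)
        # Soft inlier count: sigmoid(beta * (tau - r)), written via tanh
        # so that large residuals cannot overflow an exponential.
        soft_inliers = 0.5 * (1.0 + np.tanh(0.5 * beta * (tau - residuals)))
        poses.append((R, t))
        scores.append(soft_inliers.sum())
    scores = np.array(scores)
    # Softmax over scores: a distribution over hypotheses, not a hard argmax.
    logits = alpha * scores
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return poses, scores, probs
```

In an end-to-end setting, a pose loss would be taken in expectation over `probs` rather than on the single best hypothesis, which is how a differentiable RANSAC lets gradients reach the scene coordinate predictions. The sketch omits the network, the RGB-only (PnP) case, pose refinement, and any gradient computation.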
Pages: 5847-5865
Page count: 19