Deep Keypoint-Based Camera Pose Estimation with Geometric Constraints

被引：38

作者：

Jau, You-Yi ^{[1
]}

Zffu, Rui ^{[1
]}

Su, Hao ^{[1
]}

Chandraker, Manmohan ^{[1
]}

机构：

[1] Univ Calif San Diego, San Diego, CA 92103 USA

来源：

2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2020年

基金：

美国国家科学基金会;

关键词：

MONOCULAR SLAM; ORB;

D O I：

10.1109/IROS45743.2020.9341229

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Estimating relative camera poses from consecutive frames is a fundamental problem in visual odometry (VO) and simultaneous localization and mapping (SLAM), where classic methods consisting of hand-crafted features and sampling-based outlier rejection have been a dominant choice for over a decade. Although multiple works propose to replace these modules with learning-based counterparts, most have not yet been as accurate, robust and generalizable as conventional methods. In this paper, we design an end-to-end trainable framework consisting of learnable modules for detection, feature extraction, matching and outlier rejection, while directly optimizing for the geometric pose objective. We show both quantitatively and qualitatively that pose estimation performance may be achieved on par with the classic pipeline. Moreover, we are able to show by end-to-end training, the key components of the pipeline could be significantly improved, which leads to better generalizability to unseen datasets compared to existing learning-based methods.

引用

页码：4950 / 4957

页数：8

共 52 条

[1] RelocNet: Continuous Metric Learning Relocalisation Using Neural Nets [J].

Balntas, Vassileios ;

Li, Shuda ;

Prisacariu, Victor .

COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :782-799

[2] HPatches: A benchmark and evaluation of handcrafted and learned local descriptors [J].

Balntas, Vassileios ;

Lenc, Karel ;

Vedaldi, Andrea ;

Mikolajczyk, Krystian .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3852-3861

[3]

Bian JW, 2019, ADV NEUR IN, V32

[4] Learning Less is More-6D Camera Localization via 3D Surface Regression [J].

Brachmann, Eric ;

Rother, Carsten .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4654-4662

[5] DSAC - Differentiable RANSAC for Camera Localization [J].

Brachmann, Eric ;

Krull, Alexander ;

Nowozin, Sebastian ;

Shotton, Jamie ;

Michel, Frank ;

Gumhold, Stefan ;

Rother, Carsten .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2492-2500

[6] Gradient descent optimization of smoothed information retrieval metrics [J].

Chapelle, Olivier ;

Wu, Mingrui .

INFORMATION RETRIEVAL, 2010, 13 (03) :216-235

[7]

Christiansen Peter Hviid, 2019, Unsuperpoint: End-to-end unsupervised interest point detector and descriptor

[8] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

[9]

DeTone D., 2018, ARXIV181203245

[10] SuperPoint: Self-Supervised Interest Point Detection and Description [J].

DeTone, Daniel ;

Malisiewicz, Tomasz ;

Rabinovich, Andrew .

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :337-349

← 1 2 3 4 5 6 →