SuperPoint: Self-Supervised Interest Point Detection and Description

被引：1993

作者：

DeTone, Daniel ^{[1
]}

Malisiewicz, Tomasz ^{[1
]}

Rabinovich, Andrew ^{[1
]}

机构：

[1] Mag Leap, Sunnyvale, CA 94089 USA

来源：

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) | 2018年

关键词：

D O I：

10.1109/CVPRW.2018.00060

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a self-supervised framework for training interest point detectors and descriptors suitable for a large number of multiple-view geometry problems in computer vision. As opposed to patch-based neural networks, our fully-convolutional model operates on full-sized images and jointly computes pixel-level interest point locations and associated descriptors in one forward pass. We introduce Homographic Adaptation, a multi-scale, multi-homography approach for boosting interest point detection repeatability and performing cross-domain adaptation (e.g., synthetic-to-real). Our model, when trained on the MS-COCO generic image dataset using Homographic Adaptation, is able to repeatedly detect a much richer set of interest points than the initial pre-adapted deep model and any other traditional corner detector. The final system gives rise to state-of-the-art homography estimation results on HPatches when compared to LIFT, SIFT and ORB.

引用

页码：337 / 349

页数：13

共 33 条

[1]

[Anonymous], PyTorch

[2]

[Anonymous], 2017, IEEE transactions on pattern analysis and machine intelligence, DOI [10.1109/TPAMI.2016.2644615, DOI 10.1109/TPAMI.2016.2644615]

[3]

[Anonymous], 2005, PAMI

[4] HPatches: A benchmark and evaluation of handcrafted and learned local descriptors [J].

Balntas, Vassileios ;

Lenc, Karel ;

Vedaldi, Andrea ;

Mikolajczyk, Krystian .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3852-3861

[5]

Choy C. B., 2016, Advances in Neural Information Processing Systems, P2414

[6]

DeTone D., 2016, Deep image homography estima

[7]

DeTone Daniel, 2017, ARXIV170707410

[8]

Ganin Yaroslav, 2015, PR MACH LEARN RES

[9]

Harris C., 1988, P 4 ALV VIS C, V15, P10, DOI DOI 10.5244/C.2.23

[10]

Hartley R, 2003, Multiple View Geometry in Computer Vision, DOI [DOI 10.1017/CBO9780511811685, 10.1016/S0143-8166(01)00145-2]

← 1 2 3 4 →