SuperPoint: Self-Supervised Interest Point Detection and Description

被引:1993
作者
DeTone, Daniel [1 ]
Malisiewicz, Tomasz [1 ]
Rabinovich, Andrew [1 ]
机构
[1] Mag Leap, Sunnyvale, CA 94089 USA
来源
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) | 2018年
关键词
D O I
10.1109/CVPRW.2018.00060
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a self-supervised framework for training interest point detectors and descriptors suitable for a large number of multiple-view geometry problems in computer vision. As opposed to patch-based neural networks, our fully-convolutional model operates on full-sized images and jointly computes pixel-level interest point locations and associated descriptors in one forward pass. We introduce Homographic Adaptation, a multi-scale, multi-homography approach for boosting interest point detection repeatability and performing cross-domain adaptation (e.g., synthetic-to-real). Our model, when trained on the MS-COCO generic image dataset using Homographic Adaptation, is able to repeatedly detect a much richer set of interest points than the initial pre-adapted deep model and any other traditional corner detector. The final system gives rise to state-of-the-art homography estimation results on HPatches when compared to LIFT, SIFT and ORB.
引用
收藏
页码:337 / 349
页数:13
相关论文
共 33 条
[1]  
[Anonymous], PyTorch
[2]  
[Anonymous], 2017, IEEE transactions on pattern analysis and machine intelligence, DOI [10.1109/TPAMI.2016.2644615, DOI 10.1109/TPAMI.2016.2644615]
[3]  
[Anonymous], 2005, PAMI
[4]   HPatches: A benchmark and evaluation of handcrafted and learned local descriptors [J].
Balntas, Vassileios ;
Lenc, Karel ;
Vedaldi, Andrea ;
Mikolajczyk, Krystian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3852-3861
[5]  
Choy C. B., 2016, Advances in Neural Information Processing Systems, P2414
[6]  
DeTone D., 2016, Deep image homography estima
[7]  
DeTone Daniel, 2017, ARXIV170707410
[8]  
Ganin Yaroslav, 2015, PR MACH LEARN RES
[9]  
Harris C., 1988, P 4 ALV VIS C, V15, P10, DOI DOI 10.5244/C.2.23
[10]  
Hartley R, 2003, Multiple View Geometry in Computer Vision, DOI [DOI 10.1017/CBO9780511811685, 10.1016/S0143-8166(01)00145-2]