Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis

被引:82
作者
Zhang, Zichao [1 ]
Sattler, Torsten [2 ]
Scaramuzza, Davide [1 ]
机构
[1] Univ Zurich, Robot & Percept Grp, Zurich, Switzerland
[2] Czech Tech Univ, Czech Inst Informat Robot & Cybernet, Prague, Czech Republic
关键词
Visual localization; Benchmark construction; Learned local features; PLACE RECOGNITION;
D O I
10.1007/s11263-020-01399-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual Localization is one of the key enabling technologies for autonomous driving and augmented reality. High quality datasets with accurate 6 Degree-of-Freedom (DoF) reference poses are the foundation for benchmarking and improving existing methods. Traditionally, reference poses have been obtained via Structure-from-Motion (SfM). However, SfM itself relies on local features which are prone to fail when images were taken under different conditions, e.g., day/night changes. At the same time, manually annotating feature correspondences is not scalable and potentially inaccurate. In this work, we propose a semi-automated approach to generate reference poses based on feature matching between renderings of a 3D model and real images via learned features. Given an initial pose estimate, our approach iteratively refines the pose based on feature matches against a rendering of the model from the current pose estimate. We significantly improve the nighttime reference poses of the popular Aachen Day-Night dataset, showing that state-of-the-art visual localization methods perform better (up to 47%) than predicted by the original reference poses. We extend the dataset with new nighttime test images, provide uncertainty estimates for our new reference poses, and introduce a new evaluation criterion. We will make our reference poses and our framework publicly available upon publication.
引用
收藏
页码:821 / 844
页数:24
相关论文
共 151 条
  • [1] Aiger D., 2019, 35 INT S COMP GEOM S
  • [2] Rolling shutter absolute pose problem with known vertical direction
    Albl, Cenek
    Kukelova, Zuzana
    Pajdla, Tomas
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3355 - 3363
  • [3] ALCANTARILLA P, 2011, ICRA
  • [4] [Anonymous], 2016, CVPR
  • [5] [Anonymous], 2019, IEEE C COMP VIS PATT
  • [6] [Anonymous], 2011, TPAMI
  • [7] Arandjelovi R., 2014, ACCV
  • [8] Learning to Align Semantic Segmentation and 2.5D Maps for Geolocalization
    Armagan, Anil
    Hirzer, Martin
    Roth, Peter M.
    Lepetit, Vincent
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4590 - 4597
  • [9] AUBRY M, 2014, ACM T GRAPHIC, V33
  • [10] Badino H, 2011, IEEE INT VEH SYM, P794, DOI 10.1109/IVS.2011.5940504