Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching

被引:18
作者
Shi, Yujiao [1 ]
Yu, Xin [2 ]
Liu, Liu [3 ]
Campbell, Dylan [4 ]
Koniusz, Piotr [1 ,5 ]
Li, Hongdong [1 ]
机构
[1] Australian Natl Univ, Canberra, ACT 2601, Australia
[2] Univ Technol Sydney, Australian Artificial Intelligence Inst, Ultimo, NSW 2007, Australia
[3] Huawei, Cyberverse Lab, Shenzhen 518129, Peoples R China
[4] Univ Oxford, Oxford OX1 2JD, England
[5] Data61 CSIRO NICTA, Machine Learning Res Grp MLRG, Canberra, ACT 2601, Australia
关键词
Satellites; Cameras; Location awareness; Transforms; Databases; Task analysis; Image matching; Camera geo-localization; cross-view matching; street-view; satellite imagery; geotagging; RECOGNITION;
D O I
10.1109/TPAMI.2022.3189702
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of ground-to-satellite image geo-localization, that is, estimating the camera latitude, longitude and orientation (azimuth angle) by matching a query image captured at the ground level against a large-scale database with geotagged satellite images. Our prior arts treat the above task as pure image retrieval by selecting the most similar satellite reference image matching the ground-level query image. However, such an approach often produces coarse location estimates because the geotag of the retrieved satellite image only corresponds to the image center while the ground camera can be located at any point within the image. To further consolidate our prior research finding, we present a novel geometry-aware geo-localization method. Our new method is able to achieve the fine-grained location of a query image, up to pixel size precision of the satellite image, once its coarse location and orientation have been determined. Moreover, we propose a new geometry-aware image retrieval pipeline to improve the coarse localization accuracy. Apart from a polar transform in our conference work, this new pipeline also maps satellite image pixels to the ground-level plane in the ground-view via a geometry-constrained projective transform to emphasize informative regions, such as road structures, for cross-view geo-localization. Extensive quantitative and qualitative experiments demonstrate the effectiveness of our newly proposed framework. We also significantly improve the performance of coarse localization results compared to the state-of-the-art in terms of location recalls.
引用
收藏
页码:2682 / 2697
页数:16
相关论文
共 55 条
[1]  
[Anonymous], 2016, PROC BRIT MACH VIS C
[2]  
Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
[3]   Ground-to-Aerial Image Geo-Localization ith a Hard Exemplar Reweighting Triplet Loss [J].
Cai, Sudong ;
Guo, Yulan ;
Khan, Salman ;
Hu, Jiwei ;
Wen, Gongjian .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8390-8399
[4]   Minimal Scene Descriptions from Structure from Motion Models [J].
Cao, Song ;
Snavely, Noah .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :461-468
[5]   Semantic Cross-View Matching [J].
Castaldo, Francesco ;
Zamir, Amir ;
Angst, Roland ;
Palmieri, Francesco ;
Savarese, Silvio .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, :1044-1052
[6]  
Cohen TS, 2016, PR MACH LEARN RES, V48
[7]   FAB-MAP: Probabilistic localization and mapping in the space of appearance [J].
Cummins, Mark ;
Newman, Paul .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (06) :647-665
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]   Discriminative Feature-to-Point Matching in Image-Based Localization [J].
Donoser, Michael ;
Schmalstieg, Dieter .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :516-523
[10]  
Esteves C., 2018, PROC INT C LEARN REP