Feature Relation Guided Cross-View Image Based Geo-Localization

被引:3
作者
Hou, Qingfeng [1 ]
Lu, Jun [1 ]
Guo, Haitao [1 ]
Liu, Xiangyun [1 ]
Gong, Zhihui [1 ]
Zhu, Kun [1 ]
Ping, Yifan [1 ]
机构
[1] PLA Strateg Support Force Informat Engn Univ, Inst Geospatial Informat, Zhengzhou 450001, Peoples R China
基金
美国国家科学基金会;
关键词
cross-view; geo-localization; relation guided; deformable convolution; multiscale contextual information; global spatial relations mining;
D O I
10.3390/rs15205029
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The goal of cross-view image based geo-localization is to determine the location of a given street-view image by matching it with a collection of geo-tagged aerial images, which has important applications in the fields of remote sensing information utilization and augmented reality. Most current cross-view image based geo-localization methods focus on the image content and ignore the relations between feature nodes, resulting in insufficient mining of effective information. To address this problem, this study proposes feature relation guided cross-view image based geo-localization. This method first processes aerial remote sensing images using a polar transform to achieve the geometric coarse alignment of ground-to-aerial images, and then realizes local contextual feature concern and global feature correlation modeling of the images through the feature relation guided attention generation module designed in this study. Specifically, the module includes two branches of deformable convolution based multiscale contextual feature extraction and global spatial relations mining, which effectively capture global structural information between feature nodes at different locations while correlating contextual features and guiding global feature attention generation. Finally, a novel feature aggregation module, MixVPR, is introduced to aggregate global feature descriptors to accomplish image matching and localization. After experimental validation, the cross-view image based geo-localization algorithm proposed in this study yields results of 92.08%, 97.70%, and 98.66% for the top 1, top 5, and top 10 metrics, respectively, in CVUSA, a popular public cross-view dataset, and exhibits superior performance compared to algorithms of the same type.
引用
收藏
页数:20
相关论文
共 37 条
[1]   MixVPR: Feature Mixing for Visual Place Recognition [J].
Ali-bey, Amar ;
Chaib-draa, Brahim ;
Giguere, Philippe .
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, :2997-3006
[2]  
Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
[3]   Ground-to-Aerial Image Geo-Localization ith a Hard Exemplar Reweighting Triplet Loss [J].
Cai, Sudong ;
Guo, Yulan ;
Khan, Salman ;
Hu, Jiwei ;
Wen, Gongjian .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8390-8399
[4]  
Chen DM, 2011, PROC CVPR IEEE, P737, DOI 10.1109/CVPR.2011.5995610
[5]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[6]   Dual Attention Network for Scene Segmentation [J].
Fu, Jun ;
Liu, Jing ;
Tian, Haijie ;
Li, Yong ;
Bao, Yongjun ;
Fang, Zhiwei ;
Lu, Hanqing .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]   CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization [J].
Hu, Sixing ;
Feng, Mengdan ;
Nguyen, Rang M. H. ;
Lee, Gim Hee .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7258-7267
[9]   Learned Contextual Feature Reweighting for Image Geo-Localization [J].
Kim, Hyo Jin ;
Dunn, Enrique ;
Frahm, Jan-Michael .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3251-3260
[10]   Predicting Good Features for Image Geo-Localization Using Per-Bundle VLAD [J].
Kim, Hyo Jin ;
Dunn, Enrique ;
Frahm, Jan-Michael .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1170-1178