Learning Cross-View Geo-Localization Embeddings via Dynamic Weighted Decorrelation Regularization

被引:1
作者
Wang, Tingyu [1 ]
Zheng, Zhedong [2 ,3 ]
Zhu, Zunjie [1 ,4 ]
Sun, Yaoqi [1 ,4 ]
Yan, Chenggang [1 ]
Yang, Yi [5 ]
机构
[1] Hangzhou Dianzi Univ, Sch Commun Engn, Hangzhou 310018, Peoples R China
[2] Univ Macau, Fac Sci & Technol, Macau, Macau, Peoples R China
[3] Univ Macau, Inst Collaborat Innovat, Macau, Macau, Peoples R China
[4] Hangzhou Dianzi Univ, Lishui Inst, Lishui 323000, Peoples R China
[5] Zhejiang Univ, Sch Comp Sci, Hangzhou 310027, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Feature extraction; Decorrelation; Training; Visualization; Optimization; Drones; Correlation; Satellites; Redundancy; Termination of employment; deep learning; geo-localization; image retrieval; the cross-correlation coefficient matrix;
D O I
10.1109/TGRS.2024.3491757
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In the domain of cross-view geo-localization, the challenge lies in accurately matching images captured from distinct perspectives, such as aerial drone imagery and satellite imagery of the same geographical location. Existing methods predominantly concentrate on minimizing distances between feature embeddings in the representational space, inadvertently overlooking the significance of reducing embedding redundancy. This oversight potentially hampers the extraction of diverse and distinctive visual patterns critical for precise localization. This work argues that minimizing embedding redundancy is a pivotal factor in enhancing a model's ability to discriminate diverse scene characteristics. To support this claim, we introduce a straightforward yet effective regularization technique, termed dynamic weighted decorrelation regularization (DWDR). DWDR serves to actively promote the learning of orthogonal feature channels within neural networks. By dynamically adjusting weights, DWDR targets the minimization of interchannel correlations, guiding the correlation matrix toward diagonality, indicative of independence among channels. The dynamic weighting mechanism adaptively prioritizes the decorrelation of channels that remain highly correlated throughout training. Additionally, we devise a symmetrical sampling strategy for cross-view scenarios to ensure that the training examples are balanced across different imaging platforms in a batch. Despite its simplicity, the integration of DWDR and the proposed sampling scheme yields remarkable performance across four extensive benchmark datasets: University-1652, CVUSA, CVACT, and VIGOR. Notably, in stringent conditions, such as when constrained to exceedingly compact feature dimensions of 64, our methodology significantly outperforms conventional baselines, thereby affirming its efficacy and robustness under challenging constraints.
引用
收藏
页数:12
相关论文
共 50 条
[41]   HADGEO: IMAGE BASED 3-DOF CROSS-VIEW GEO-LOCALIZATION WITH HARD SAMPLE MINING [J].
Li, Chaoran ;
Yan, Chao ;
Xiang, Xiaojia ;
Lai, Jun ;
Zhou, Han ;
Tang, Dengqing .
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, :3520-3524
[42]   Swin transformer for feature extraction: a cross-view geo-localization method for UAV-Satellite views [J].
Liu, Jinyu ;
Ren, Kan ;
Chen, Qian .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2025, 46 (13) :4864-4882
[43]   A Novel Geo-Localization Method for UAV and Satellite Images Using Cross-View Consistent Attention [J].
Cui, Zhuofan ;
Zhou, Pengwei ;
Wang, Xiaolong ;
Zhang, Zilun ;
Li, Yingxuan ;
Li, Hongbo ;
Zhang, Yu .
REMOTE SENSING, 2023, 15 (19)
[44]   Attention-Driven Object Encoding and Multiscale Contextual Perception for Improved Cross-View Object Geo-Localization [J].
Song, Haoshuai ;
Tong, Xiaochong ;
Zhang, Xiaoyu ;
Lei, Yaxian ;
Li, He ;
Guo, Congzhou .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
[45]   It's Okay to Be Wrong: Cross-View Geo-Localization With Step-Adaptive Iterative Refinement [J].
Lu, Xiufan ;
Luo, Siqi ;
Zhu, Yingying .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[46]   DISASTER DAMAGE VISUALIZATION BY VLM-BASED INTERACTIVE IMAGE RETRIEVAL AND CROSS-VIEW IMAGE GEO-LOCALIZATION [J].
Sogi, Naoya ;
Shibata, Takashi ;
Terao, Makoto ;
Senzaki, Kenta ;
Tani, Masahiro ;
Rodrigues, Royston .
IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, :1746-1749
[47]   Do Keypoints Contain Crucial Information? Mining Keypoint Information to Enhance Cross-View Geo-Localization [J].
Liang, Yanchao ;
Wu, Xiangqian .
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
[48]   A Part-aware Attention Neural Network for Cross-view Geo-localization between UAV and Satellite [J].
Bui, Duc Viet ;
Kubo, Masao ;
Sato, Hiroshi .
JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2022, 9 (03) :275-284
[49]   Dual-branch Pattern and Multi-scale Context Facilitate Cross-view Geo-localization [J].
Zhang, Bing ;
Sun, Jing ;
Yan, Rui ;
Sun, Fuming ;
Wang, Fasheng .
PROCEEDINGS OF THE 2023 WORKSHOP ON UAVS IN MULTIMEDIA: CAPTURING THE WORLD FROM A NEW PERSPECTIVE, UAVM 2023, 2023, :25-29
[50]   CIG2S: A Cross-View Image Geo-Localization Model Based on G2S Transform Suitable for Center-Misaligned Scenarios [J].
Li, Jiangshan ;
Yang, Chunfang ;
Qi, Baojun ;
Zhu, Ma ;
Chen, Junyang ;
Leung, Victor C. M. .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,