Learning Cross-View Geo-Localization Embeddings via Dynamic Weighted Decorrelation Regularization

被引:1
作者
Wang, Tingyu [1 ]
Zheng, Zhedong [2 ,3 ]
Zhu, Zunjie [1 ,4 ]
Sun, Yaoqi [1 ,4 ]
Yan, Chenggang [1 ]
Yang, Yi [5 ]
机构
[1] Hangzhou Dianzi Univ, Sch Commun Engn, Hangzhou 310018, Peoples R China
[2] Univ Macau, Fac Sci & Technol, Macau, Macau, Peoples R China
[3] Univ Macau, Inst Collaborat Innovat, Macau, Macau, Peoples R China
[4] Hangzhou Dianzi Univ, Lishui Inst, Lishui 323000, Peoples R China
[5] Zhejiang Univ, Sch Comp Sci, Hangzhou 310027, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Feature extraction; Decorrelation; Training; Visualization; Optimization; Drones; Correlation; Satellites; Redundancy; Termination of employment; deep learning; geo-localization; image retrieval; the cross-correlation coefficient matrix;
D O I
10.1109/TGRS.2024.3491757
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In the domain of cross-view geo-localization, the challenge lies in accurately matching images captured from distinct perspectives, such as aerial drone imagery and satellite imagery of the same geographical location. Existing methods predominantly concentrate on minimizing distances between feature embeddings in the representational space, inadvertently overlooking the significance of reducing embedding redundancy. This oversight potentially hampers the extraction of diverse and distinctive visual patterns critical for precise localization. This work argues that minimizing embedding redundancy is a pivotal factor in enhancing a model's ability to discriminate diverse scene characteristics. To support this claim, we introduce a straightforward yet effective regularization technique, termed dynamic weighted decorrelation regularization (DWDR). DWDR serves to actively promote the learning of orthogonal feature channels within neural networks. By dynamically adjusting weights, DWDR targets the minimization of interchannel correlations, guiding the correlation matrix toward diagonality, indicative of independence among channels. The dynamic weighting mechanism adaptively prioritizes the decorrelation of channels that remain highly correlated throughout training. Additionally, we devise a symmetrical sampling strategy for cross-view scenarios to ensure that the training examples are balanced across different imaging platforms in a batch. Despite its simplicity, the integration of DWDR and the proposed sampling scheme yields remarkable performance across four extensive benchmark datasets: University-1652, CVUSA, CVACT, and VIGOR. Notably, in stringent conditions, such as when constrained to exceedingly compact feature dimensions of 64, our methodology significantly outperforms conventional baselines, thereby affirming its efficacy and robustness under challenging constraints.
引用
收藏
页数:12
相关论文
共 50 条
[31]   Feature Relation Guided Cross-View Image Based Geo-Localization [J].
Hou, Qingfeng ;
Lu, Jun ;
Guo, Haitao ;
Liu, Xiangyun ;
Gong, Zhihui ;
Zhu, Kun ;
Ping, Yifan .
REMOTE SENSING, 2023, 15 (20)
[32]   AFPN: Attention-guided Feature Partition Network for Cross-view Geo-localization [J].
Lin, Zhifeng ;
Huang, Ranran ;
Cai, Jiancheng ;
Liu, Xinmin ;
Ding, Changxing ;
Chai, Zhenhua .
PROCEEDINGS OF THE 2023 WORKSHOP ON UAVS IN MULTIMEDIA: CAPTURING THE WORLD FROM A NEW PERSPECTIVE, UAVM 2023, 2023, :39-44
[33]   CCR: A Counterfactual Causal Reasoning-Based Method for Cross-View Geo-Localization [J].
Du, Haolin ;
He, Jingfei ;
Zhao, Yuanqing .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) :11630-11643
[34]   Cross-view geo-localization with panoramic street-view and VHR satellite imagery in decentrality settings [J].
Xia, Panwang ;
Yu, Lei ;
Wan, Yi ;
Wu, Qiong ;
Chen, Peiqi ;
Zhong, Liheng ;
Yao, Yongxiang ;
Wei, Dong ;
Liu, Xinyi ;
Ru, Lixiang ;
Zhang, Yingying ;
Lao, Jiangwei ;
Chen, Jingdong ;
Yang, Ming ;
Zhang, Yongjun .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 227 :1-11
[35]   Semantic Concept Perception Network With Interactive Prompting for Cross-View Image Geo-Localization [J].
Gao, Yuan ;
Liu, Haibo ;
Wei, Xiaohui .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (06) :5343-5354
[36]   Multilevel Feedback Joint Representation Learning Network Based on Adaptive Area Elimination for Cross-View Geo-Localization [J].
Ge, Fawei ;
Zhang, Yunzhou ;
Wang, Li ;
Liu, Wei ;
Liu, Yixiu ;
Coleman, Sonya ;
Kerr, Dermot .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[37]   A Cross-View Geo-Localization Algorithm Using UAV Image and Satellite Image [J].
Fan, Jiqi ;
Zheng, Enhui ;
He, Yufei ;
Yang, Jianxing .
SENSORS, 2024, 24 (12)
[38]   An Efficient Pyramid Transformer Network for Cross-View Geo-Localization in Complex Terrains [J].
Ju, Chengjie ;
Xu, Wangping ;
Chen, Nanxing ;
Zheng, Enhui .
DRONES, 2025, 9 (05)
[39]   Road Structure Inspired UGV-Satellite Cross-View Geo-Localization [J].
Hu, Di ;
Yuan, Xia ;
Xi, Huiying ;
Li, Jie ;
Song, Zhenbo ;
Xiong, Fengchao ;
Zhang, Kai ;
Zhao, Chunxia .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 :16767-16786
[40]   Navigating the Metaverse: UAV-Based Cross-View Geo-Localization in Virtual Worlds [J].
Yagi, Ryota ;
Yairi, Takehisa ;
Iwasaki, Akira .
PROCEEDINGS OF THE 2023 WORKSHOP ON UAVS IN MULTIMEDIA: CAPTURING THE WORLD FROM A NEW PERSPECTIVE, UAVM 2023, 2023, :13-17