Learning Cross-View Geo-Localization Embeddings via Dynamic Weighted Decorrelation Regularization

被引:1
|
作者
Wang, Tingyu [1 ]
Zheng, Zhedong [2 ,3 ]
Zhu, Zunjie [1 ,4 ]
Sun, Yaoqi [1 ,4 ]
Yan, Chenggang [1 ]
Yang, Yi [5 ]
机构
[1] Hangzhou Dianzi Univ, Sch Commun Engn, Hangzhou 310018, Peoples R China
[2] Univ Macau, Fac Sci & Technol, Macau, Macau, Peoples R China
[3] Univ Macau, Inst Collaborat Innovat, Macau, Macau, Peoples R China
[4] Hangzhou Dianzi Univ, Lishui Inst, Lishui 323000, Peoples R China
[5] Zhejiang Univ, Sch Comp Sci, Hangzhou 310027, Peoples R China
关键词
Feature extraction; Decorrelation; Training; Visualization; Optimization; Drones; Correlation; Satellites; Redundancy; Termination of employment; deep learning; geo-localization; image retrieval; the cross-correlation coefficient matrix;
D O I
10.1109/TGRS.2024.3491757
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In the domain of cross-view geo-localization, the challenge lies in accurately matching images captured from distinct perspectives, such as aerial drone imagery and satellite imagery of the same geographical location. Existing methods predominantly concentrate on minimizing distances between feature embeddings in the representational space, inadvertently overlooking the significance of reducing embedding redundancy. This oversight potentially hampers the extraction of diverse and distinctive visual patterns critical for precise localization. This work argues that minimizing embedding redundancy is a pivotal factor in enhancing a model's ability to discriminate diverse scene characteristics. To support this claim, we introduce a straightforward yet effective regularization technique, termed dynamic weighted decorrelation regularization (DWDR). DWDR serves to actively promote the learning of orthogonal feature channels within neural networks. By dynamically adjusting weights, DWDR targets the minimization of interchannel correlations, guiding the correlation matrix toward diagonality, indicative of independence among channels. The dynamic weighting mechanism adaptively prioritizes the decorrelation of channels that remain highly correlated throughout training. Additionally, we devise a symmetrical sampling strategy for cross-view scenarios to ensure that the training examples are balanced across different imaging platforms in a batch. Despite its simplicity, the integration of DWDR and the proposed sampling scheme yields remarkable performance across four extensive benchmark datasets: University-1652, CVUSA, CVACT, and VIGOR. Notably, in stringent conditions, such as when constrained to exceedingly compact feature dimensions of 64, our methodology significantly outperforms conventional baselines, thereby affirming its efficacy and robustness under challenging constraints.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Cross-View Geo-Localization: A Survey
    Durgam, Abhilash
    Paheding, Sidike
    Dhiman, Vikas
    Devabhaktuni, Vijay
    IEEE ACCESS, 2024, 12 : 192028 - 192050
  • [2] Cross-View Geo-Localization via Learning Correspondence Semantic Similarity Knowledge
    Chen, Guanli
    Huang, Guoheng
    Yuan, Xiaochen
    Chen, Xuhang
    Zhong, Guo
    Pun, Chi-Man
    MULTIMEDIA MODELING, MMM 2025, PT I, 2025, 15520 : 220 - 233
  • [3] Cross-View Geo-Localization via Learning Disentangled Geometric Layout Correspondence
    Zhang, Xiaohan
    Li, Xingyu
    Sultani, Waqas
    Zhou, Yi
    Wshah, Safwan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3480 - 3488
  • [4] Cross-View Image Sequence Geo-localization
    Zhang, Xiaohan
    Sultani, Waqas
    Wshah, Safwan
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2913 - 2922
  • [5] GAMa: Cross-View Video Geo-Localization
    Vyas, Shruti
    Chen, Chen
    Shah, Mubarak
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 440 - 456
  • [6] Cross-view geo-localization with evolving transformer
    Yang, Hongji
    Lu, Xiufan
    Zhu, Yingying
    arXiv, 2021,
  • [7] Learning Cross-View Visual Geo-Localization Without Ground Truth
    Li, Haoyuan
    Xu, Chang
    Yang, Wen
    Yu, Huai
    Xia, Gui-Song
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
  • [8] Joint Representation Learning and Keypoint Detection for Cross-View Geo-Localization
    Lin, Jinliang
    Zheng, Zhedong
    Zhong, Zhun
    Luo, Zhiming
    Li, Shaozi
    Yang, Yi
    Sebe, Nicu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3780 - 3792
  • [9] MUTUAL RELATIVE POSITION LEARNING TRANSFORMER FOR CROSS-VIEW GEO-LOCALIZATION
    Gu, Bo
    Ling, Hefei
    Shi, Yuxuan
    Li, Zongyi
    Zhao, Chuang
    Li, Ping
    Cao, Qiang
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 286 - 290
  • [10] Leveraging cross-view geo-localization with ensemble learning and temporal awareness
    Ghanem, Abdulrahman
    Abdelhay, Ahmed
    Salah, Noor Eldeen
    Nour Eldeen, Ahmed
    Elhenawy, Mohammed
    Masoud, Mahmoud
    Hassan, Ammar M. M.
    Hassan, Abdallah A. A.
    PLOS ONE, 2023, 18 (03):