Learning Cross-View Geo-Localization Embeddings via Dynamic Weighted Decorrelation Regularization

Cited by: 1
Authors
Wang, Tingyu [1]
Zheng, Zhedong [2, 3]
Zhu, Zunjie [1, 4]
Sun, Yaoqi [1, 4]
Yan, Chenggang [1]
Yang, Yi [5]
Affiliations
[1] Hangzhou Dianzi Univ, Sch Commun Engn, Hangzhou 310018, Peoples R China
[2] Univ Macau, Fac Sci & Technol, Macau, Macau, Peoples R China
[3] Univ Macau, Inst Collaborat Innovat, Macau, Macau, Peoples R China
[4] Hangzhou Dianzi Univ, Lishui Inst, Lishui 323000, Peoples R China
[5] Zhejiang Univ, Sch Comp Sci, Hangzhou 310027, Peoples R China
Keywords
Feature extraction; Decorrelation; Training; Visualization; Optimization; Drones; Correlation; Satellites; Redundancy; Termination of employment; deep learning; geo-localization; image retrieval; the cross-correlation coefficient matrix;
DOI
10.1109/TGRS.2024.3491757
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification codes
0708; 070902
Abstract
In the domain of cross-view geo-localization, the challenge lies in accurately matching images captured from distinct perspectives, such as aerial drone imagery and satellite imagery of the same geographical location. Existing methods predominantly concentrate on minimizing distances between feature embeddings in the representational space, inadvertently overlooking the significance of reducing embedding redundancy. This oversight potentially hampers the extraction of diverse and distinctive visual patterns critical for precise localization. This work argues that minimizing embedding redundancy is a pivotal factor in enhancing a model's ability to discriminate diverse scene characteristics. To support this claim, we introduce a straightforward yet effective regularization technique, termed dynamic weighted decorrelation regularization (DWDR). DWDR serves to actively promote the learning of orthogonal feature channels within neural networks. By dynamically adjusting weights, DWDR targets the minimization of interchannel correlations, guiding the correlation matrix toward diagonality, indicative of independence among channels. The dynamic weighting mechanism adaptively prioritizes the decorrelation of channels that remain highly correlated throughout training. Additionally, we devise a symmetrical sampling strategy for cross-view scenarios to ensure that the training examples are balanced across different imaging platforms in a batch. Despite its simplicity, the integration of DWDR and the proposed sampling scheme yields remarkable performance across four extensive benchmark datasets: University-1652, CVUSA, CVACT, and VIGOR. Notably, in stringent conditions, such as when constrained to exceedingly compact feature dimensions of 64, our methodology significantly outperforms conventional baselines, thereby affirming its efficacy and robustness under challenging constraints.
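The decorrelation idea described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract gives no formula, so the loss form, the error-proportional weighting rule, and the names `standardize` and `dwdr_loss` are all assumptions made for illustration. The sketch also assumes the correlation matrix is computed between row-aligned embeddings of two views (for example drone and satellite images of the same locations).

```python
import numpy as np


def standardize(z, eps=1e-6):
    """Zero-mean, unit-variance normalization of each channel over the batch."""
    return (z - z.mean(axis=0)) / (z.std(axis=0) + eps)


def dwdr_loss(z_a, z_b, eps=1e-6):
    """Illustrative weighted decorrelation penalty (hypothetical form).

    z_a, z_b: arrays of shape (batch, channels) holding row-aligned
    embeddings from two views of the same locations.
    Returns the scalar penalty and the cross-correlation matrix.
    """
    n, d = z_a.shape
    # Cross-correlation coefficient matrix, shape (channels, channels).
    c = standardize(z_a, eps).T @ standardize(z_b, eps) / n
    # Squared deviation of every entry from the identity matrix, i.e. from
    # the diagonal target that indicates decorrelated channels.
    err = (c - np.eye(d)) ** 2
    # ASSUMPTION: weights grow with the current error, so entries that are
    # still far from the target are prioritized. In an autodiff framework
    # these weights would normally be treated as constants (no gradient).
    w = err / (err.mean() + eps)
    return float((w * err).sum()), c


# Usage: compare redundant channels with unrelated ones.
rng = np.random.default_rng(0)
base = rng.normal(size=(256, 1))
redundant = np.hstack([base, base + 0.01 * rng.normal(size=(256, 1))])
independent = rng.normal(size=(256, 2))

loss_red, _ = dwdr_loss(redundant, redundant)
loss_ind, _ = dwdr_loss(independent, independent)
print(f"redundant channels:   {loss_red:.4f}")
print(f"independent channels: {loss_ind:.4f}")
```

In this sketch, two near-duplicate channels produce off-diagonal correlations close to 1 and therefore a large penalty, while unrelated channels produce a penalty close to zero. In training, such a term would be added to the main retrieval loss with a weighting coefficient, which is again an assumption rather than a detail stated in the abstract.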
Pages: 12
Related papers
50 in total
  • [21] Cross-view Geo-localization Based on Cross-domain Matching
    Wu, Xiaokang
    Ma, Qianguang
    Li, Qi
    Yu, Yuanlong
    Liu, Wenxi
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 719 - 728
  • [22] Cross-View Visual Geo-Localization for Outdoor Augmented Reality
    Mithun, Niluthpol Chowdhury
    Minhas, Kshitij S.
    Chiu, Han-Pang
    Oskiper, Taragay
    Sizintsev, Mikhail
    Samarasekera, Supun
    Kumar, Rakesh
    2023 IEEE CONFERENCE VIRTUAL REALITY AND 3D USER INTERFACES, VR, 2023 : 493 - 502
  • [23] Geographic Semantic Network for Cross-View Image Geo-Localization
    Zhu, Yingying
    Sun, Bin
    Lu, Xiufan
    Jia, Sen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [24] Cross-view Geo-localization with Layer-to-Layer Transformer
    Yang, Hongji
    Lu, Xiufan
    Zhu, Yingying
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [25] Optimal Feature Transport for Cross-View Image Geo-Localization
    Shi, Yujiao
    Yu, Xin
    Liu, Liu
    Zhang, Tong
    Li, Hongdong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11990 - 11997
  • [26] Predictive Information Preservation via Variational Information Bottleneck for Cross-View Geo-Localization
    Li, Wansi
    Hu, Qian
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2022, PT I, 2022, 1700 : 403 - 419
  • [27] Aligning Geometric Spatial Layout in Cross-View Geo-Localization via Feature Recombination
    Zhang, Qingwang
    Zhu, Yingying
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024 : 7251 - 7259
  • [28] UAV Geo-Localization Dataset and Method Based on Cross-View Matching
    Yao, Yuwen
    Sun, Cheng
    Wang, Tao
    Yang, Jianxing
    Zheng, Enhui
    SENSORS, 2024, 24 (21)
  • [29] Enhancing Cross-View Geo-Localization With Domain Alignment and Scene Consistency
    Xia, Panwang
    Wan, Yi
    Zheng, Zhi
    Zhang, Yongjun
    Deng, Jiwei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 13271 - 13281
  • [30] Cross-View Object Geo-Localization in a Local Region With Satellite Imagery
    Sun, Yuxi
    Ye, Yunming
    Kang, Jian
    Fernandez-Beltran, Ruben
    Feng, Shanshan
    Li, Xutao
    Luo, Chuyao
    Zhang, Puzhao
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61