Aligning Geometric Spatial Layout in Cross-View Geo-Localization via Feature Recombination

Authors
Zhang, Qingwang [1]
Zhu, Yingying [1]
Affiliations
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
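Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract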
Cross-view geo-localization holds significant potential for various applications, but drastic differences in viewpoint and visual appearance between cross-view images make this task extremely challenging. Recent works have made notable progress in cross-view geo-localization. However, existing methods either ignore the correspondence between the geometric spatial layouts of cross-view images or require high costs or strict constraints to achieve such alignment. In response to these challenges, we propose a Feature Recombination Module (FRM) that explicitly establishes the geometric spatial layout correspondences between the two views. Unlike existing methods, FRM aligns geometric spatial layout by directly recombining features, avoiding image preprocessing and introducing no additional computational or parameter costs. This effectively reduces ambiguities caused by geometric misalignments between ground-level and aerial-level images. Furthermore, it is not sensitive to the choice of framework and applies to both CNN-based and Transformer-based architectures. Additionally, as part of the training procedure, we introduce a novel weighted (B+1)-tuple loss (WBL) as the optimization objective. Compared to the widely used weighted soft margin ranking loss, this loss improves both convergence speed and final performance. Based on these two core components (FRM and WBL), we develop an end-to-end network architecture (FRGeo) to address these limitations from a different perspective. Extensive experiments show that our proposed FRGeo not only achieves state-of-the-art performance on cross-view geo-localization benchmarks, including CVUSA, CVACT, and VIGOR, but is also significantly superior or competitive in terms of computational complexity and trainable parameters. Our project homepage is at https://zqwlearning.github.io/FRGeo.
Pages: 7251-7259
Number of pages: 9