An Efficient Method based on Multi-view Semantic Alignment for Cross-view Geo-localization

被引:0
|
作者
Wang, Yifeng [1 ]
Xia, Yamei [1 ]
Lu, Tianbo [1 ]
Zhang, Xiaoyan [1 ]
Yao, Wenbin [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Natl Pilot Software EngineeringSch, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Geo-localization; Image Retrieval; Transformer; Semantic Alignment;
D O I
10.1109/IJCNN54540.2023.10191537
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-view geo-localization is to retrieve the most relevant images from different views. The biggest challenge is the visual differences between different views and the location shifts in practical applications. Existing methods usually extract fine-grained features of the retrieval target and match them by semantic alignment. The Transformer-based approach can focus on more contextual information than the CNN-based approach and also learn the geometric correspondence between two viewpoint images directly through the location encoding information. However, the existing methods need to fully utilize the information from different viewpoints, and the model needs to understand the context information sufficiently. To address these issues, we propose an efficient method to fully use image information from cross-views and feature fusion, divided into two branches: Aerial-View Local-Feature Cross-Fusion(ALCF) and Multi-View Global-feature Cross-Fusion(MGCF). By observing the characteristics of the aerial and street views, we perform a targeted fusion of global and local features from different viewpoints. In addition, we introduce a multi-view semantic alignment module, which can solve the problem that more noise information is introduced when the aerial view and street view images are semantically aligned. Experiments show that our proposed method achieves excellent performance in both the drone viewpoint target localization and drone navigation tasks on the University-1652 dataset.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Cross-View Geo-Localization: A Survey
    Durgam, Abhilash
    Paheding, Sidike
    Dhiman, Vikas
    Devabhaktuni, Vijay
    IEEE ACCESS, 2024, 12 : 192028 - 192050
  • [2] Geographic Semantic Network for Cross-View Image Geo-Localization
    Zhu, Yingying
    Sun, Bin
    Lu, Xiufan
    Jia, Sen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] UAV Geo-Localization Dataset and Method Based on Cross-View Matching
    Yao, Yuwen
    Sun, Cheng
    Wang, Tao
    Yang, Jianxing
    Zheng, Enhui
    SENSORS, 2024, 24 (21)
  • [4] Enhancing Cross-View Geo-Localization With Domain Alignment and Scene Consistency
    Xia, Panwang
    Wan, Yi
    Zheng, Zhi
    Zhang, Yongjun
    Deng, Jiwei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 13271 - 13281
  • [5] Cross-View Image Sequence Geo-localization
    Zhang, Xiaohan
    Sultani, Waqas
    Wshah, Safwan
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2913 - 2922
  • [6] GAMa: Cross-View Video Geo-Localization
    Vyas, Shruti
    Chen, Chen
    Shah, Mubarak
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 440 - 456
  • [7] Cross-view geo-localization with evolving transformer
    Yang, Hongji
    Lu, Xiufan
    Zhu, Yingying
    arXiv, 2021,
  • [8] Cross-view Geo-localization Based on Cross-domain Matching
    Wu, Xiaokang
    Ma, Qianguang
    Li, Qi
    Yu, Yuanlong
    Liu, Wenxi
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 719 - 728
  • [9] AENet: attention efficient network for cross-view image geo-localization
    Xu, Jingqian
    Zhu, Ma
    Qi, Baojun
    Li, Jiangshan
    Yang, Chunfang
    ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (07): : 4119 - 4138
  • [10] Cross-View Geo-Localization via Learning Correspondence Semantic Similarity Knowledge
    Chen, Guanli
    Huang, Guoheng
    Yuan, Xiaochen
    Chen, Xuhang
    Zhong, Guo
    Pun, Chi-Man
    MULTIMEDIA MODELING, MMM 2025, PT I, 2025, 15520 : 220 - 233