An Efficient Method based on Multi-view Semantic Alignment for Cross-view Geo-localization

被引:0
|
作者
Wang, Yifeng [1 ]
Xia, Yamei [1 ]
Lu, Tianbo [1 ]
Zhang, Xiaoyan [1 ]
Yao, Wenbin [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Natl Pilot Software EngineeringSch, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Geo-localization; Image Retrieval; Transformer; Semantic Alignment;
D O I
10.1109/IJCNN54540.2023.10191537
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-view geo-localization is to retrieve the most relevant images from different views. The biggest challenge is the visual differences between different views and the location shifts in practical applications. Existing methods usually extract fine-grained features of the retrieval target and match them by semantic alignment. The Transformer-based approach can focus on more contextual information than the CNN-based approach and also learn the geometric correspondence between two viewpoint images directly through the location encoding information. However, the existing methods need to fully utilize the information from different viewpoints, and the model needs to understand the context information sufficiently. To address these issues, we propose an efficient method to fully use image information from cross-views and feature fusion, divided into two branches: Aerial-View Local-Feature Cross-Fusion(ALCF) and Multi-View Global-feature Cross-Fusion(MGCF). By observing the characteristics of the aerial and street views, we perform a targeted fusion of global and local features from different viewpoints. In addition, we introduce a multi-view semantic alignment module, which can solve the problem that more noise information is introduced when the aerial view and street view images are semantically aligned. Experiments show that our proposed method achieves excellent performance in both the drone viewpoint target localization and drone navigation tasks on the University-1652 dataset.
引用
收藏
页数:8
相关论文
共 50 条
  • [11] UAV-Satellite View Synthesis for Cross-View Geo-Localization
    Tian, Xiaoyang
    Shao, Jie
    Ouyang, Deqiang
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4804 - 4815
  • [12] Benchmarking the Robustness of Cross-View Geo-Localization Models
    Zhang, Qingwang
    Zhu, Yingying
    COMPUTER VISION - ECCV 2024, PT LXXXVII, 2025, 15145 : 36 - 53
  • [13] Dual Path Network for Cross-view Geo-Localization
    Dong, Leyi
    Wang, Yuhui
    Huang, Junshi
    Qian, Xueming
    Fan, Mingyuan
    Lai, Shenqi
    PROCEEDINGS OF THE 2023 WORKSHOP ON UAVS IN MULTIMEDIA: CAPTURING THE WORLD FROM A NEW PERSPECTIVE, UAVM 2023, 2023, : 45 - 49
  • [14] Feature Relation Guided Cross-View Image Based Geo-Localization
    Hou, Qingfeng
    Lu, Jun
    Guo, Haitao
    Liu, Xiangyun
    Gong, Zhihui
    Zhu, Kun
    Ping, Yifan
    REMOTE SENSING, 2023, 15 (20)
  • [15] CCR: A Counterfactual Causal Reasoning-based Method for Cross-view Geo-localization
    Du H.
    He J.
    Zhao Y.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (11) : 1 - 1
  • [16] Lending Orientation to Neural Networks for Cross-view Geo-localization
    Liu, Liu
    Li, Hongdong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5607 - 5616
  • [17] Fusing Geometric and Scene Information for Cross-View Geo-Localization
    Guo, Siyuan
    Liu, Tianying
    Li, Wengen
    Guan, Jihong
    Zhou, Shuigeng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3978 - 3982
  • [18] Cross-View Image Matching for Geo-localization in Urban Environments
    Tian, Yicong
    Chen, Chen
    Shah, Mubarak
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1998 - 2006
  • [19] Perceptual Feature Fusion Network for Cross-View Geo-Localization
    Wang, Jiayi
    Chen, Ziyang
    Yuan, Xiaochen
    Zhao, Genping
    Computer Engineering and Applications, 60 (03): : 255 - 262
  • [20] Cross-View Visual Geo-Localization for Outdoor Augmented Reality
    Mithun, Niluthpol Chowdhury
    Minhas, Kshitij S.
    Chiu, Han-Pang
    Oskiper, Taragay
    Sizintsev, Mikhail
    Samarasekera, Supun
    Kumar, Rakesh
    2023 IEEE CONFERENCE VIRTUAL REALITY AND 3D USER INTERFACES, VR, 2023, : 493 - 502