CAMP: A Cross-View Geo-Localization Method Using Contrastive Attributes Mining and Position-Aware Partitioning

被引：5

作者：

Wu, Qiong ^{[1
]}

Wan, Yi ^{[1
]}

Zheng, Zhi ^{[2
]}

Zhang, Yongjun ^{[1
]}

Wang, Guangshuai ^{[3
]}

Zhao, Zhenyang ^{[3
]}

机构：

[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China

[2] Chinese Univ Hong Kong, Dept Geog & Resource Management, Shatin, Hong Kong, Peoples R China

[3] China Railway Design Grp Co Ltd, Tianjin 300308, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Contrastive learning; Task analysis; Drones; Satellite images; Data mining; Visualization; Cross-view geo-localization (CVGL); image retrieval; remote sensing; satellite image; unmanned aerial vehicles (UAVs);

D O I：

10.1109/TGRS.2024.3448499

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Cross-view geo-localization (CVGL) task aims to utilize geographic data, such as maps or high-resolution satellite images, as reference to estimate the positions of a ground- or near-ground- captured query image. This task is particularly challenging due to the significant changes in visual appearance resulting from the extreme viewpoint variations. To address this challenge, a range of innovative methods have been proposed. However, intra-scene geometric information and inter-scene discriminative representation are not fully explored. In this article, we propose a novel CVGL method using contrastive attributes mining and position-aware partitioning (CAMP), which incorporates a position-aware partition branch (PPB) and a contrastive attributes mining (CAM) strategy. PPB learns fine-grained local features of different parts and captures their spatial information, providing a comprehensive understanding of scenes from both textual and spatial perspectives. CAM establishes supervision of the negative samples based on the images from the same platform, empowering the model to better discern differences between distinct scenes without extra memory cost. The proposed CAMP surpasses existing methods, achieving state-of-the-art results on the satellite-drone CVGL datasets University-1652 and SUES-200. Additionally, our method also outperforms existing methods in cross-dataset generalization, achieving an 8.85% increase in R@1 when trained on the University-1652 dataset and tested on the SUES-200 dataset at a height of 150 m. Our code and model are available at https://github.com/Mabel0403/CAMP.

引用

页数：14

共 48 条

[1]

Bansal M., 2011, P 19 ACM INT C MULT, P1125

[2] Ground-to-Aerial Image Geo-Localization ith a Hard Exemplar Reweighting Triplet Loss [J].

Cai, Sudong ;

Guo, Yulan ;

Khan, Salman ;

Hu, Jiwei ;

Wen, Gongjian .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8390-8399

[3] Emerging Properties in Self-Supervised Vision Transformers [J].

Caron, Mathilde ;

Touvron, Hugo ;

Misra, Ishan ;

Jegou, Herve ;

Mairal, Julien ;

Bojanowski, Piotr ;

Joulin, Armand .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9630-9640

[4] Semantic Cross-View Matching [J].

Castaldo, Francesco ;

Zamir, Amir ;

Angst, Roland ;

Palmieri, Francesco ;

Savarese, Silvio .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, :1044-1052

[5]

Chen T., 2020, INT C MACH LEARN PML, P1597

[6]

Chen Ting, 2020, Advances in neural information processing systems, DOI 10.48550/arXiv.2006.10029

[7]

Chen XL, 2020, Arxiv, DOI [arXiv:2003.04297, 10.48550/arXiv.2003.04297]

[8] An Empirical Study of Training Self-Supervised Vision Transformers [J].

Chen, Xinlei ;

Xie, Saining ;

He, Kaiming .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9620-9629

[9] A Transformer-Based Feature Segmentation and Region Alignment Method for UAV-View Geo-Localization [J].

Dai, Ming ;

Hu, Jianhong ;

Zhuang, Jiedong ;

Zheng, Enhui .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) :4376-4389

[10] Sample4Geo: Hard Negative Sampling For Cross-View Geo-Localisation [J].

Deuser, Fabian ;

Habel, Konrad ;

Oswald, Norbert .

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :16801-16810

← 1 2 3 4 5 →