CCR: A Counterfactual Causal Reasoning-Based Method for Cross-View Geo-Localization

被引:3
作者
Du, Haolin [1 ]
He, Jingfei [1 ]
Zhao, Yuanqing [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin Key Lab Elect Mat & Devices, Tianjin 300401, Peoples R China
基金
中国国家自然科学基金;
关键词
Image retrieval; drone; cross-view; geolocalization; counterfactual causal reasoning;
D O I
10.1109/TCSVT.2024.3425509
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Cross-view geo-localization seeks to match geographic locations using images from varied sources, including drones and satellites. Interpreting images captured by drones poses significant challenges due to the varying positions and scales resulting from the camera's aerial perspective. Traditional approaches have primarily focused on harnessing contextual cues, which may lead to overfitting. Therefore, it is crucial to find an optimal balance between leveraging contextual details and identifying relevant features. To address this, we introduce a novel method for cross-view geo-localization that employs counterfactual causal reasoning (CCR). This method aims to refine the model's focus, ensuring a balanced emphasis on both the intricate details of the target structure and its broader contextual environment. Our method incorporates an Adaptive Dimension Interaction Block (ADIB), which effectively discerns feature interactions across multiple dimensions, enhanced by counterfactual causal reasoning to improve recognition of target structures and their contexts. In tasks of image-based drone-view target localization and drone navigation, our method achieves superior performance on the University-1652 and SUES-200 benchmark datasets. The code and model files will be made available at https://github.com/Cyberpunk1998/CCR.
引用
收藏
页码:11630 / 11643
页数:14
相关论文
共 63 条
[1]  
Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
[2]  
Bansal M., 2011, P 19 ACM INT C MULT, P1125
[3]  
Brar S., 2015, Tech. Rep. 2015.11.8, V8
[4]  
Bui DV, 2022, J ROBOT NETW ARTIF L, V9, P275
[5]   Semantic Cross-View Matching [J].
Castaldo, Francesco ;
Zamir, Amir ;
Angst, Roland ;
Palmieri, Francesco ;
Savarese, Silvio .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, :1044-1052
[6]   Point-to-Box Network for Accurate Object Detection via Single Point Supervision [J].
Chen, Pengfei ;
Yu, Xuehui ;
Han, Xumeng ;
Hassan, Najmul ;
Wang, Kai ;
Li, Jiachen ;
Zhao, Jian ;
Shi, Humphrey ;
Han, Zhenjun ;
Ye, Qixiang .
COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 :51-67
[7]   A Transformer-Based Feature Segmentation and Region Alignment Method for UAV-View Geo-Localization [J].
Dai, Ming ;
Hu, Jianhong ;
Zhuang, Jiedong ;
Zheng, Enhui .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) :4376-4389
[8]   Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification [J].
Deng, Weijian ;
Zheng, Liang ;
Ye, Qixiang ;
Kang, Guoliang ;
Yang, Yi ;
Jiao, Jianbin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :994-1003
[9]   A Practical Cross-View Image Matching Method between UAV and Satellite for UAV-Based Geo-Localization [J].
Ding, Lirong ;
Zhou, Ji ;
Meng, Lingxuan ;
Long, Zhiyong .
REMOTE SENSING, 2021, 13 (01) :1-22
[10]  
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]