AFPN: Attention-guided Feature Partition Network for Cross-view Geo-localization

被引:0
作者
Lin, Zhifeng [1 ]
Huang, Ranran [2 ]
Cai, Jiancheng [2 ]
Liu, Xinmin [2 ]
Ding, Changxing [1 ]
Chai, Zhenhua [2 ]
机构
[1] South China Univ Technol, Shenzhen, Peoples R China
[2] Meituan, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 2023 WORKSHOP ON UAVS IN MULTIMEDIA: CAPTURING THE WORLD FROM A NEW PERSPECTIVE, UAVM 2023 | 2023年
关键词
Drone; Geo-localization; Image Retrieval; Attention; Transformer;
D O I
10.1145/3607834.3616563
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Cross-view geo-localization is to retrieve images of the same geographic target from different platforms. Since drones have received increasing attention in recent years because of their ability to capture high-quality multimedia data from the sky, we focus on image retrieval from the drone platform to the satellite platform in this paper. We propose an attention-guided feature partition network (AFPN) which leverages learnable spatial attention maps to divide the global high-level feature map into the class-aware foreground and the class-agnostic background feature in an end-to-end learning manner. Our backbone is based on the powerful vision transformer to model long-range global dependencies between patches. Data augmentation and multiple sampling strategies are also adopted in our experiments. Our method achieves Recall@1 accuracy at 95.60% on University-1652 and 94.48% on University-160k, and ranks 2nd in the ACMMM23 Multimedia Drone Satellite Matching Challenge.
引用
收藏
页码:39 / 44
页数:6
相关论文
共 20 条
[1]   Ground-to-Aerial Image Geo-Localization ith a Hard Exemplar Reweighting Triplet Loss [J].
Cai, Sudong ;
Guo, Yulan ;
Khan, Salman ;
Hu, Jiwei ;
Wen, Gongjian .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8390-8399
[2]   A Transformer-Based Feature Segmentation and Region Alignment Method for UAV-View Geo-Localization [J].
Dai, Ming ;
Hu, Jianhong ;
Zhuang, Jiedong ;
Zheng, Enhui .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) :4376-4389
[3]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[4]  
Hadsell Raia., 2006, P IEEE COMPUTER SOC, V2, P1735, DOI [DOI 10.1109/CVPR.2006.100, 10.1109/CVPR.2006.100]
[5]   CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization [J].
Hu, Sixing ;
Feng, Mengdan ;
Nguyen, Rang M. H. ;
Lee, Gim Hee .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7258-7267
[6]   Lending Orientation to Neural Networks for Cross-view Geo-localization [J].
Liu, Liu ;
Li, Hongdong .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5607-5616
[7]   Swin Transformer V2: Scaling Up Capacity and Resolution [J].
Liu, Ze ;
Hu, Han ;
Lin, Yutong ;
Yao, Zhuliang ;
Xie, Zhenda ;
Wei, Yixuan ;
Ning, Jia ;
Cao, Yue ;
Zhang, Zheng ;
Dong, Li ;
Wei, Furu ;
Guo, Baining .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :11999-12009
[8]   A ConvNet for the 2020s [J].
Liu, Zhuang ;
Mao, Hanzi ;
Wu, Chao-Yuan ;
Feichtenhofer, Christoph ;
Darrell, Trevor ;
Xie, Saining .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :11966-11976
[9]   Accurate Object Localization in Remote Sensing Images Based on Convolutional Neural Networks [J].
Long, Yang ;
Gong, Yiping ;
Xiao, Zhifeng ;
Liu, Qing .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (05) :2486-2498
[10]  
Schroff F, 2015, PROC CVPR IEEE, P815, DOI 10.1109/CVPR.2015.7298682