AFPN: Attention-guided Feature Partition Network for Cross-view Geo-localization

被引：0

作者：

Lin, Zhifeng ^{[1
]}

Huang, Ranran ^{[2
]}

Cai, Jiancheng ^{[2
]}

Liu, Xinmin ^{[2
]}

Ding, Changxing ^{[1
]}

Chai, Zhenhua ^{[2
]}

机构：

[1] South China Univ Technol, Shenzhen, Peoples R China

[2] Meituan, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 2023 WORKSHOP ON UAVS IN MULTIMEDIA: CAPTURING THE WORLD FROM A NEW PERSPECTIVE, UAVM 2023 | 2023年

关键词：

Drone; Geo-localization; Image Retrieval; Attention; Transformer;

D O I：

10.1145/3607834.3616563

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Cross-view geo-localization is to retrieve images of the same geographic target from different platforms. Since drones have received increasing attention in recent years because of their ability to capture high-quality multimedia data from the sky, we focus on image retrieval from the drone platform to the satellite platform in this paper. We propose an attention-guided feature partition network (AFPN) which leverages learnable spatial attention maps to divide the global high-level feature map into the class-aware foreground and the class-agnostic background feature in an end-to-end learning manner. Our backbone is based on the powerful vision transformer to model long-range global dependencies between patches. Data augmentation and multiple sampling strategies are also adopted in our experiments. Our method achieves Recall@1 accuracy at 95.60% on University-1652 and 94.48% on University-160k, and ranks 2nd in the ACMMM23 Multimedia Drone Satellite Matching Challenge.

引用

页码：39 / 44

页数：6

共 20 条

[1] Ground-to-Aerial Image Geo-Localization ith a Hard Exemplar Reweighting Triplet Loss [J].

Cai, Sudong ;

Guo, Yulan ;

Khan, Salman ;

Hu, Jiwei ;

Wen, Gongjian .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8390-8399

[2] A Transformer-Based Feature Segmentation and Region Alignment Method for UAV-View Geo-Localization [J].

Dai, Ming ;

Hu, Jianhong ;

Zhuang, Jiedong ;

Zheng, Enhui .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) :4376-4389

[3]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[4]

Hadsell Raia., 2006, P IEEE COMPUTER SOC, V2, P1735, DOI [DOI 10.1109/CVPR.2006.100, 10.1109/CVPR.2006.100]

[5] CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization [J].

Hu, Sixing ;

Feng, Mengdan ;

Nguyen, Rang M. H. ;

Lee, Gim Hee .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7258-7267

[6] Lending Orientation to Neural Networks for Cross-view Geo-localization [J].

Liu, Liu ;

Li, Hongdong .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5607-5616

[7] Swin Transformer V2: Scaling Up Capacity and Resolution [J].

Liu, Ze ;

Hu, Han ;

Lin, Yutong ;

Yao, Zhuliang ;

Xie, Zhenda ;

Wei, Yixuan ;

Ning, Jia ;

Cao, Yue ;

Zhang, Zheng ;

Dong, Li ;

Wei, Furu ;

Guo, Baining .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :11999-12009

[8] A ConvNet for the 2020s [J].

Liu, Zhuang ;

Mao, Hanzi ;

Wu, Chao-Yuan ;

Feichtenhofer, Christoph ;

Darrell, Trevor ;

Xie, Saining .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :11966-11976

[9] Accurate Object Localization in Remote Sensing Images Based on Convolutional Neural Networks [J].

Long, Yang ;

Gong, Yiping ;

Xiao, Zhifeng ;

Liu, Qing .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (05) :2486-2498

[10]

Schroff F, 2015, PROC CVPR IEEE, P815, DOI 10.1109/CVPR.2015.7298682

← 1 2 →