Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments

Cited by: 23

Authors:

Dai, Ming [1]

Zheng, Enhui [2]

Feng, Zhenhua [3]

Qi, Lei [4]

Zhuang, Jiedong [5]

Yang, Wankou [1]

Affiliations:

[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China

[2] China Jiliang Univ, Unmanned Syst Applicat Technol Res Inst, Hangzhou 310018, Peoples R China

[3] Univ Surrey, Sch Comp Sci & Elect Engn, Guildford GU2 7XH, England

[4] Southeast Univ, Sch Comp Sci, Nanjing 210096, Peoples R China

[5] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310063, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024 / Vol. 33

Funding:

National Natural Science Foundation of China;

Keywords:

Autonomous aerial vehicles; Task analysis; Satellite images; Satellites; Location awareness; Drones; Web services; Unmanned aerial vehicle; geo-localization; transformer; image retrieval; NETWORK;

DOI:

10.1109/TIP.2023.3346279

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Unmanned Aerial Vehicles (UAVs) rely on satellite systems for stable positioning. However, due to limited satellite coverage or communication disruptions, UAVs may lose signals for positioning. In such situations, vision-based techniques can serve as an alternative, ensuring the self-positioning capability of UAVs. However, most of the existing datasets are developed for the geo-localization task of the objects captured by UAVs, rather than UAV self-positioning. Furthermore, the existing UAV datasets apply discrete sampling to synthetic data, such as Google Maps, neglecting the crucial aspects of dense sampling and the uncertainties commonly experienced in practical scenarios. To address these issues, this paper presents a new dataset, DenseUAV, that is the first publicly available dataset tailored for the UAV self-positioning task. DenseUAV adopts dense sampling on UAV images obtained in low-altitude urban areas. In total, over 27K UAV- and satellite-view images of 14 university campuses are collected and annotated. In terms of methodology, we first verify the superiority of Transformers over CNNs for the proposed task. Then we incorporate metric learning into representation learning to enhance the model's discriminative capacity and to reduce the modality discrepancy. Besides, to facilitate joint learning from both the satellite and UAV views, we introduce a mutually supervised learning approach. Last, we enhance the Recall@K metric and introduce a new measurement, SDM@K, to evaluate both the retrieval and localization performance for the proposed task. As a result, the proposed baseline method achieves a remarkable Recall@1 score of 83.01% and an SDM@1 score of 86.50% on DenseUAV. The dataset and code have been made publicly available on https://github.com/Dmmm1997/DenseUAV.
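For readers unfamiliar with the retrieval metric named in the abstract, the following is a minimal sketch of the standard Recall@K computation for cross-view image retrieval. It is not the enhanced Recall@K or the SDM@K measure proposed in the paper, neither of which is defined in this record. The function name, the feature arrays, and the location labels are hypothetical and exist only for illustration; cosine similarity is assumed as the matching score.

```python
import numpy as np


def recall_at_k(query_feats, gallery_feats, query_labels, gallery_labels, k=1):
    """Standard Recall@K: fraction of queries whose top-k retrieved
    gallery items contain at least one item with the same label."""
    # L2-normalise so that the dot product equals cosine similarity.
    q = query_feats / np.linalg.norm(query_feats, axis=1, keepdims=True)
    g = gallery_feats / np.linalg.norm(gallery_feats, axis=1, keepdims=True)
    sim = q @ g.T  # shape: (num_queries, num_gallery)
    # Indices of the k most similar gallery items for each query.
    topk = np.argsort(-sim, axis=1)[:, :k]
    hits = (gallery_labels[topk] == query_labels[:, None]).any(axis=1)
    return float(hits.mean())


# Hypothetical toy data: 3 UAV-view queries, 3 satellite-view gallery items.
demo_gallery = np.eye(3)
demo_gallery_labels = np.array([0, 1, 2])
demo_queries = np.array([[1.0, 0.1, 0.0],
                         [0.0, 1.0, 0.2],
                         [0.6, 0.0, 0.5]])
demo_query_labels = np.array([0, 1, 2])

r1 = recall_at_k(demo_queries, demo_gallery,
                 demo_query_labels, demo_gallery_labels, k=1)
r2 = recall_at_k(demo_queries, demo_gallery,
                 demo_query_labels, demo_gallery_labels, k=2)
```

In this toy setup the third query is closest to the wrong gallery item, so it counts as a miss at K=1 but as a hit at K=2.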


Pages: 493-508

Page count: 16

