Convolutional Neural Networks for Geo-Localisation with a Single Aerial Image

被引:6
作者
Cabrera-Ponce, Aldrich A. [1 ]
Martinez-Carranza, Jose [1 ]
机构
[1] INAOE, Cholula, Mexico
关键词
Geo-localisation; Single Aerial Image; CNN; Deep Learning; SYSTEM;
D O I
10.1007/s11554-022-01207-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, Unmanned Aerial Vehicles (UAVs) navigating outdoors rely heavily on GPS for localisation and autonomous flight or applications for aerial photography re-cording with a GPS coordinate. However, GPS may fail or become unreliable, thus compromising the flight mission. Motivated by this scenario, in this work, we present a study on the use of popular Convolutional Neural Networks (CNN) to address the problem of geo-localisation from a single aerial image. We compare CNN-based architectures from the state-of-the-art, and introduce a compact architecture to speed up the inference process without affecting the estimation error. For our experiments, aerial images were recorded with a monocular camera onboard a UAV, flying outdoors with a height between 20 to 25 metres. On average, our compact network achieves a minimum estimation error of 2.8 metres and a maximum of 6.1 metres, which is comparable to the performance of other networks in the state-of-the-art. However, our network achieves on average an operation frequency of 103 fps versus 69 fps achieved by the fastest network in the comparison analysis. These results are promising since such speed would enable fast geo-localisation with cameras capturing images at those frame rates, which are useful for obtaining neater images than with conventional cameras working at 30 fps.
引用
收藏
页码:565 / 575
页数:11
相关论文
共 62 条
[1]  
Ahmadi M, 2018, IEEE INT NEW CIRC, P125, DOI 10.1109/NEWCAS.2018.8585433
[2]  
Altwaijry H., 2016, BMVC
[3]   Learning to Match Aerial Images with Deep Attentive Architectures [J].
Altwaijry, Hani ;
Trulls, Eduard ;
Hays, James ;
Fua, Pascal ;
Belongie, Serge .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3539-3547
[4]  
Amer K., 2019, ARXIV PREPRINT ARXIV
[5]  
Arreola L, 2018, INT CONF UNMAN AIRCR, P1248, DOI 10.1109/ICUAS.2018.8453349
[6]   Extending Absolute Pose Regression to Multiple Scenes [J].
Blanton, Hunter ;
Greenwell, Connor ;
Workman, Scott ;
Jacobs, Nathan .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :170-178
[7]  
Bodó Z, 2017, I S INTELL SYST INFO, P111, DOI 10.1109/SISY.2017.8080535
[8]  
Bresson G., 2019, URBAN LOCALIZATION S
[9]   Continual Learning for Multi-camera Relocalisation [J].
Cabrera-Ponce, Aldrich A. ;
Martin-Ortiz, Manuel ;
Martinez-Carranza, J. .
ADVANCES IN COMPUTATIONAL INTELLIGENCE (MICAI 2021), PT I, 2021, 13067 :289-302
[10]  
Cabrera-Ponce AA, 2019, 2019 INTERNATIONAL WORKSHOP ON RESEARCH, EDUCATION AND DEVELOPMENT OF UNMANNED AERIAL SYSTEMS (RED UAS 2019), P192, DOI [10.1109/reduas47371.2019.8999713, 10.1109/REDUAS47371.2019.8999713]