Self-Supervised Monocular Depth Estimation Using Global and Local Mixed Multi-Scale Feature Enhancement Network for Low-Altitude UAV Remote Sensing

被引：4

作者：

Chang, Rong ^{[1
]}

Yu, Kailong ^{[2
]}

Yang, Yang ^{[2
]}

机构：

[1] Yunnan Power Grid Co Ltd Kunming, Yuxi Power Supply Bur, Yuxi 653100, Peoples R China

[2] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Peoples R China

来源：

REMOTE SENSING | 2023年 / 15卷 / 13期

基金：

中国国家自然科学基金;

关键词：

monocular depth estimation; self-supervised learning; complex scene; Unmanned Aerial Vehicles (UAVs);

D O I：

10.3390/rs15133275

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Estimating depth from a single low-altitude aerial image captured by an Unmanned Aerial System (UAS) has become a recent research focus. This method has a wide range of applications in 3D modeling, digital terrain models, and target detection. Traditional 3D reconstruction requires multiple images, while UAV depth estimation can complete the task with just one image, thus having higher efficiency and lower cost. This study aims to use deep learning to estimate depth from a single UAS low-altitude remote sensing image. We propose a novel global and local mixed multi-scale feature enhancement network for monocular depth estimation in low-altitude remote sensing scenes, which exchanges information between feature maps of different scales during the forward process through convolutional operations while maintaining the maximum scale feature map. At the same time, we propose a Global Scene Attention (GSA) module in the decoder part of the depth network, which can better focus on object edges, distinguish foreground and background in the UAV field of view, and ultimately demonstrate excellent performance. Finally, we design several loss functions for the low-altitude remote sensing field to constrain the network to reach its optimal state. We conducted extensive experiments on public dataset UAVid 2020, and the results show that our method outperforms state-of-the-art methods.

引用

页数：14

共 41 条

[1] Application of unmanned aerial vehicles in earth resources monitoring: focus on evaluating potentials for forest monitoring in Ethiopia
Berie, Habitamu Taddese
Burud, Ingunn
[J]. EUROPEAN JOURNAL OF REMOTE SENSING, 2018, 51 (01): : 326 - 335
[2] Unsupervised Scale-Consistent Depth Learning from Video
Bian, Jia-Wang
Zhan, Huangying
Wang, Naiyan
Li, Zhichao
Zhang, Le
Shen, Chunhua
Cheng, Ming-Ming
Reid, Ian
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (09) : 2548 - 2564
[3] Casser V, 2019, AAAI CONF ARTIF INTE, P8001
[4] Chang Shu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12364), P572, DOI 10.1007/978-3-030-58529-7_34
[5] Deep Ordinal Regression Network for Monocular Depth Estimation
Fu, Huan
Gong, Mingming
Wang, Chaohui
Batmanghelich, Kayhan
Tao, Dacheng
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2002 - 2011
[6] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[7] Digging Into Self-Supervised Monocular Depth Estimation
Godard, Clement
Mac Aodha, Oisin
Firman, Michael
Brostow, Gabriel
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3827 - 3837
[8] Unsupervised Monocular Depth Estimation with Left-Right Consistency
Godard, Clement
Mac Aodha, Oisin
Brostow, Gabriel J.
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6602 - 6611
[9] 3D Packing for Self-Supervised Monocular Depth Estimation
Guizilini, Vitor
Ambrus, Rares
Pillai, Sudeep
Raventos, Allan
Gaidon, Adrien
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2482 - 2491
[10] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778

← 1 2 3 4 5 →