Self-Supervised Monocular Depth Estimation Using Global and Local Mixed Multi-Scale Feature Enhancement Network for Low-Altitude UAV Remote Sensing

被引:4
作者
Chang, Rong [1 ]
Yu, Kailong [2 ]
Yang, Yang [2 ]
机构
[1] Yunnan Power Grid Co Ltd Kunming, Yuxi Power Supply Bur, Yuxi 653100, Peoples R China
[2] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Peoples R China
基金
中国国家自然科学基金;
关键词
monocular depth estimation; self-supervised learning; complex scene; Unmanned Aerial Vehicles (UAVs);
D O I
10.3390/rs15133275
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Estimating depth from a single low-altitude aerial image captured by an Unmanned Aerial System (UAS) has become a recent research focus. This method has a wide range of applications in 3D modeling, digital terrain models, and target detection. Traditional 3D reconstruction requires multiple images, while UAV depth estimation can complete the task with just one image, thus having higher efficiency and lower cost. This study aims to use deep learning to estimate depth from a single UAS low-altitude remote sensing image. We propose a novel global and local mixed multi-scale feature enhancement network for monocular depth estimation in low-altitude remote sensing scenes, which exchanges information between feature maps of different scales during the forward process through convolutional operations while maintaining the maximum scale feature map. At the same time, we propose a Global Scene Attention (GSA) module in the decoder part of the depth network, which can better focus on object edges, distinguish foreground and background in the UAV field of view, and ultimately demonstrate excellent performance. Finally, we design several loss functions for the low-altitude remote sensing field to constrain the network to reach its optimal state. We conducted extensive experiments on public dataset UAVid 2020, and the results show that our method outperforms state-of-the-art methods.
引用
收藏
页数:14
相关论文
共 41 条
  • [1] Application of unmanned aerial vehicles in earth resources monitoring: focus on evaluating potentials for forest monitoring in Ethiopia
    Berie, Habitamu Taddese
    Burud, Ingunn
    [J]. EUROPEAN JOURNAL OF REMOTE SENSING, 2018, 51 (01): : 326 - 335
  • [2] Unsupervised Scale-Consistent Depth Learning from Video
    Bian, Jia-Wang
    Zhan, Huangying
    Wang, Naiyan
    Li, Zhichao
    Zhang, Le
    Shen, Chunhua
    Cheng, Ming-Ming
    Reid, Ian
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (09) : 2548 - 2564
  • [3] Casser V, 2019, AAAI CONF ARTIF INTE, P8001
  • [4] Chang Shu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12364), P572, DOI 10.1007/978-3-030-58529-7_34
  • [5] Deep Ordinal Regression Network for Monocular Depth Estimation
    Fu, Huan
    Gong, Mingming
    Wang, Chaohui
    Batmanghelich, Kayhan
    Tao, Dacheng
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2002 - 2011
  • [6] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
  • [7] Digging Into Self-Supervised Monocular Depth Estimation
    Godard, Clement
    Mac Aodha, Oisin
    Firman, Michael
    Brostow, Gabriel
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3827 - 3837
  • [8] Unsupervised Monocular Depth Estimation with Left-Right Consistency
    Godard, Clement
    Mac Aodha, Oisin
    Brostow, Gabriel J.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6602 - 6611
  • [9] 3D Packing for Self-Supervised Monocular Depth Estimation
    Guizilini, Vitor
    Ambrus, Rares
    Pillai, Sudeep
    Raventos, Allan
    Gaidon, Adrien
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2482 - 2491
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778