Single image depth estimation using improved U-Net and edge-guide loss

被引:0
|
作者
He M. [1 ,2 ,3 ]
Gao Y. [1 ,2 ,3 ]
Long Y. [1 ,2 ,3 ]
机构
[1] College of Mechanical and Electronic Engineering, Northwest A & F University, Shaanxi, Yangling
[2] Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture and Rural Affairs, Shaanxi, Yangling
[3] Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Service, Shaanxi, Yangling
关键词
Deep learning; Depth estimation; Edge-guide loss; Hybrid dilated convolution;
D O I
10.1007/s11042-024-19235-3
中图分类号
学科分类号
摘要
Monocular depth estimation is regarded as a critical link in context-aware scene comprehension, which typically uses image data from a single point of view as the input to directly predict the depth value corresponding to each pixel in the image. However, predicting accurate object borders without replicating texture is difficult, resulting in missing tiny objects and blurry object edge in predicted depth images. In this paper, we propose a method for estimating monocular depth using an improved U-Net-based encoder-decoder network structure. We propose a new training loss term called edge-guide loss, which pushes the network to focus on object edges, resulting in better accuracy of the depth of tiny objects and edges. In the network, we build the encoder using DenseNet-169 and the decoder using 2 × bilinear up-sampling, skip-connections and hybrid dilated convolution. And skip-connections are used to send multi-scale feature maps from encoder to decoder. We specifically create a new loss function, edge-guide loss and three basic loss terms. We test our algorithm on the NYU Depth V2 dataset. The results of the experiments show that the proposed network can create depth image from a single RGB image with unambiguous borders and more tiny object depth. In the meantime, compared with state-of-the-art approaches, our proposed network outperforms for both visual quality and objective measurement. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:84619 / 84637
页数:18
相关论文
共 50 条
  • [31] Depth estimation from single-shot monocular endoscope image using image domain adaptation and edge-aware depth estimation
    Oda, Masahiro
    Itoh, Hayato
    Tanaka, Kiyohito
    Takabatake, Hirotsugu
    Mori, Masaki
    Natori, Hiroshi
    Mori, Kensaku
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2022, 10 (03) : 266 - 273
  • [32] Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration
    Tian, Yuntong
    Hu, Yan
    Ma, Yuhui
    Hao, Huaying
    Mou, Lei
    Yang, Jianlong
    Zhao, Yitian
    Liu, Jiang
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 1360 - 1363
  • [33] Electron Microscope Image Registration Using Laplacian Sharpening Transformer U-Net
    Xie, Kunzi
    Yang, Yixing
    Pagnucco, Maurice
    Song, Yang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VI, 2022, 13436 : 310 - 319
  • [34] Monocular Depth Estimation of 2D Images Based on Optimized U-net with Transfer Learning
    Yeh, Ming-Tsung
    Chen, Tsung-Chi
    Pai, Neng-Sheng
    Cheng, Chi-Huan
    SENSORS AND MATERIALS, 2024, 36 (06) : 2569 - 2583
  • [35] IECAU-Net: A Wood Defects Image Segmentation Network Based on Improved Attention U-Net and Attention Mechanism
    Dong, Yingda
    He, Chunguang
    Xiang, Xiaoyang
    Cui, Yuhan
    Kang, Yongkang
    Ding, Aiming
    Duo, Huaqiong
    Wang, Ximing
    BIORESOURCES, 2025, 20 (02): : 3545 - 3556
  • [36] Segmentation of thyroid glands and nodules in ultrasound images using the improved U-Net architecture
    Tianlei Zheng
    Hang Qin
    Yingying Cui
    Rong Wang
    Weiguo Zhao
    Shijin Zhang
    Shi Geng
    Lei Zhao
    BMC Medical Imaging, 23
  • [37] Segmentation of thyroid glands and nodules in ultrasound images using the improved U-Net architecture
    Zheng, Tianlei
    Qin, Hang
    Cui, Yingying
    Wang, Rong
    Zhao, Weiguo
    Zhang, Shijin
    Geng, Shi
    Zhao, Lei
    BMC MEDICAL IMAGING, 2023, 23 (01)
  • [38] Consistent convolution kernel design for missing shots interpolation using an improved U-net
    Han, Dong
    Wang, Benfeng
    Li, Jiakuo
    GEOPHYSICAL PROSPECTING, 2022, 70 (07) : 1193 - 1211
  • [39] Skin lesion segmentation using convolutional neural networks with improved U-Net architecture
    Iranpoor, Rasool
    Mahboob, Amir Soltany
    Shahbandegan, Shakiba
    Baniasadi, Nasrin
    2020 6TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2020,
  • [40] Pelvis Segmentation Using Multi-pass U-Net and Iterative Shape Estimation
    Wang, Chunliang
    Connolly, Bryan
    Lopes, Pedro Filipe de Oliveira
    Frangi, Alejandro F.
    Smedby, Orjan
    COMPUTATIONAL METHODS AND CLINICAL APPLICATIONS IN MUSCULOSKELETAL IMAGING, MSKI 2018, 2019, 11404 : 49 - 57