Single image depth estimation using improved U-Net and edge-guide loss

被引:0
|
作者
He M. [1 ,2 ,3 ]
Gao Y. [1 ,2 ,3 ]
Long Y. [1 ,2 ,3 ]
机构
[1] College of Mechanical and Electronic Engineering, Northwest A & F University, Shaanxi, Yangling
[2] Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture and Rural Affairs, Shaanxi, Yangling
[3] Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Service, Shaanxi, Yangling
关键词
Deep learning; Depth estimation; Edge-guide loss; Hybrid dilated convolution;
D O I
10.1007/s11042-024-19235-3
中图分类号
学科分类号
摘要
Monocular depth estimation is regarded as a critical link in context-aware scene comprehension, which typically uses image data from a single point of view as the input to directly predict the depth value corresponding to each pixel in the image. However, predicting accurate object borders without replicating texture is difficult, resulting in missing tiny objects and blurry object edge in predicted depth images. In this paper, we propose a method for estimating monocular depth using an improved U-Net-based encoder-decoder network structure. We propose a new training loss term called edge-guide loss, which pushes the network to focus on object edges, resulting in better accuracy of the depth of tiny objects and edges. In the network, we build the encoder using DenseNet-169 and the decoder using 2 × bilinear up-sampling, skip-connections and hybrid dilated convolution. And skip-connections are used to send multi-scale feature maps from encoder to decoder. We specifically create a new loss function, edge-guide loss and three basic loss terms. We test our algorithm on the NYU Depth V2 dataset. The results of the experiments show that the proposed network can create depth image from a single RGB image with unambiguous borders and more tiny object depth. In the meantime, compared with state-of-the-art approaches, our proposed network outperforms for both visual quality and objective measurement. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:84619 / 84637
页数:18
相关论文
共 50 条
  • [21] Improved U-Net Models and Its Applications in Medical Image Segmentation: A Review
    Zhang Huan
    Qiu Dawei
    Feng Yibo
    Liu Jing
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (02)
  • [22] Extraction of fractures in shale CT images using improved U-Net
    Wu, Xiang
    Wang, Fei
    Zhang, Xiaoqiu
    Han, Bohua
    Liu, Qianru
    Zhang, Yonghao
    ENERGY GEOSCIENCE, 2024, 5 (02):
  • [23] Research on thyroid nodule segmentation using an improved U-Net network
    Xu, Peng
    REVISTA INTERNACIONAL DE METODOS NUMERICOS PARA CALCULO Y DISENO EN INGENIERIA, 2024, 40 (02):
  • [24] CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING IMPROVED U-NET
    Wang, Yong
    Zhang, Dongfang
    Dai, Guangming
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2020, 30 (03) : 399 - 413
  • [25] IMPROVED DEEP LEARNING ARCHITECTURE FOR DEPTH ESTIMATION FROM SINGLE IMAGE
    Abuowaida, Suhaila F. A.
    Chan, Huah Yong
    JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2020, 6 (04): : 434 - 445
  • [26] PET scatter estimation using deep learning U-Net architecture
    Laurent, Baptiste
    Bousse, Alexandre
    Merlin, Thibaut
    Nekolla, Stephan
    Visvikis, Dimitris
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (06)
  • [27] Medical Image Segmentation Using U-Net and Progressive Neuron Expansion
    Paheding, Sidike
    Reyes, Abel A.
    Alam, Mohammad
    Asari, Vijayan K.
    PATTERN RECOGNITION AND TRACKING XXXIII, 2022, 12101
  • [28] DepthCut: improved depth edge estimation using multiple unreliable channels
    Paul Guerrero
    Holger Winnemöller
    Wilmot Li
    Niloy J. Mitra
    The Visual Computer, 2018, 34 : 1165 - 1176
  • [29] DepthCut: improved depth edge estimation using multiple unreliable channels
    Guerrero, Paul
    Winnemoller, Holger
    Li, Wilmot
    Mitra, Niloy J.
    VISUAL COMPUTER, 2018, 34 (09) : 1165 - 1176
  • [30] Liver Tumor Computed Tomography Image Segmentation Based on an Improved U-Net Model
    Li, Hefu
    Liang, Binmei
    APPLIED SCIENCES-BASEL, 2023, 13 (20):