Single image depth estimation using improved U-Net and edge-guide loss

被引:0
|
作者
He M. [1 ,2 ,3 ]
Gao Y. [1 ,2 ,3 ]
Long Y. [1 ,2 ,3 ]
机构
[1] College of Mechanical and Electronic Engineering, Northwest A & F University, Shaanxi, Yangling
[2] Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture and Rural Affairs, Shaanxi, Yangling
[3] Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Service, Shaanxi, Yangling
关键词
Deep learning; Depth estimation; Edge-guide loss; Hybrid dilated convolution;
D O I
10.1007/s11042-024-19235-3
中图分类号
学科分类号
摘要
Monocular depth estimation is regarded as a critical link in context-aware scene comprehension, which typically uses image data from a single point of view as the input to directly predict the depth value corresponding to each pixel in the image. However, predicting accurate object borders without replicating texture is difficult, resulting in missing tiny objects and blurry object edge in predicted depth images. In this paper, we propose a method for estimating monocular depth using an improved U-Net-based encoder-decoder network structure. We propose a new training loss term called edge-guide loss, which pushes the network to focus on object edges, resulting in better accuracy of the depth of tiny objects and edges. In the network, we build the encoder using DenseNet-169 and the decoder using 2 × bilinear up-sampling, skip-connections and hybrid dilated convolution. And skip-connections are used to send multi-scale feature maps from encoder to decoder. We specifically create a new loss function, edge-guide loss and three basic loss terms. We test our algorithm on the NYU Depth V2 dataset. The results of the experiments show that the proposed network can create depth image from a single RGB image with unambiguous borders and more tiny object depth. In the meantime, compared with state-of-the-art approaches, our proposed network outperforms for both visual quality and objective measurement. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:84619 / 84637
页数:18
相关论文
共 50 条
  • [41] Monocular Image Depth Estimation Using a Conditional Generative Adversarial Net
    Zhang, Xiaofeng
    Chen, Shuo
    Xu, Qingyang
    Zhang, Xiaoxue
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9176 - 9180
  • [42] MARU-Net: Multiscale Attention Gated Residual U-Net With Contrastive Loss for SAR-Optical Image Matching
    Gazzea, Michele
    Sommervold, Oscar
    Arghandeh, Reza
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 4891 - 4899
  • [43] Scale Input Adapted Attention for Image Denoising Using a Densely Connected U-Net: SADE-Net
    Acar, Vedat
    Eksioglu, Ender M.
    COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2021), 2021, 12876 : 792 - 801
  • [44] A Novel Attention-Guided Enhanced U-Net With Hybrid Edge-Preserving Structural Loss for Low-Dose CT Image Denoising
    Zubair, Muhammad
    Md Rais, Helmi
    Alazemi, Talal
    IEEE ACCESS, 2025, 13 : 6909 - 6923
  • [45] Flood forecasting based on radar precipitation nowcasting using U-net and its improved models
    Li, Jianzhu
    Li, Leijing
    Zhang, Ting
    Xing, Haoyu
    Shi, Yi
    Li, Zhixia
    Wang, Congmei
    Liu, Jin
    JOURNAL OF HYDROLOGY, 2024, 632
  • [46] Performance Characterization of Single and Multi GPU Training of U-Net Architecture for Medical Image Segmentation Tasks
    Patel, Trupesh R.
    Bodduluri, Sandeep
    Anthony, Thomas
    Monroe, William S.
    Kandhare, Pravinkumar G.
    Robinson, John-Paul
    Nakhmani, Arie
    Zhang, Chengcui
    Bhatt, Surya P.
    Bangalore, Purushotham, V
    PEARC '19: PROCEEDINGS OF THE PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING ON RISE OF THE MACHINES (LEARNING), 2019,
  • [47] Multi Res U-Net Based Image Segmentation of Pulmonary Tuberculosis Using CT Images
    Ramkumar, M. O.
    Jayakumar, D.
    Yogesh, R.
    2020 7TH IEEE INTERNATIONAL CONFERENCE ON SMART STRUCTURES AND SYSTEMS (ICSSS 2020), 2020, : 332 - 335
  • [48] Remote Sensing Image Segmentation for Aircraft Recognition Using U-Net as Deep Learning Architecture
    Shaar, Fadi
    Yilmaz, Arif
    Topcu, Ahmet Ercan
    Alzoubi, Yehia Ibrahim
    APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [49] Brain Tumor Segmentation of MRI Images Using Processed Image Driven U-Net Architecture
    Arora, Anuja
    Jayal, Ambikesh
    Gupta, Mayank
    Mittal, Prakhar
    Satapathy, Suresh Chandra
    COMPUTERS, 2021, 10 (11)
  • [50] Enhanced medical image segmentation using U-Net with residual connections and dual attention mechanism
    Xiao, Leyi
    Song, Jiaojiao
    Xie, Xia
    Fan, Chaodong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 153