Monocular depth estimation based on dense connections

被引:0
作者
Wang, Quande [1 ]
Cheng, Kai [1 ]
机构
[1] School of Electronic Information, Wuhan University, Wuhan
来源
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition) | 2023年 / 51卷 / 11期
关键词
computer vision; deep learning; encoder-decoder structure; monocular depth estimation; multi-scale loss function;
D O I
10.13245/j.hust.229472
中图分类号
学科分类号
摘要
To solve the problems of blurred edges of depth map objects and poor image quality caused by the repetition of simple up-sampling operations in the current monocular depth estimation methods, a densely connected monocular depth estimation method was proposed.The method used an end-to-end encoder and decoder architecture for depth estimation from a single RGB image. The encoder introduced the high-performance convolutional neural network EfficientNet-B5, which could efficiently extract the global context features of images. The decoder was designed as a densely connected up-sampled feature pyramid structure to transfer global contextual features from low resolution to high resolution for higher quality depth maps. In addition, the depth estimation accuracy of object edges was further improved by designing a full-resolution multi-scale loss function.The training and testing results on the NYU Depth V2 indoor scene depth dataset and the KITTI outdoor scene depth dataset show that the proposed method can produce high-precision depth estimation results, and the predicted depth maps have clear edges and well-defined outlines.The designed ablation experiments could fully validate the reasonableness of the proposed method modules. © 2023 Huazhong University of Science and Technology. All rights reserved.
引用
收藏
页码:75 / 82
页数:7
相关论文
共 26 条
[1]  
RASOULI A, TSOTSOS J K., Autonomous vehicles that interact with pedestrians: a survey of theory and practice[J], IEEE Transactions on Intelligent Transportation Systems, 21, 3, pp. 900-918, (2020)
[2]  
HUSSAIN R, ZEADALLY S., Autonomous cars: research results, issues, and future challenges, IEEE Communications Surveys & Tutorials, 21, 2, pp. 1275-1313, (2018)
[3]  
HAN X F, LAGA H, BENNAMOUN M., Image based 3D object reconstruction: state-of-the-art and trends in the deep learning era[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 43, 5, pp. 1578-1604, (2019)
[4]  
KHAMIS S, FANELLO S, RHEMANN C, Et al., Stereonet:guided hierarchical refinement for real-time edge aware depth prediction, Proc of the European Conference on Computer Vision (ECCV), pp. 573-590, (2018)
[5]  
RANFTL R, VINEET V, CHEN Q, Et al., Dense monocular depth estimation in complex dynamic scenes, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4058-4066, (2016)
[6]  
EIGEN D, PUHRSCH C, FERGUS R., Depth map prediction from a single image using a multi-scale deep network, Proc of Advances in Neural Information Processing Systems, pp. 2366-2374, (2014)
[7]  
FU H, GONG M, WANG C, Et al., Deep ordinal regression network for monocular depth estimation, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2002-2011, (2018)
[8]  
48, 5, pp. 7-12, (2020)
[9]  
LEE J H, HAN M K, KO D W, Et al., From big to small: multi-scale local planar guidance for monocular depth estimation
[10]  
SONG M, LIM S, KIM W., Monocular depth estimation using laplacian pyramid-based depth residuals[J], IEEE Transactions on Circuits and Systems for Video Technology, 31, 11, pp. 4381-4393, (2021)