IMPROVED DEEP LEARNING ARCHITECTURE FOR DEPTH ESTIMATION FROM SINGLE IMAGE

被引:7
作者
Abuowaida, Suhaila F. A. [1 ]
Chan, Huah Yong [1 ]
机构
[1] Univ Sains Malaysia, Sch Comp Sci, George Town 11800, Malaysia
来源
JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY | 2020年 / 6卷 / 04期
关键词
Depth estimation; Single image; Deep learning; Encoder-decoder;
D O I
10.5455/jjcit.71-1593368945
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Numerous benefits of depth estimation from the single image field on medicine, robot video games and 3D reality applications have garnered attention in recent years. Closely related to the third dimension of depth, this operation can be accomplished using human vision, though considered challenging due to the various issues when using computer vision. The differences in the geometry, the texture of the scene, the occlusion scene boundaries and the inherent ambiguity exist because of the minimal information that could be gathered from a single image. This paper, therefore, proposes a novel depth estimation in the field of architecture, which includes the stages that can manage depth estimation from a single RGB image. An encoder-decoder architecture has been proposed, based on the improvement yielded from DenseNet that extracted the map of an image using skip connection technique. This paper also takes on the reverse Huber loss function that essentially suits our architecture hand driven by the value distributions that are commonly present in depth maps. Experimental results have indicated that the depth estimation architecture that employs the NYU Depth v2 dataset has a better performance than the other state-of-the-art methods that tend to have fewer parameters and require fewer training time.
引用
收藏
页码:434 / 445
页数:12
相关论文
共 33 条
[1]  
Abrams A, 2012, LECT NOTES COMPUT SC, V7573, P357, DOI 10.1007/978-3-642-33709-3_26
[2]  
Alhashim I, 2019, Arxiv, DOI arXiv:1812.11941
[3]   Real-Time Monocular Depth Estimation using Synthetic Data with Domain Adaptation via Image Style Transfer [J].
Atapour-Abarghouei, Amir ;
Breckon, Toby P. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2800-2810
[4]  
Bebie T, 2000, COMPUT GRAPH FORUM, V19, pC391, DOI 10.1111/1467-8659.00431
[5]  
Carvalho M, 2018, IEEE IMAGE PROC, P2915, DOI 10.1109/ICIP.2018.8451312
[6]   Instance-aware Semantic Segmentation via Multi-task Network Cascades [J].
Dai, Jifeng ;
He, Kaiming ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3150-3158
[7]  
Eigen D, 2014, ADV NEUR IN, V27
[8]   Deep Ordinal Regression Network for Monocular Depth Estimation [J].
Fu, Huan ;
Gong, Mingming ;
Wang, Chaohui ;
Batmanghelich, Kayhan ;
Tao, Dacheng .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2002-2011
[9]   Depth estimation from single monocular images using deep hybrid network [J].
Grigorev, Aleksei ;
Jiang, Feng ;
Rho, Seungmin ;
Sori, Worku J. ;
Liu, Shaohui ;
Sai, Sergey .
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (18) :18585-18604
[10]   Detail Preserving Depth Estimation from a Single Image Using Attention Guided Networks [J].
Hao, Zhixiang ;
Li, Yu ;
You, Shaodi ;
Lu, Feng .
2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, :304-313