Deep Modular Network Architecture for Depth Estimation from Single Indoor Images

被引:0
作者
Ito, Seiya [1 ]
Kaneko, Naoshi [1 ]
Shinohara, Yuma [1 ]
Sumi, Kazuhiko [1 ]
机构
[1] Aoyama Gakuin Univ, Sagamihara, Kanagawa, Japan
来源
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I | 2019年 / 11129卷
关键词
Depth estimation; Convolutional Neural Network;
D O I
10.1007/978-3-030-11009-3_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel deep modular network architecture for indoor scene depth estimation from single RGB images. The proposed architecture consists of a main depth estimation network and two auxiliary semantic segmentation networks. Our insight is that semantic and geometrical structures in a scene are strongly correlated, thus we utilize global (i.e. room layout) and mid-level (i.e. objects in a room) semantic structures to enhance depth estimation. The first auxiliary network, or layout network, is responsible for room layout estimation to infer the positions of walls, floor, and ceiling of a room. The second auxiliary network, or object network, estimates per-pixel class labels of the objects in a scene, such as furniture, to give mid-level semantic cues. Estimated semantic structures are effectively fed into the depth estimation network using newly proposed discriminator networks, which discern the reliability of the estimated structures. The evaluation result shows that our architecture achieves significant performance improvements over previous approaches on the standard NYU Depth v2 indoor scene dataset.
引用
收藏
页码:324 / 336
页数:13
相关论文
共 50 条
[31]   Depth Estimation by Parameter Transfer With a Lightweight Model for Single Still Images [J].
Qin, Hongwei ;
Li, Xiu ;
Wang, Yangang ;
Zhang, Yongbing ;
Dai, Qionghai .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (04) :748-759
[32]   Depth Estimation from Monocular Images Using Dilated Convolution and Uncertainty Learning [J].
Ma, Haojie ;
Ding, Yinzhang ;
Wang, Lianghao ;
Zhang, Ming ;
Li, Dongxiao .
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 :13-23
[33]   EFFICIENT DEPTH ESTIMATION FROM SINGLE IMAGE [J].
Zhou, Wei ;
Dai, Yuchao ;
He, Renjie .
2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, :296-300
[34]   A geometry-aware deep network for depth estimation in monocular endoscopy [J].
Yang, Yongming ;
Shao, Shuwei ;
Yang, Tao ;
Wang, Peng ;
Yang, Zhuo ;
Wu, Chengdong ;
Liu, Hao .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
[35]   Deep Joint Depth Estimation and Color Correction From Monocular Underwater Images Based on Unsupervised Adaptation Networks [J].
Ye, Xinchen ;
Li, Zheng ;
Sun, Baoli ;
Wang, Zhihui ;
Xu, Rui ;
Li, Haojie ;
Fan, Xin .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) :3995-4008
[36]   Depth estimation in turbid media from stack of epi-illuminated microscopy images, using deep learning [J].
Ghosh, Anindya ;
Hohmann, Martin ;
Klampfl, Florian ;
Schmidt, Michael .
TISSUE OPTICS AND PHOTONICS III, 2024, 13010
[37]   Depth Estimation of Monocular Road Images Based on Pyramid Scene Analysis Network [J].
Zhou Wujie ;
Pan Ting ;
Gu Pengli ;
Zhai Zhinian .
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (10) :2509-2515
[38]   Self-supervised Generative Adversarial Network for Depth Estimation in Laparoscopic Images [J].
Huang, Baoru ;
Zheng, Jian-Qing ;
Nguyen, Anh ;
Tuch, David ;
Vyas, Kunal ;
Giannarou, Stamatia ;
Elson, Daniel S. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT IV, 2021, 12904 :227-237
[39]   Depth Estimation of Single Defocused Images Based on Multi-Feature Fusion [J].
Cao, Fengyun .
TRAITEMENT DU SIGNAL, 2021, 38 (05) :1353-1360
[40]   Depth Reconstruction from Single Images Using a Convolutional Neural Network and a Condition Random Field Model [J].
Liu, Dan ;
Liu, Xuejun ;
Wu, Yiguang .
SENSORS, 2018, 18 (05)