Deep Modular Network Architecture for Depth Estimation from Single Indoor Images

被引:0
作者
Ito, Seiya [1 ]
Kaneko, Naoshi [1 ]
Shinohara, Yuma [1 ]
Sumi, Kazuhiko [1 ]
机构
[1] Aoyama Gakuin Univ, Sagamihara, Kanagawa, Japan
来源
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I | 2019年 / 11129卷
关键词
Depth estimation; Convolutional Neural Network;
D O I
10.1007/978-3-030-11009-3_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel deep modular network architecture for indoor scene depth estimation from single RGB images. The proposed architecture consists of a main depth estimation network and two auxiliary semantic segmentation networks. Our insight is that semantic and geometrical structures in a scene are strongly correlated, thus we utilize global (i.e. room layout) and mid-level (i.e. objects in a room) semantic structures to enhance depth estimation. The first auxiliary network, or layout network, is responsible for room layout estimation to infer the positions of walls, floor, and ceiling of a room. The second auxiliary network, or object network, estimates per-pixel class labels of the objects in a scene, such as furniture, to give mid-level semantic cues. Estimated semantic structures are effectively fed into the depth estimation network using newly proposed discriminator networks, which discern the reliability of the estimated structures. The evaluation result shows that our architecture achieves significant performance improvements over previous approaches on the standard NYU Depth v2 indoor scene dataset.
引用
收藏
页码:324 / 336
页数:13
相关论文
共 50 条
[41]   Deblur and deep depth from single defocus image [J].
Saeed Anwar ;
Zeeshan Hayder ;
Fatih Porikli .
Machine Vision and Applications, 2021, 32
[42]   Deblur and deep depth from single defocus image [J].
Anwar, Saeed ;
Hayder, Zeeshan ;
Porikli, Fatih .
MACHINE VISION AND APPLICATIONS, 2021, 32 (01)
[43]   SINGLE IMAGE DEPTH ESTIMATION FROM IMAGE DESCRIPTORS [J].
Lin, Yu-Hsun ;
Cheng, Wen-Huang ;
Miao, Hsin ;
Ku, Tsung-Hao ;
Hsieh, Yung-Huan .
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, :809-812
[44]   DEPTH ESTIMATION FROM SINGLE IMAGE AND SEMANTIC PRIOR [J].
Hambarde, Praful ;
Dudhane, Akshay ;
Patil, Prashant W. ;
Murala, Subrahmanyam ;
Dhall, Abhinav .
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, :1441-1445
[45]   Depth Map Estimation from a Single Video Sequence [J].
Kuo, Tien-Ying ;
Hsieh, Cheng-Hong ;
Lo, Yi-Chung .
2013 IEEE 17TH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE), 2013, :103-104
[46]   Monocular Indoor Depth Estimation Method Based on Neural Networks with Constraints on Two-Dimensional Images and Three-Dimensional Geometry [J].
Sha Hao ;
Liu Yue ;
Wang Yongtian ;
Lu Chenguang ;
Zhao Mengze .
ACTA OPTICA SINICA, 2022, 42 (19)
[47]   D-Net: A Generalised and Optimised Deep Network for Monocular Depth Estimation [J].
Thompson, Joshua Luke ;
Phung, Son Lam ;
Bouzerdoum, Abdesselam .
IEEE ACCESS, 2021, 9 :134543-134555
[48]   Depth Estimation From a Single RGB Image Using Fine-Tuned Generative Adversarial Network [J].
Ul Islam, Naeem ;
Park, Jaebyung .
IEEE ACCESS, 2021, 9 (09) :32781-32794
[49]   Depth Estimation and Semantic Segmentation from a Single RGB Image Using a Hybrid Convolutional Neural Network [J].
Lin, Xiao ;
Sanchez-Escobedo, Dalila ;
Casas, Josep R. ;
Pardas, Montse .
SENSORS, 2019, 19 (08)
[50]   PCTNet: depth estimation from single structured light image with a parallel CNN-transformer network [J].
Zhu, Xinjun ;
Han, Zhiqiang ;
Zhang, Zhizhi ;
Song, Limei ;
Wang, Hongyi ;
Guo, Qinghua .
MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (08)