Temporally Consistent Depth Map Prediction Using Deep Convolutional Neural Network and Spatial-Temporal Conditional Random Field

Cited by: 2
Authors
Zhao, Xu-Ran [1]
Wang, Xun [1]
Chen, Qi-Chao [1]
Affiliations
[1] Zhejiang Gongshang Univ, Sch Comp & Informat Engn, Hangzhou 310018, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
depth estimation; temporal consistency; convolutional neural network; conditional random fields
DOI
10.1007/s11390-017-1735-x
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Methods based on deep convolutional neural networks (DCNNs) have recently kept setting new records on the task of predicting depth maps from monocular images. When applied to video-based applications such as 2D (2-dimensional) to 3D (3-dimensional) video conversion, however, these approaches tend to produce temporally inconsistent depth maps, since their CNN models are optimized over single frames. In this paper, we address this problem by introducing a novel spatial-temporal conditional random field (CRF) model into the DCNN architecture, which enforces temporal consistency between the depth maps estimated for consecutive video frames. In our approach, a temporally consistent superpixel (TSP) method is first applied to an image sequence to establish the correspondence of targets in consecutive frames. A DCNN is then used to regress the depth value of each temporal superpixel, followed by a spatial-temporal CRF layer that models the relationship of the estimated depths in both the spatial and temporal domains. The parameters of both the DCNN and CRF models are jointly optimized with back propagation. Experimental results show that our approach not only significantly enhances the temporal consistency of estimated depth maps over existing single-frame-based approaches, but also improves the depth estimation accuracy in terms of various evaluation metrics.
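The abstract does not state the CRF energy, so the following is only a minimal sketch of how a spatial-temporal CRF layer could smooth per-superpixel depths. It assumes a quadratic energy E(y) = sum_p (y_p - z_p)^2 + sum_(p,q) w_pq (y_p - y_q)^2, where z_p is the depth regressed by the network for superpixel p and the edges (p, q) link both spatial neighbours within a frame and corresponding superpixels in consecutive frames. Under that assumption the minimizer solves the linear system (I + L) y = z, with L the weighted graph Laplacian. The function name `crf_map_depth`, the edge lists, and the weights are hypothetical and are not taken from the paper.

```python
import numpy as np

def crf_map_depth(unary, edges, weights):
    """Minimize sum_p (y_p - z_p)^2 + sum_(p,q) w_pq (y_p - y_q)^2.

    unary   : regressed depths z, one per superpixel (assumed network output)
    edges   : list of (p, q) index pairs, spatial or temporal neighbours
    weights : non-negative pairwise weight w_pq for each edge (hypothetical)
    """
    z = np.asarray(unary, dtype=float)
    n = z.shape[0]
    # A = I + L, where L is the weighted graph Laplacian of the edge set.
    A = np.eye(n)
    for (p, q), w in zip(edges, weights):
        A[p, p] += w
        A[q, q] += w
        A[p, q] -= w
        A[q, p] -= w
    # Setting the gradient of the energy to zero gives A y = z.
    return np.linalg.solve(A, z)

# Two frames with two superpixels each: nodes 0, 1 (frame t), 2, 3 (frame t+1).
z = [2.0, 4.0, 2.6, 4.0]
spatial = [(0, 1), (2, 3)]
temporal = [(0, 2), (1, 3)]
y = crf_map_depth(z, spatial + temporal, [0.1, 0.1, 1.0, 1.0])
```

In this toy graph the temporal edges carry larger weights than the spatial ones, so the refined depths of corresponding superpixels in the two frames are pulled closer together than the raw regressions, while depth differences within a frame are largely kept.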
Pages: 443-456
Page count: 14