Recurrent Neural Network for (Un-)supervised Learning of Monocular Video Visual Odometry and Depth

被引:120
作者
Wang, Rui [1 ]
Pizer, Stephen M. [1 ]
Frahm, Jan-Michael [1 ]
机构
[1] Univ N Carolina, Chapel Hill, NC 27515 USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00570
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning-based, single-view depth estimation methods have recently shown highly promising results. However, such methods ignore one of the most important features for determining depth in the human vision system, which is motion. We propose a learning-based, multi-view dense depth map and odometry estimation method that uses Recurrent Neural Networks (RNN) and trains utilizing multi-view image reprojection and forward-backward flow-consistency losses. Our model can be trained in a supervised or even unsupervised mode. It is designed for depth and visual odometry estimation from video where the input frames are temporally correlated. However, it also generalizes to single-view depth estimation. Our method produces superior results to the state-of-the-art approaches for single-view and multi-view learning-based depth estimation on the KITTI driving dataset.
引用
收藏
页码:5647 / 5656
页数:10
相关论文
共 42 条
  • [1] [Anonymous], C COMP VIS PATT REC
  • [2] ATAPOURABARGHOU.A, 2018, PROC CVPR IEEE, P2800, DOI DOI 10.1109/CVPR.2018.00296
  • [3] 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction
    Choy, Christopher B.
    Xu, Danfei
    Gwak, Jun Young
    Chen, Kevin
    Savarese, Silvio
    [J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 628 - 644
  • [4] MonoSLAM: Real-time single camera SLAM
    Davison, Andrew J.
    Reid, Ian D.
    Molton, Nicholas D.
    Stasse, Olivier
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (06) : 1052 - 1067
  • [5] Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture
    Eigen, David
    Fergus, Rob
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2650 - 2658
  • [6] Direct Sparse Odometry
    Engel, Jakob
    Koltun, Vladlen
    Cremers, Daniel
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (03) : 611 - 625
  • [7] Deep Ordinal Regression Network for Monocular Depth Estimation
    Fu, Huan
    Gong, Mingming
    Wang, Chaohui
    Batmanghelich, Kayhan
    Tao, Dacheng
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2002 - 2011
  • [8] Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue
    Garg, Ravi
    VijayKumar, B. G.
    Carneiro, Gustavo
    Reid, Ian
    [J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 740 - 756
  • [9] Vision meets robotics: The KITTI dataset
    Geiger, A.
    Lenz, P.
    Stiller, C.
    Urtasun, R.
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) : 1231 - 1237
  • [10] Geiger A., 2012, C COMP VIS PATT REC