Recurrent Neural Network for (Un-)supervised Learning of Monocular Video Visual Odometry and Depth

被引:120
作者
Wang, Rui [1 ]
Pizer, Stephen M. [1 ]
Frahm, Jan-Michael [1 ]
机构
[1] Univ N Carolina, Chapel Hill, NC 27515 USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00570
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning-based, single-view depth estimation methods have recently shown highly promising results. However, such methods ignore one of the most important features for determining depth in the human vision system, which is motion. We propose a learning-based, multi-view dense depth map and odometry estimation method that uses Recurrent Neural Networks (RNN) and trains utilizing multi-view image reprojection and forward-backward flow-consistency losses. Our model can be trained in a supervised or even unsupervised mode. It is designed for depth and visual odometry estimation from video where the input frames are temporally correlated. However, it also generalizes to single-view depth estimation. Our method produces superior results to the state-of-the-art approaches for single-view and multi-view learning-based depth estimation on the KITTI driving dataset.
引用
收藏
页码:5647 / 5656
页数:10
相关论文
共 42 条
  • [21] Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965
  • [22] Smooth Neighbors on Teacher Graphs for Semi-supervised Learning
    Luo, Yucen
    Zhu, Jun
    Li, Mengxi
    Ren, Yong
    Zhang, Bo
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8896 - 8905
  • [23] Unsupervised Learning of Depth and Ego-Motion from Monocular Video Using 3D Geometric Constraints
    Mahjourian, Reza
    Wicke, Martin
    Angelova, Anelia
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5667 - 5675
  • [24] A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation
    Mayer, Nikolaus
    Ilg, Eddy
    Hausser, Philip
    Fischer, Philipp
    Cremers, Daniel
    Dosovitskiy, Alexey
    Brox, Thomas
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4040 - 4048
  • [25] ORB-SLAM: A Versatile and Accurate Monocular SLAM System
    Mur-Artal, Raul
    Montiel, J. M. M.
    Tardos, Juan D.
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2015, 31 (05) : 1147 - 1163
  • [26] Newcombe RA, 2011, IEEE I CONF COMP VIS, P2320, DOI 10.1109/ICCV.2011.6126513
  • [27] GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation
    Qi, Xiaojuan
    Liao, Renjie
    Liu, Zhengzhe
    Urtasun, Raquel
    Jia, Jiaya
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 283 - 291
  • [28] MOTION PARALLAX AS AN INDEPENDENT CUE FOR DEPTH-PERCEPTION
    ROGERS, B
    GRAHAM, M
    [J]. PERCEPTION, 1979, 8 (02) : 125 - 134
  • [29] A taxonomy and evaluation of dense two-frame stereo correspondence algorithms
    Scharstein, D
    Szeliski, R
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 47 (1-3) : 7 - 42
  • [30] Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556