Recurrent Neural Network for (Un-)supervised Learning of Monocular Video Visual Odometry and Depth

Cited by: 120

Authors:

Wang, Rui [1]

Pizer, Stephen M. [1]

Frahm, Jan-Michael [1]

Affiliations:

[1] Univ N Carolina, Chapel Hill, NC 27515 USA

Source:

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019

DOI:

10.1109/CVPR.2019.00570

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Deep learning-based, single-view depth estimation methods have recently shown highly promising results. However, such methods ignore one of the most important features for determining depth in the human vision system, which is motion. We propose a learning-based, multi-view dense depth map and odometry estimation method that uses Recurrent Neural Networks (RNN) and trains utilizing multi-view image reprojection and forward-backward flow-consistency losses. Our model can be trained in a supervised or even unsupervised mode. It is designed for depth and visual odometry estimation from video where the input frames are temporally correlated. However, it also generalizes to single-view depth estimation. Our method produces superior results to the state-of-the-art approaches for single-view and multi-view learning-based depth estimation on the KITTI driving dataset.
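The forward-backward flow-consistency idea named in the abstract can be illustrated with a minimal NumPy sketch. This is a generic formulation, not the authors' implementation: the function names, the nearest-neighbour sampling, and the mean-over-valid-pixels reduction are all assumptions made for illustration. For a pixel p with forward flow F_fw(p), the backward flow sampled at p + F_fw(p) should roughly cancel the forward flow, so the norm of their sum serves as a per-pixel error.

```python
import numpy as np

def forward_backward_consistency(flow_fw, flow_bw):
    """Per-pixel forward-backward flow error and validity mask.

    flow_fw, flow_bw: float arrays of shape (H, W, 2) holding (dx, dy)
    displacements from frame A to B and from frame B to A.
    Hypothetical helper for illustration only.
    """
    h, w, _ = flow_fw.shape
    ys, xs = np.mgrid[0:h, 0:w]
    # Where each pixel of frame A lands in frame B (nearest-neighbour rounding,
    # an assumption; learned pipelines typically use bilinear sampling).
    xt = np.rint(xs + flow_fw[..., 0]).astype(int)
    yt = np.rint(ys + flow_fw[..., 1]).astype(int)
    # Pixels that leave the image have no backward flow to compare against.
    valid = (xt >= 0) & (xt < w) & (yt >= 0) & (yt < h)
    xt_c = np.clip(xt, 0, w - 1)
    yt_c = np.clip(yt, 0, h - 1)
    bw_at_target = flow_bw[yt_c, xt_c]
    # Consistent flows cancel: F_fw(p) + F_bw(p + F_fw(p)) should be near zero.
    err = np.linalg.norm(flow_fw + bw_at_target, axis=-1)
    err = np.where(valid, err, 0.0)
    return err, valid

def consistency_loss(flow_fw, flow_bw):
    """Mean consistency error over valid pixels (assumed reduction)."""
    err, valid = forward_backward_consistency(flow_fw, flow_bw)
    n = valid.sum()
    return float(err.sum() / n) if n > 0 else 0.0
```

In this sketch, a uniform one-pixel shift to the right paired with a uniform one-pixel shift to the left gives zero loss, while pairing the same forward flow with an all-zero backward flow gives an error of one pixel at every valid location.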

Pages: 5647-5656

Page count: 10

