Self-Supervised Monocular Scene Flow Estimation

被引:44
作者
Hur, Junhwa [1 ]
Roth, Stefan [1 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年
关键词
D O I
10.1109/CVPR42600.2020.00742
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene flow estimation has been receiving increasing attention for 3D environment perception. Monocular scene flow estimation - obtaining 3D structure and 3D motion from two temporally consecutive images - is a highly ill-posed problem, and practical solutions are lacking to date. We propose a novel monocular scene flow method that yields competitive accuracy and real-time performance. By taking an inverse problem view, we design a single convolutional neural network (CNN) that successfully estimates depth and 3D motion simultaneously from a classical optical flow cost volume. We adopt self-supervised learning with 3D loss functions and occlusion reasoning to leverage unlabeled data. We validate our design choices, including the proxy loss and augmentation setup. Our model achieves state-of-the-art accuracy among unsupervised/self-supervised learning approaches to monocular scene flow, and yields competitive results for the optical flow and monocular depth estimation sub-tasks. Semi-supervised fine-tuning further improves the accuracy and yields promising results in real-time.
引用
收藏
页码:7394 / 7403
页数:10
相关论文
共 64 条
[1]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/ICCV.2019.00287
[2]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00373
[3]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.01210
[4]  
[Anonymous], 2019, ECCV WORKSH, DOI DOI 10.1007/978-3-030-11021-543
[5]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00826
[6]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00337
[7]   Multi-view Scene Flow Estimation: A View Centered Variational Approach [J].
Basha, Tali ;
Moses, Yael ;
Kiryati, Nahum .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 101 (01) :6-21
[8]   Bounding Boxes, Segmentations and Object Coordinates: How Important is Recognition for 3D Scene Flow Estimation in Autonomous Driving Scenarios? [J].
Behl, Aseem ;
Jafari, Omid Hosseini ;
Mustikovela, Siva Karthik ;
Abu Alhaija, Hassan ;
Rother, Carsten ;
Geiger, Andreas .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2593-2602
[9]   A Naturalistic Open Source Movie for Optical Flow Evaluation [J].
Butler, Daniel J. ;
Wulff, Jonas ;
Stanley, Garrett B. ;
Black, Michael J. .
COMPUTER VISION - ECCV 2012, PT VI, 2012, 7577 :611-625
[10]   Self-supervised Learning with Geometric Constraints in Monocular Video Connecting Flow, Depth, and Camera [J].
Chen, Yuhua ;
Schmid, Cordelia ;
Sminchisescu, Cristian .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7062-7071