Self-Supervised Monocular Scene Flow Estimation

Cited by: 44

Authors:

Hur, Junhwa [1]

Roth, Stefan [1]

Affiliation:

[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany

Source:

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020) | 2020

DOI:

10.1109/CVPR42600.2020.00742

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Scene flow estimation has been receiving increasing attention for 3D environment perception. Monocular scene flow estimation - obtaining 3D structure and 3D motion from two temporally consecutive images - is a highly ill-posed problem, and practical solutions are lacking to date. We propose a novel monocular scene flow method that yields competitive accuracy and real-time performance. By taking an inverse problem view, we design a single convolutional neural network (CNN) that successfully estimates depth and 3D motion simultaneously from a classical optical flow cost volume. We adopt self-supervised learning with 3D loss functions and occlusion reasoning to leverage unlabeled data. We validate our design choices, including the proxy loss and augmentation setup. Our model achieves state-of-the-art accuracy among unsupervised/self-supervised learning approaches to monocular scene flow, and yields competitive results for the optical flow and monocular depth estimation sub-tasks. Semi-supervised fine-tuning further improves the accuracy and yields promising results in real-time.

Pages: 7394-7403

Page count: 10
