SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow

被引：20

作者：

Lang, Itai ^{[1
,2
]}

Aiger, Dror ^{[2
]}

Cole, Forrester ^{[2
]}

Avidan, Shai ^{[1
]}

Rubinstein, Michael ^{[2
]}

机构：

[1] Tel Aviv Univ, Tel Aviv, Israel

[2] Google Res, Mountain View, CA USA

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年

关键词：

D O I：

10.1109/CVPR52729.2023.00511

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Scene flow estimation is a long-standing problem in computer vision, where the goal is to find the 3D motion of a scene from its consecutive observations. Recently, there have been efforts to compute the scene flow from 3D point clouds. A common approach is to train a regression model that consumes source and target point clouds and outputs the per-point translation vector. An alternative is to learn point matches between the point clouds concurrently with regressing a refinement of the initial correspondence flow. In both cases, the learning task is very challenging since the flow regression is done in the free 3D space, and a typical solution is to resort to a large annotated synthetic dataset. We introduce SCOOP, a new method for scene flow estimation that can be learned on a small amount of data without employing ground-truth flow supervision. In contrast to previous work, we train a pure correspondence model focused on learning point feature representation and initialize the flow as the difference between a source point and its softly corresponding target point. Then, in the run-time phase, we directly optimize a flow refinement component with a self-supervised objective, which leads to a coherent and accurate flow field between the point clouds. Experiments on widespread datasets demonstrate the performance gains achieved by our method compared to existing leading techniques while using a fraction of the training data. Our code is publicly available(1).

引用

页码：5281 / 5290

页数：10

共 37 条

[1]

[Anonymous], 2022, P EUR C COMP VIS ECC, DOI DOI 10.1109/WCSP55476.2022.10039178

[2]

[Anonymous], 2008, P 7 IEEE INT C COMP

[3] Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation [J].

Brox, Thomas ;

Malik, Jitendra .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :500-513

[4] nuScenes: A multimodal dataset for autonomous driving [J].

Caesar, Holger ;

Bankiti, Varun ;

Lang, Alex H. ;

Vora, Sourabh ;

Liong, Venice Erin ;

Xu, Qiang ;

Krishnan, Anush ;

Pan, Yu ;

Baldan, Giancarlo ;

Beijbom, Oscar .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628

[5]

Chang A X, 2015, COMPUTER SCI, V1512, P3

[6] Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow Estimation [J].

Cheng, Wencan ;

Ko, Jong Hwan .

COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 :108-124

[7] SCALING ALGORITHMS FOR UNBALANCED OPTIMAL TRANSPORT PROBLEMS [J].

Chizat, Lenaic ;

Peyre, Gabriel ;

Schmitzer, Bernhard ;

Vialard, Francois-Xavier .

MATHEMATICS OF COMPUTATION, 2018, 87 (314) :2563-2609

[8]

Cuturi M., 2013, NEURIPS, P1, DOI 10.48550/arXiv.1306.0895

[9]

Deprelle Theo, 2019, Advances in Neural Information Processing Systems (NeurIPS), P3

[10] Weakly Supervised Learning of Rigid 3D Scene Flow [J].

Gojcic, Zan ;

Litany, Or ;

Wieser, Andreas ;

Guibas, Leonidas J. ;

Birdal, Tolga .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :5688-5699

← 1 2 3 4 →