SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow

被引:20
作者
Lang, Itai [1 ,2 ]
Aiger, Dror [2 ]
Cole, Forrester [2 ]
Avidan, Shai [1 ]
Rubinstein, Michael [2 ]
机构
[1] Tel Aviv Univ, Tel Aviv, Israel
[2] Google Res, Mountain View, CA USA
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年
关键词
D O I
10.1109/CVPR52729.2023.00511
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene flow estimation is a long-standing problem in computer vision, where the goal is to find the 3D motion of a scene from its consecutive observations. Recently, there have been efforts to compute the scene flow from 3D point clouds. A common approach is to train a regression model that consumes source and target point clouds and outputs the per-point translation vector. An alternative is to learn point matches between the point clouds concurrently with regressing a refinement of the initial correspondence flow. In both cases, the learning task is very challenging since the flow regression is done in the free 3D space, and a typical solution is to resort to a large annotated synthetic dataset. We introduce SCOOP, a new method for scene flow estimation that can be learned on a small amount of data without employing ground-truth flow supervision. In contrast to previous work, we train a pure correspondence model focused on learning point feature representation and initialize the flow as the difference between a source point and its softly corresponding target point. Then, in the run-time phase, we directly optimize a flow refinement component with a self-supervised objective, which leads to a coherent and accurate flow field between the point clouds. Experiments on widespread datasets demonstrate the performance gains achieved by our method compared to existing leading techniques while using a fraction of the training data. Our code is publicly available(1).
引用
收藏
页码:5281 / 5290
页数:10
相关论文
共 37 条
[1]  
[Anonymous], 2022, P EUR C COMP VIS ECC, DOI DOI 10.1109/WCSP55476.2022.10039178
[2]  
[Anonymous], 2008, P 7 IEEE INT C COMP
[3]   Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation [J].
Brox, Thomas ;
Malik, Jitendra .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :500-513
[4]   nuScenes: A multimodal dataset for autonomous driving [J].
Caesar, Holger ;
Bankiti, Varun ;
Lang, Alex H. ;
Vora, Sourabh ;
Liong, Venice Erin ;
Xu, Qiang ;
Krishnan, Anush ;
Pan, Yu ;
Baldan, Giancarlo ;
Beijbom, Oscar .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628
[5]  
Chang A X, 2015, COMPUTER SCI, V1512, P3
[6]   Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow Estimation [J].
Cheng, Wencan ;
Ko, Jong Hwan .
COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 :108-124
[7]   SCALING ALGORITHMS FOR UNBALANCED OPTIMAL TRANSPORT PROBLEMS [J].
Chizat, Lenaic ;
Peyre, Gabriel ;
Schmitzer, Bernhard ;
Vialard, Francois-Xavier .
MATHEMATICS OF COMPUTATION, 2018, 87 (314) :2563-2609
[8]  
Cuturi M., 2013, NEURIPS, P1, DOI 10.48550/arXiv.1306.0895
[9]  
Deprelle Theo, 2019, Advances in Neural Information Processing Systems (NeurIPS), P3
[10]   Weakly Supervised Learning of Rigid 3D Scene Flow [J].
Gojcic, Zan ;
Litany, Or ;
Wieser, Andreas ;
Guibas, Leonidas J. ;
Birdal, Tolga .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :5688-5699