SLIM: Self-Supervised LiDAR Scene Flow and Motion Segmentation

Cited by: 58

Authors:

Baur, Stefan Andreas [1,2]

Emmerichs, David Josef [1]

Moosmann, Frank [1]

Pinggera, Peter [1]

Ommer, Bjoern [4,5]

Geiger, Andreas [2,3]

Affiliations:

[1] Mercedes Benz AG, Stuttgart, Germany

[2] MPI IS, Tubingen, Germany

[3] Univ Tubingen, Tubingen, Germany

[4] Ludwig Maximilian Univ Munich, Munich, Germany

[5] Heidelberg Univ, Heidelberg, Germany

Source:

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021

DOI:

10.1109/ICCV48922.2021.01288

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Recently, several frameworks for self-supervised learning of 3D scene flow on point clouds have emerged. Scene flow inherently separates every scene into multiple moving agents and a large class of points following a single rigid sensor motion. However, existing methods do not leverage this property of the data in their self-supervised training routines which could improve and stabilize flow predictions. Based on the discrepancy between a robust rigid ego-motion estimate and a raw flow prediction, we generate a self-supervised motion segmentation signal. The predicted motion segmentation, in turn, is used by our algorithm to attend to stationary points for aggregation of motion information in static parts of the scene. We learn our model end-to-end by backpropagating gradients through Kabsch's algorithm and demonstrate that this leads to accurate ego-motion which in turn improves the scene flow estimate. Using our method, we show state-of-the-art results across multiple scene flow metrics for different real-world datasets, showcasing the robustness and generalizability of this approach. We further analyze the performance gain when performing joint motion segmentation and scene flow in an ablation study. We also present a novel network architecture for 3D LiDAR scene flow which is capable of handling an order of magnitude more points during training than previously possible.
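The abstract's central idea, comparing a rigid ego-motion fit against the raw flow prediction to obtain a motion segmentation signal, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `weighted_kabsch` and `motion_labels`, the per-point weights, and the 0.1 threshold are all assumptions made for illustration, and the differentiable backpropagation through Kabsch's algorithm described in the abstract is not shown.

```python
import numpy as np


def weighted_kabsch(src, dst, weights):
    """Weighted least-squares rigid fit (R, t) such that dst ≈ src @ R.T + t.

    Hypothetical helper: `weights` stands in for a per-point confidence
    that a point is stationary (an assumption, not taken from the paper).
    """
    w = weights / weights.sum()
    src_mean = w @ src
    dst_mean = w @ dst
    src_c = src - src_mean
    dst_c = dst - dst_mean
    # 3x3 weighted cross-covariance between the centered point sets.
    H = (src_c * w[:, None]).T @ dst_c
    U, _, Vt = np.linalg.svd(H)
    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_mean - R @ src_mean
    return R, t


def motion_labels(points, raw_flow, R, t, threshold=0.1):
    """Mark a point as moving when its raw flow deviates from the rigid flow.

    The threshold value is an illustrative assumption.
    """
    rigid_flow = points @ R.T + t - points
    residual = np.linalg.norm(raw_flow - rigid_flow, axis=1)
    return residual > threshold
```

In this sketch, points whose raw flow agrees with the flow induced by the fitted rigid transform are treated as static, and the rest as moving. Feeding the resulting labels back in as weights for the next rigid fit loosely mirrors the interplay between segmentation and ego-motion that the abstract describes.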

Pages: 13106-13116

Page count: 11

References: 59 in total

[11] Digging Into Self-Supervised Monocular Depth Estimation [J]. Godard, Clement; Mac Aodha, Oisin; Firman, Michael; Brostow, Gabriel. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 3827-3837

[12] Gojcic Zan, 2021, CORR, P2021

[13] HPLFlowNet: Hierarchical Permutohedral Lattice FlowNet for Scene Flow Estimation on Large-scale Point Clouds [J]. Gu, Xiuye; Wang, Yijie; Wu, Chongruo; Lee, Yong Jae; Wang, Panqu. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 3249-3258

[14] Guibas Leonidas J., 2020, LECT NOTES COMPUTER, V12348

[15] Unsupervised Multi-Task Feature Learning on Point Clouds [J]. Hassani, Kaveh; Haley, Mike. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 8159-8170

[16] A variational method for scene flow estimation from stereo sequences [J]. Huguet, Frederic; Devernay, Frederic. 2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007: 1342-1348

[17] Self-Supervised Monocular Scene Flow Estimation [J]. Hur, Junhwa; Roth, Stefan. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 7394-7403

[18] Jaimez M, 2015, IEEE INT CONF ROBOT, P98, DOI 10.1109/ICRA.2015.7138986

[19] Kabsch Wolfgang, 1976, THEORETICAL GEN CRYS, V32, P4

[20] Kittenplon Yair, 2020, CORR
