Temporal LiDAR Frame Prediction for Autonomous Driving

Cited by: 13

Authors:

Deng, David [1]

Zakhor, Avideh [1]

Affiliations:

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

Source:

2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020) | 2020

Keywords:

DOI:

10.1109/3DV50981.2020.00093

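Chinese Library Classification (CLC) codes:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract: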

Anticipating the future in a dynamic scene is critical for many fields such as autonomous driving and robotics. In this paper, we propose a class of novel neural network architectures to predict future LiDAR frames given previous ones. Since the ground truth in this application is simply the next frame in the sequence, we can train our models in a self-supervised fashion. Our proposed architectures are based on FlowNet3D and Dynamic Graph CNN. We use Chamfer Distance (CD) and Earth Mover's Distance (EMD) as loss functions and evaluation metrics. We train and evaluate our models using the newly released nuScenes dataset, and characterize their performance and complexity with several baselines. Compared to directly using FlowNet3D, our proposed architectures achieve CD and EMD nearly an order of magnitude lower. In addition, we show that our predictions generate reasonable scene flow approximations without using any labelled supervision.
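The two metrics named in the abstract can be illustrated with a small sketch. The definitions below are common textbook conventions and are assumptions here, not necessarily the exact formulation used in the paper: Chamfer Distance as the sum of the two mean squared nearest-neighbour distances, and Earth Mover's Distance as the mean Euclidean distance under the best one-to-one matching between two equally sized point sets. The function names and the toy point sets are hypothetical, and the brute-force matching is only practical for a handful of points.

```python
import math
from itertools import permutations


def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def chamfer_distance(A, B):
    """Symmetric Chamfer Distance (assumed convention: mean of squared
    nearest-neighbour distances in each direction, then summed)."""
    a_to_b = sum(min(sq_dist(p, q) for q in B) for p in A) / len(A)
    b_to_a = sum(min(sq_dist(q, p) for p in A) for q in B) / len(B)
    return a_to_b + b_to_a


def earth_movers_distance(A, B):
    """EMD for equally sized sets (assumed convention: mean Euclidean
    distance under the optimal bijection). Brute force over all
    permutations, so only suitable for very small illustrative inputs."""
    if len(A) != len(B):
        raise ValueError("this sketch requires point sets of equal size")
    n = len(A)
    best = min(
        sum(math.dist(A[i], B[j]) for i, j in enumerate(perm))
        for perm in permutations(range(n))
    )
    return best / n


# Toy example: a "predicted" frame shifted by one unit along z
# relative to the "ground-truth" next frame.
predicted = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
target = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]

cd = chamfer_distance(predicted, target)
emd = earth_movers_distance(predicted, target)
print(cd, emd)
```

Real LiDAR frames contain tens of thousands of points, so practical implementations would rely on vectorised nearest-neighbour search for CD and an approximate or assignment-based solver for EMD rather than the exhaustive search used in this sketch.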

References

Pages: 829-837

Page count: 9

28 entries in total

[1] Achlioptas P., 2017, Learning representations and generative models for 3D point clouds

[2] [Anonymous], 2017, SGDR STOCHASTIC GRAD

[3] Bai S., 2018, INT C LEARN REPR

[4] Bengio, Yoshua. Learning Deep Architectures for AI. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01): 1-127

[5] Bertsekas D. P., 1985, Proceedings of the 24th IEEE Conference on Decision and Control (Cat. No.85CH2245-9), P1703

[6] Caesar, Holger; Bankiti, Varun; Lang, Alex H.; Vora, Sourabh; Liong, Venice Erin; Xu, Qiang; Krishnan, Anush; Pan, Yu; Baldan, Giancarlo; Beijbom, Oscar. nuScenes: A multimodal dataset for autonomous driving. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 11618-11628

[7] Dai, Angela; Qi, Charles Ruizhongtai; Niessner, Matthias. Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 6545-6554

[8] El Sallab A., 2018, NEURAL INFORM PROCES

[9] Fan, Haoqiang; Su, Hao; Guibas, Leonidas. A Point Set Generation Network for 3D Object Reconstruction from a Single Image. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 2463-2471

[10] Fan Hehe, 2019, P IEEE CVF C COMP VI
