Leapfrog Diffusion Model for Stochastic Trajectory Prediction

被引：86

作者：

Mao, Weibo ^{[1
]}

Xu, Chenxin ^{[1
]}

Zhu, Qi ^{[1
]}

Chen, Siheng ^{[1
,2
]}

Wang, Yanfeng ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

[2] Shanghai AI Lab, Shanghai, Peoples R China

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年

基金：

中国国家自然科学基金;

关键词：

NEURAL-NETWORKS;

D O I：

10.1109/CVPR52729.2023.00534

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To model the indeterminacy of human behaviors, stochastic trajectory prediction requires a sophisticated multi-modal distribution of future trajectories. Emerging diffusion models have revealed their tremendous representation capacities in numerous generation tasks, showing potential for stochastic trajectory prediction. However, expensive time consumption prevents diffusion models from real-time prediction, since a large number of denoising steps are required to assure sufficient representation ability. To resolve the dilemma, we present LEapfrog Diffusion model (LED), a novel diffusion-based trajectory prediction model, which provides real-time, precise, and diverse predictions. The core of the proposed LED is to leverage a trainable leapfrog initializer to directly learn an expressive multi-modal distribution of future trajectories, which skips a large number of denoising steps, significantly accelerating inference speed. Moreover, the leapfrog initializer is trained to appropriately allocate correlated samples to provide a diversity of predicted future trajectories, significantly improving prediction performances. Extensive experiments on four real-world datasets, including NBA/NFL/SDD/ETH-UCY, show that LED consistently improves performance and achieves 23.7%/21.9% ADE/FDE improvement on NFL. The proposed LED also speeds up the inference 19.3/30.8/24.3/25.1 times compared to the standard diffusion model on NBA/NFL/SDD/ETH-UCY, satisfying real-time inference needs. Code is available at https: //github.com/MediaBrain-SJTU/LED.

引用

页码：5517 / 5526

页数：10

共 53 条

[1] Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].

Alahi, Alexandre ;

Goel, Kratarth ;

Ramanathan, Vignesh ;

Robicquet, Alexandre ;

Li Fei-Fei ;

Savarese, Silvio .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971

[2]

[Anonymous], 2021, ADV NEURAL INFORM PR, DOI DOI 10.1080/20477724.2021.1951556

[3] Non-Probability Sampling Network for Stochastic Human Trajectory Prediction [J].

Bae, Inhwan ;

Park, Jin-Hwi ;

Jeon, Hae-Gon .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :6467-6477

[4]

Bengio Y., 2014, NIPS 2014 WORKSH DEE, DOI DOI 10.48550/ARXIV.1412.3555

[5]

Bhattacharyya S, 2019, DE GR FRONT COMPU IN, V2, P1, DOI 10.1515/9783110552072

[6]

Chen Nanxin, 2020, INT C LEARN REPR

[7] Towards Efficient Human-Robot Collaboration With Robust Plan Recognition and Trajectory Prediction [J].

Cheng, Yujiao ;

Sun, Liting ;

Liu, Changliu ;

Tomizuka, Masayoshi .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :2602-2609

[8] MG-GAN: A Multi-Generator Model Preventing Out-of-Distribution Samples in Pedestrian Trajectory Prediction [J].

Dendorfer, Patrick ;

Elflein, Sven ;

Leal-Taixe, Laura .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :13138-13147

[9] TPNet: Trajectory Proposal Network for Motion Prediction [J].

Fang, Liangji ;

Jiang, Qinhong ;

Shi, Jianping ;

Zhou, Bolei .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6796-6805

[10] Science, technology and the future of small autonomous drones [J].

Floreano, Dario ;

Wood, Robert J. .

NATURE, 2015, 521 (7553) :460-466

← 1 2 3 4 5 6 →