Uncertainty-aware pedestrian trajectory prediction via distributional diffusion

被引:7
作者
Liu, Yao [1 ,2 ,3 ]
Ye, Zesheng [2 ]
Wang, Rui [4 ]
Li, Binghao [5 ]
Sheng, Quan Z. [5 ]
Yao, Lina [6 ]
机构
[1] Macquarie Univ, Sch Comp, Sydney, Australia
[2] Univ New South Wales, Sch Comp Sci & Engn, Sydney, Australia
[3] Northeastern Univ, Coll Comp Sci & Engn, Shenyang, Peoples R China
[4] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China
[5] Univ New South Wales, Sch Minerals & Energy Resources Engn, Sydney, Australia
[6] CSIRO, Data 61, Sydney, Australia
基金
澳大利亚研究理事会;
关键词
Pedestrian trajectory; Sufficient statistics; Diffusion model; Uncertainty;
D O I
10.1016/j.knosys.2024.111862
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tremendous efforts have been put forth on predicting pedestrian trajectory with generative models to accommodate uncertainty and multi-modality in human behaviors. An individual's inherent uncertainty, e.g., change of destination, can be masked by complex patterns resulting from the movements of interacting pedestrians. However, latent variable-based generative models often entangle such uncertainty with complexity, leading to limited either latent expressivity or predictive diversity. In this work, we propose to separately model these two factors by implicitly deriving a flexible latent representation to capture intricate pedestrian movements, while integrating predictive uncertainty of individuals with explicit bivariate Gaussian mixture densities over their future locations. More specifically, we present a model-agnostic uncertainty-aware pedestrian trajectory prediction framework, parameterizing sufficient statistics for the mixture of Gaussians that jointly comprise the multi-modal trajectories. We further estimate these parameters of interest by approximating a denoising process that progressively recovers pedestrian movements from noise. Unlike previous studies, we translate the predictive stochasticity to explicit distributions, allowing it to readily generate plausible future trajectories indicating individuals' self-uncertainty. Moreover, our framework is compatible with different neural net architectures. We empirically show the performance gains over state-of-the-art even with lighter backbones, across most scenes on two public benchmarks.
引用
收藏
页数:13
相关论文
共 49 条
[11]   SOCIAL FORCE MODEL FOR PEDESTRIAN DYNAMICS [J].
HELBING, D ;
MOLNAR, P .
PHYSICAL REVIEW E, 1995, 51 (05) :4282-4286
[12]  
Ho J, 2020, P 34 INT C NEUR INF, P6840
[13]  
Ho JAT, 2022, Arxiv, DOI [arXiv:2207.12598, 10.48550/arXiv.2207.12598]
[14]   STGAT: Modeling Spatial-Temporal Interactions for Human Trajectory Prediction [J].
Huang, Yingfan ;
Bi, HuiKun ;
Li, Zhaoxin ;
Mao, Tianlu ;
Wang, Zhaoqi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6281-6290
[15]  
Gulrajani I, 2017, ADV NEUR IN, V30
[16]   The Trajectron: Probabilistic Multi-Agent Trajectory Modeling With Dynamic Spatiotemporal Graphs [J].
Ivanovic, Boris ;
Pavone, Marco .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2375-2384
[17]   MSG-GAN: Multi-Scale Gradients for Generative Adversarial Networks [J].
Karnewar, Animesh ;
Wang, Oliver .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :7796-7805
[18]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001
[19]  
Kodali N, 2017, Arxiv, DOI [arXiv:1705.07215, 10.48550/arXiv.1705.07215, DOI 10.48550/ARXIV.1705.07215]
[20]  
Kosaraju V, 2019, ADV NEUR IN, V32