Learning Autoencoder Diffusion Models of Pedestrian Group Relationships for Multimodal Trajectory Prediction

被引：6

作者：

Lv, Kai ^{[1
]}

Yuan, Liang ^{[1
,2
]}

Ni, Xiaoyu ^{[3
,4
]}

机构：

[1] Xinjiang Univ, Sch Mech Engn, Urumqi 830046, Peoples R China

[2] Shanghai Jiao Tong Univ, ICCI, Shanghai 200240, Peoples R China

[3] Hebei Univ Architecture, Sch Mech Engn, Zhangjiakou 075031, Hebei, Peoples R China

[4] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing 100029, Peoples R China

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2024年 / 73卷

基金：

中国国家自然科学基金;

关键词：

Trajectory; Pedestrians; Predictive models; Computational modeling; Task analysis; Decoding; Adaptation models; Diffusion model (DM); multimodal distribution; pedestrian groups; pedestrian trajectory prediction; ATTENTION;

D O I：

10.1109/TIM.2024.3375973

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Pedestrian trajectory prediction is crucial for enabling dynamic obstacle avoidance in social robots. Variational autoencoders (VAEs) have shown potential in predicting multimodal distributions of future pedestrian trajectories. However, standards VAE struggle to generate accurate future trajectories, and existing prediction methods often overlook the relationships between pedestrian groups. This article introduces a novel prediction model, called the learning autoencoder diffusion model (LADM) of pedestrian group relationships for multimodal trajectory prediction, which takes into account pedestrian group relationships, enhancing the accuracy of multimodal distribution trajectory prediction. In the LADM framework, each pedestrian is assigned to their most probable group through a learning process, and the interaction relationships between pedestrians and groups are determined using a pedestrian-group interaction module (PGIM). To improve the quality of generated future trajectory distributions, we propose the autoencoder diffusion model (DM); the VAE functions as a generator and a DM acts as a refiner. We evaluate our proposed method on two public datasets (ETH and UCY) and compare it with state-of-the-art methods. Experimental results demonstrate that our approach outperforms existing methods in terms of average displacement error (ADE) and final displacement error (FDE) metrics.

引用

页码：1 / 12

页数：12

共 58 条

[1] Social LSTM: Human Trajectory Prediction in Crowded Spaces
Alahi, Alexandre
Goel, Kratarth
Ramanathan, Vignesh
Robicquet, Alexandre
Li Fei-Fei
Savarese, Silvio
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 961 - 971
[2] Social Ways: Learning Multi-Modal Distributions of Pedestrian Trajectories with GANs
Amirian, Javad
Hayet, Jean-Bernard
Pettre, Julien
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2964 - 2972
[3] [Anonymous], 2019, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
[4] Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction
Bae, Inhwan
Park, Jin-Hwi
Jeon, Hae-Gon
[J]. COMPUTER VISION, ECCV 2022, PT XXII, 2022, 13682 : 270 - 289
[5] CrowdGAN: Identity-Free Interactive Crowd Video Generation and Beyond
Chai, Liangyu
Liu, Yongtuo
Liu, Wenxi
Han, Guoqiang
He, Shengfeng
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 2856 - 2871
[6] Chaofan Tao, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12366), P547, DOI 10.1007/978-3-030-58589-1_33
[7] Vehicle Trajectory Prediction Based on Intention-Aware Non-Autoregressive Transformer With Multi-Attention Learning for Internet of Vehicles
Chen, Xiaobo
Zhang, Huanjia
Zhao, Feng
Cai, Yingfeng
Wang, Hai
Ye, Qiaolin
[J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[8] Chorowski J, 2014, Arxiv, DOI arXiv:1412.1602
[9] Cunjun Yu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12357), P507, DOI 10.1007/978-3-030-58610-2_30
[10] Generative Adversarial Networks
Goodfellow, Ian
Pouget-Abadie, Jean
Mirza, Mehdi
Xu, Bing
Warde-Farley, David
Ozair, Sherjil
Courville, Aaron
Bengio, Yoshua
[J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144

← 1 2 3 4 5 6 →