Learning Autoencoder Diffusion Models of Pedestrian Group Relationships for Multimodal Trajectory Prediction

被引:6
作者
Lv, Kai [1 ]
Yuan, Liang [1 ,2 ]
Ni, Xiaoyu [3 ,4 ]
机构
[1] Xinjiang Univ, Sch Mech Engn, Urumqi 830046, Peoples R China
[2] Shanghai Jiao Tong Univ, ICCI, Shanghai 200240, Peoples R China
[3] Hebei Univ Architecture, Sch Mech Engn, Zhangjiakou 075031, Hebei, Peoples R China
[4] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing 100029, Peoples R China
基金
中国国家自然科学基金;
关键词
Trajectory; Pedestrians; Predictive models; Computational modeling; Task analysis; Decoding; Adaptation models; Diffusion model (DM); multimodal distribution; pedestrian groups; pedestrian trajectory prediction; ATTENTION;
D O I
10.1109/TIM.2024.3375973
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Pedestrian trajectory prediction is crucial for enabling dynamic obstacle avoidance in social robots. Variational autoencoders (VAEs) have shown potential in predicting multimodal distributions of future pedestrian trajectories. However, standards VAE struggle to generate accurate future trajectories, and existing prediction methods often overlook the relationships between pedestrian groups. This article introduces a novel prediction model, called the learning autoencoder diffusion model (LADM) of pedestrian group relationships for multimodal trajectory prediction, which takes into account pedestrian group relationships, enhancing the accuracy of multimodal distribution trajectory prediction. In the LADM framework, each pedestrian is assigned to their most probable group through a learning process, and the interaction relationships between pedestrians and groups are determined using a pedestrian-group interaction module (PGIM). To improve the quality of generated future trajectory distributions, we propose the autoencoder diffusion model (DM); the VAE functions as a generator and a DM acts as a refiner. We evaluate our proposed method on two public datasets (ETH and UCY) and compare it with state-of-the-art methods. Experimental results demonstrate that our approach outperforms existing methods in terms of average displacement error (ADE) and final displacement error (FDE) metrics.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 58 条
  • [1] Social LSTM: Human Trajectory Prediction in Crowded Spaces
    Alahi, Alexandre
    Goel, Kratarth
    Ramanathan, Vignesh
    Robicquet, Alexandre
    Li Fei-Fei
    Savarese, Silvio
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 961 - 971
  • [2] Social Ways: Learning Multi-Modal Distributions of Pedestrian Trajectories with GANs
    Amirian, Javad
    Hayet, Jean-Bernard
    Pettre, Julien
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2964 - 2972
  • [3] [Anonymous], 2019, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
  • [4] Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction
    Bae, Inhwan
    Park, Jin-Hwi
    Jeon, Hae-Gon
    [J]. COMPUTER VISION, ECCV 2022, PT XXII, 2022, 13682 : 270 - 289
  • [5] CrowdGAN: Identity-Free Interactive Crowd Video Generation and Beyond
    Chai, Liangyu
    Liu, Yongtuo
    Liu, Wenxi
    Han, Guoqiang
    He, Shengfeng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 2856 - 2871
  • [6] Chaofan Tao, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12366), P547, DOI 10.1007/978-3-030-58589-1_33
  • [7] Vehicle Trajectory Prediction Based on Intention-Aware Non-Autoregressive Transformer With Multi-Attention Learning for Internet of Vehicles
    Chen, Xiaobo
    Zhang, Huanjia
    Zhao, Feng
    Cai, Yingfeng
    Wang, Hai
    Ye, Qiaolin
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [8] Chorowski J, 2014, Arxiv, DOI arXiv:1412.1602
  • [9] Cunjun Yu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12357), P507, DOI 10.1007/978-3-030-58610-2_30
  • [10] Generative Adversarial Networks
    Goodfellow, Ian
    Pouget-Abadie, Jean
    Mirza, Mehdi
    Xu, Bing
    Warde-Farley, David
    Ozair, Sherjil
    Courville, Aaron
    Bengio, Yoshua
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144