Social Self-Attention Generative Adversarial Networks for Human Trajectory Prediction

被引:8
作者
Yang C. [1 ]
Pan H. [1 ]
Sun W. [1 ]
Gao H. [1 ]
机构
[1] Harbin Institute of Technology, Research Institute of Intelligent Control and Systems, Harbin
来源
IEEE Transactions on Artificial Intelligence | 2024年 / 5卷 / 04期
基金
中国国家自然科学基金;
关键词
Generative adversarial networks (GANs); self-attention; social interactions; trajectory prediction;
D O I
10.1109/TAI.2023.3299899
中图分类号
学科分类号
摘要
Predicting accurate human future trajectories is of critical importance for self-driving vehicles if they are to navigate complex scenarios. Trajectories of humans are not only dependent on the humans themselves, but also the interactions with surrounding agents. Previous works mainly model interactions among agents by using a diversity of polymerization methods that integrate various learned agent states hit or miss. In this article, we propose social self-attention generative adversarial networks (Social SAGAN), which generate socially acceptable multimodal trajectory predictions. Social SAGAN incorporates a generator that predicts future trajectories of pedestrians, a discriminator that classifies trajectory predictions as real or fake, and a social self-attention mechanism that selectively refines the most interactive information and helps the overall model to capture what to pay attention to. Through extensive experiments, we demonstrate that our model achieves competitive prediction accuracy and computational complexity compared with previous state-of-the-art methods on all trajectory forecasting benchmarks. © 2020 IEEE.
引用
收藏
页码:1805 / 1815
页数:10
相关论文
共 48 条
[1]  
Hirakawa T., Yamashita T., Tamaki T., Fujiyoshi H., Survey on visionbased path prediction, Proc. Int. Conf. Distributed, Ambient, Pervasive Interact., pp. 48-64, (2018)
[2]  
Bagautdinov T., Alahi A., Fleuret F., Fua P., Savarese S., Social scene understanding: End-To-end multi-person action localization and collective activity recognition, Proc. Ieee Conf. Comput. Vis. Pattern Recognit., pp. 3425-3434, (2017)
[3]  
Chen C., Liu Y., Kreiss S., Alahi A., Crowd-robot interaction: Crowdaware robot navigationwith attention-based deep reinforcement learning, Proc. Int. Conf. Robot. Automat., pp. 6015-6022, (2019)
[4]  
Pan H., Hong Y., Sun W., Jia Y., Deep dual-resolution networks for real-Time and accurate semantic segmentation of traffic scenes, Ieee Trans. Intell. Transp. Syst., 24, 3, pp. 3448-3460, (2023)
[5]  
Tay M.K.C., Laugier C., Modelling smooth paths using Gaussian processes, Proc. Field Serv. Robot.: Results 6th Int. Conf., pp. 381-390, (2008)
[6]  
Pellegrini S., Ess A., Gool L.V., Improving data association by joint modeling of pedestrian trajectories and groupings, Proc. Eur. Conf. Comput. Vis., pp. 452-465, (2010)
[7]  
Pan H., Zhang D., Sun W., Yu X., Event-Triggered adaptive asymptotic tracking control of uncertain MIMO nonlinear systems with actuator faults, Ieee Trans. Cybern., 52, 9, pp. 8655-8667, (2022)
[8]  
Alahi A., Ramanathan V., Fei-Fei L., Socially-Aware large-scale crowd forecasting, Proc. Ieee Conf. Comput. Vis. Pattern Recognit., pp. 2211-2218, (2014)
[9]  
Wang D., Yang K., Liu L., Wang H., An incremental learning model for mobile robot: From short-Term memory to long-Term memory, Ieee Trans. Artif. Intell., 3, 5, pp. 798-808, (2022)
[10]  
Kant R., Saini P., Kumari J., Long-short term memory auto-encoder based position prediction model for fixed-wing UAV during communication failure, Ieee Trans. Artif. Intell., 4, 1, pp. 173-181, (2023)