Social Transformer: A Pedestrian Trajectory Prediction Method based on Social Feature Processing Using Transformer

被引:5
作者
Wen, Fan [1 ]
Li, Ming [1 ]
Wang, Ruiyang [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
来源
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022年
关键词
Trajectory prediction; Self-attention; Transformer; Self-driving;
D O I
10.1109/IJCNN55064.2022.9891949
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In pedestrian trajectory prediction, the prediction accuracy depends largely on the consideration of the impact of social relations on the prediction object. Social pooling and graph neural networks (GNN) are two traditional social feature processing methods, they process sparse and nonuniform social features into more intensive and uniform information. In this paper, the Social Transformer Network (STNet) was proposed based on the GNN, which is a graph attention network. After a conditional variational auto-encoder (CVAE)-based preprocessing network provided a destination prediction, a transformer network was used to process the social feature data of the past trajectory and destination information. The transformer network was based on the self-attention mechanism, and it can assign different attention weights to different social features so that more attention is paid to the social relations with greater impacts on the pedestrian's trajectory. In this paper, STNet was tested on the ETH/UCY datasets. The results showed that average displacement error (ADE) was reduced by 17.2% and final displacement error (FDE) was reduced by 14.6%, indicating that the STNet improved the prediction accuracy.
引用
收藏
页数:7
相关论文
共 14 条
[1]   Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].
Alahi, Alexandre ;
Goel, Kratarth ;
Ramanathan, Vignesh ;
Robicquet, Alexandre ;
Li Fei-Fei ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971
[2]   Social Ways: Learning Multi-Modal Distributions of Pedestrian Trajectories with GANs [J].
Amirian, Javad ;
Hayet, Jean-Bernard ;
Pettre, Julien .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :2964-2972
[3]   Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction [J].
Yu, Cunjun ;
Ma, Xiao ;
Ren, Jiawei ;
Zhao, Haiyu ;
Yi, Shuai .
COMPUTER VISION - ECCV 2020, PT XII, 2020, 12357 :507-523
[4]   Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks [J].
Gupta, Agrim ;
Johnson, Justin ;
Li Fei-Fei ;
Savarese, Silvio ;
Alahi, Alexandre .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2255-2264
[5]  
Hamilton W.L., 2017, IEEE Data Eng. Bull.
[6]   Crowds by example [J].
Lerner, Alon ;
Chrysanthou, Yiorgos ;
Lischinski, Dani .
COMPUTER GRAPHICS FORUM, 2007, 26 (03) :655-664
[7]  
Li X, 2019, IEEE INT C INTELL TR, P3960, DOI 10.1109/ITSC.2019.8917228
[8]   Peeking into the Future: Predicting Future Person Activities and Locations in Videos [J].
Liang, Junwei ;
Jiang, Lu ;
Niebles, Juan Carlos ;
Hauptmann, Alexander ;
Li Fei-Fei .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5718-5727
[9]  
Mangalam, 2020, IT IS NOT JOURNEY DE
[10]   Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction [J].
Mohamed, Abduallah ;
Qian, Kun ;
Elhoseiny, Mohamed ;
Claudel, Christian .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :14412-14420