Pedestrian trajectory prediction model with social features and attention

被引:0
作者
Zhang Z. [1 ]
Diao Y. [1 ]
机构
[1] School of Computer Science and Technology, Civil Aviation University of China, Tianjin
来源
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University | 2020年 / 47卷 / 01期
关键词
Attention mechanism; Generative adversarial networks; Long short-term memory; Pedestrian interaction; Trajectory generation;
D O I
10.19665/j.issn1001-2400.2020.01.002
中图分类号
学科分类号
摘要
To address the problems that the pedestrian interaction feature of the Social GAN is simple and that it cannot make full use of the most of pedestrian interaction information, this paper proposes a pedestrian trajectory prediction model with social features and attention mechanism. This model adapts the structure of generative adversarial networks. The generator adapts an encoder-decoder model and the attention model is put between encoder and decoder. Three social features are set to enrich pedestrian interaction information which assists the attention module to make full use of the most of pedestrian interaction information by allocating the influence of pedestrians in the scene, so that the accuracy of the model is improved. Experimental results on multiple datasets show that the accuracy of this model in the pedestrian trajectory prediction task is increased by 15% compared with the previous pedestrian trajectory prediction model based on the pooling module. The improvement effect is most obvious in scenes with dense pedestrians and lots of non-straight tracks, with the accuracy increased by 34%. © 2020, The Editorial Board of Journal of Xidian University. All right reserved.
引用
收藏
页码:10 / 17and79
页数:1769
相关论文
共 17 条
[1]  
Luo Y., Cai P., Bera A., Et al., PORCA: Modeling and Planning for Autonomous Driving among Many Pedestrians, IEEE Robotics and Automation Letters, 3, 4, pp. 3418-3425, (2018)
[2]  
Cheng J., Cheng H., Meng M.Q.H., Et al., Autonomous Navigation by Mobile Robots in Human Environments: a Survey, Proceedings of the 2018 IEEE International Conference on Robotics and Biomimetics, pp. 1981-1986, (2018)
[3]  
Choi W., Savarese S., A Unified Framework for Multi-target Tracking and Collective Activity Recognition, Lecture Notes in Computer Science: 7575, pp. 215-230, (2012)
[4]  
Lipton Z.C., Berkowitz J., Elkan C., A Critical Review of Recurrent Neural Networks for Sequence Learning, Computer Science, (2015)
[5]  
Chung J., Kastner K., Dinh L., Et al., A Recurrent Latent Variable Model for Sequential Data, Advances in Neural Information Processing Systems 28: 2015, pp. 2980-2988, (2015)
[6]  
Liu J., Shahroudy A., Xu D., Et al., Spatio-temporal LSTM with Trust Gates for 3d Human Action Recognition, Lecture Notes in Computer Science: 9907, pp. 816-833, (2016)
[7]  
Alahi A., Goel K., Ramanathan V., Et al., Social LSTM: Human Trajectory Prediction in Crowded Spaces, Proceedings of the 2016 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 961-971, (2016)
[8]  
Vemula A., Muelling K., Oh J., Social Attention: Modeling Attention in Human Crowds, Proceedings of the 2018 IEEE International Conference on Robotics and Automation, pp. 4601-4607, (2018)
[9]  
Goodfellow I.J., Pouget-Abadie J., Mirza M., Et al., Generative Adversarial Nets, Advances in Neural Information Processing Systems 27: 3. Vancouver: Neural Information Processing Systems Foundation, pp. 2672-2680, (2014)
[10]  
Chen X., Shen H., Bian Q., Et al., Face Image Super-resolution with an Attention Mechanism, Journal of Xidian University, 46, 3, pp. 148-153, (2019)