Pedestrian Trajectory Prediction Using RNN Encoder-Decoder with SpatioTemporal Attentions

被引:0
作者
Bhujel, Niraj [1 ]
Yau, Wei-Yun [2 ]
Teoh, Eam Khwang [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] ASTAR, Inst InfoComm Res, Singapore 138577, Singapore
来源
2019 IEEE 5TH INTERNATIONAL CONFERENCE ON MECHATRONICS SYSTEM AND ROBOTS (ICMSR 2019) | 2019年
关键词
Learning; LSTM; Attention Mechanism; Trajectory Prediction; Social Robot Navigation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pedestrian motion are inherently multi-modal in nature influenced by presence of other human and physical objects in the environment. Trajectory prediction models need to address both human -human and human-space interaction issues. In this work, we leverage both pedestrians information and scene information of the navigation environment for jointly predicting trajectories of the pedestrian. We introduce a new Recurrent Neural Network based sequence model with attention mechanisms that address both human-human and human-space interaction challenges. The encoder encodes all the pedestrian trajectories and create a social context. The scene information of navigation environment is extracted using CNN and serves as a physical context for the model. Our approach utilizes physical and social attention mechanism to find semantic alignments between encoder and decoder. The social attention mechanism allow the model to look into similar step of pedestrian trajectory. The physical attention mechanism tells the model where and what to focus on the scene. Experiment on several datasets shows that the proposed approach which combine social and physical attention performs better than when this information is utilized independently.
引用
收藏
页码:110 / 114
页数:5
相关论文
共 29 条
[1]   Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].
Alahi, Alexandre ;
Goel, Kratarth ;
Ramanathan, Vignesh ;
Robicquet, Alexandre ;
Li Fei-Fei ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971
[2]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[3]  
Becker S., 2018, arXiv
[4]  
Chen YF, 2017, IEEE INT C INT ROBOT, P1343, DOI 10.1109/IROS.2017.8202312
[5]  
Cho KYHY, 2014, Arxiv, DOI arXiv:1406.1078
[6]   Soft plus Hardwired attention: An LSTM framework for human trajectory prediction and abnormal event detection [J].
Fernando, Tharindu ;
Denman, Simon ;
Sridharan, Sridha ;
Fookes, Clinton .
NEURAL NETWORKS, 2018, 108 :466-478
[7]  
Ferrer G, 2014, IEEE INT C INT ROBOT, P1730, DOI 10.1109/IROS.2014.6942788
[8]  
Graves A, 2014, Arxiv, DOI arXiv:1308.0850
[9]  
Graves A, 2013, INT CONF ACOUST SPEE, P6645, DOI 10.1109/ICASSP.2013.6638947
[10]   Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks [J].
Gupta, Agrim ;
Johnson, Justin ;
Li Fei-Fei ;
Savarese, Silvio ;
Alahi, Alexandre .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2255-2264