Modeling social interaction and intention for pedestrian trajectory prediction

被引:13
作者
Chen, Kai [1 ]
Song, Xiao [1 ]
Ren, Xiaoxiang [2 ]
机构
[1] Beihang Univ BUAA, Beijing, Peoples R China
[2] Nanan Primary Sch, Nanan, Shanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Social-interaction; Pedestrian intention; Convolutional long-short-term memory; Encoder-decoder; Attention;
D O I
10.1016/j.physa.2021.125790
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Future pedestrian trajectory prediction offers great prospects for many practical applications. Most existing methods focus on social interaction among pedestrians but ignore the fact that in addition to pedestrians there are other kinds of objects (cars, dogs, bicycles, motorcycles, etc.) with a great influence on the subject pedestrian's future trajectory. Most existing methods neglect the intentions of the pedestrian, which can be obtained by the key points of the subject pedestrian's face. Therefore, rich category information about the subject pedestrian's surroundings and face key points plays a great role in promoting the modeling of pedestrian movement. Motivated by this idea, this paper tries to predict a pedestrian's future trajectory by jointly using various categories and the relative positions of the subject pedestrian's surroundings and the key points in his face. We propose a data modeling method to effectively unify rich visual features about categories, interaction and face key points into a multi-channel tensor and build an end-to-end fully convolutional encoder-decoder attention model based on convolutional long-short-term memory utilizing this tensor. We evaluate and compare our method with several existing methods on 5 crowded video sequences from the public dataset multi-object tracking (MOT)-16. Experimental results show that our method outperforms state-of-the-art approaches, with less prediction error. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 29 条
[1]  
Alaghb G, 2018, ARXIV PREPRINT ARXIV
[2]   Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].
Alahi, Alexandre ;
Goel, Kratarth ;
Ramanathan, Vignesh ;
Robicquet, Alexandre ;
Li Fei-Fei ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971
[3]   Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks [J].
Gupta, Agrim ;
Johnson, Justin ;
Li Fei-Fei ;
Savarese, Silvio ;
Alahi, Alexandre .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2255-2264
[4]   Simulating dynamical features of escape panic [J].
Helbing, D ;
Farkas, I ;
Vicsek, T .
NATURE, 2000, 407 (6803) :487-490
[5]  
Jaipuria N., 2018, ARXIV PREPRINT ARXIV
[6]   Modeling handicapped pedestrians considering physical characteristics using cellular automaton [J].
Kim, Jooyoung ;
Ahn, Chiwon ;
Lee, Seungjae .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 510 :507-517
[7]  
King DB, 2015, ACS SYM SER, V1214, P1
[8]  
Kitani KM, 2012, LECT NOTES COMPUT SC, V7575, P201, DOI 10.1007/978-3-642-33765-9_15
[9]   Crowds by example [J].
Lerner, Alon ;
Chrysanthou, Yiorgos ;
Lischinski, Dani .
COMPUTER GRAPHICS FORUM, 2007, 26 (03) :655-664
[10]   Peeking into the Future: Predicting Future Person Activities and Locations in Videos [J].
Liang, Junwei ;
Jiang, Lu ;
Niebles, Juan Carlos ;
Hauptmann, Alexander ;
Li Fei-Fei .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5718-5727