Static-dynamic global graph representation for pedestrian trajectory prediction

被引:14
作者
Zhou, Hao [1 ,2 ]
Yang, Xu [2 ]
Fan, Mingyu [3 ]
Huang, Hai [1 ]
Ren, Dongchun [4 ]
Xia, Huaxia [4 ]
机构
[1] Harbin Engn Univ, Coll Shipbldg Engn, Harbin 150001, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[3] Donghua Univ, Inst Artificial Intelligence, Shanghai 200051, Peoples R China
[4] Intelligent Transportat Div, Beijing 100102, Peoples R China
基金
中国国家自然科学基金;
关键词
Trajectory prediction; Social interaction; Global graph representation; BEHAVIOR; MODEL;
D O I
10.1016/j.knosys.2023.110775
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effectively understanding social interactions among pedestrians plays a significant role in accurate pedestrian trajectory prediction. Previous works used either distance-based or data-driven methods to model interactions. However, the distance-based method has difficulty modeling complex interactions and ignores interactive pedestrians that are beyond a certain distance. The data-driven method models interactions among all pedestrians in a scene and introduces noninteractive pedestrians into the model due to the lack of proper supervision. To overcome these limitations, we first propose a novel global graph representation, which considers the spatial distance (from near to far) and the motion state (from static to dynamic), to explicitly model the social interactions among pedestrians. The global graph representation consists of two subgraphs: the static and the dynamic graph representations, where the static graph considers only the nearby pedestrians within a certain distance threshold, and the dynamic graph considers the interactive pedestrians that will likely collide soon. The proposed graph representation explicitly models the interaction by incorporating both the static (location) and dynamic states (velocity) in a distance-based manner. Then, based on the global graph representation, a novel data driven graph encoding network is proposed to extract the interaction features. It adopts two independent LSTMs and an attention module to encode the interaction feature from the perspective of the ego-pedestrian. Finally, the proposed prediction method is evaluated on two benchmark pedestrian trajectory prediction datasets, and comparisons are made with the state-of-the-arts. Experimental results demonstrate the effectiveness of the proposed method.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 54 条
[1]   Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].
Alahi, Alexandre ;
Goel, Kratarth ;
Ramanathan, Vignesh ;
Robicquet, Alexandre ;
Li Fei-Fei ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971
[2]  
Amirian J., 2020, P ASIAN C COMPUTER V
[3]  
Bansal M, 2019, ROBOTICS: SCIENCE AND SYSTEMS XV
[4]  
Bhattacharyya A, 2020, Arxiv, DOI arXiv:1908.09008
[5]  
Chai Yuning., 2020, CORL, P86
[6]  
Chung JY, 2014, Arxiv, DOI arXiv:1412.3555
[7]  
Deo N, 2021, Arxiv, DOI [arXiv:2001.00735, DOI 10.48550/ARXIV.2001.00735]
[8]   Convolutional Social Pooling for Vehicle Trajectory Prediction [J].
Deo, Nachiket ;
Trivedi, Mohan M. .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :1549-1557
[9]   VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation [J].
Gao, Jiyang ;
Sun, Chen ;
Zhao, Hang ;
Shen, Yi ;
Anguelov, Dragomir ;
Li, Congcong ;
Schmid, Cordelia .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11522-11530
[10]   Empirical Investigation on Pedestrian Crowd Dynamics and Grouping [J].
Gorrini, Andrea ;
Bandini, Stefania ;
Vizzari, Giuseppe .
TRAFFIC AND GRANULAR FLOW '13, 2015, :83-91