Non-Autoregressive Sparse Transformer Networks for Pedestrian Trajectory Prediction

被引:6
作者
Liu, Di [1 ,2 ]
Li, Qiang [1 ]
Li, Sen [1 ]
Kong, Jun [1 ]
Qi, Miao [3 ]
机构
[1] Northeast Normal Univ, Coll Informat Sci & Technol, Changchun 130117, Peoples R China
[2] Northeast Elect Power Univ, Sch Comp Sci, Jilin 132012, Peoples R China
[3] Northeast Normal Univ, Key Lab Appl Stat MOE, Changchun 130024, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 05期
基金
中国国家自然科学基金;
关键词
pedestrian trajectory prediction; sparse transformer; non-autoregressive; MODELS;
D O I
10.3390/app13053296
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Pedestrian trajectory prediction is an important task in practical applications such as automatic driving and surveillance systems. It is challenging to effectively model social interactions among pedestrians and capture temporal dependencies. Previous methods typically emphasized social interactions among pedestrians but ignored the temporal consistency of predictions and suffered from superfluous interactions by dense undirected graphs, resulting in a considerable deviance from reality. In addition, autoregressive approaches predicted future locations conditioning on previous predictions one by one, which would lead to error accumulation and time consuming. To address these issues, we present Non-autoregressive Sparse Transformer (NaST) networks for pedestrian trajectory prediction. Specifically, NaST models sparse spatial interactions and sparse temporal dependency via a sparse spatial transformer and a sparse temporal transformer separately. Different from previous predictions such as RNN-based approaches, the transformer decoder works in non-autoregressive pattern and predicts all the future locations at one time from a query sequence, which could avoid the error accumulation and be less computationally intensive. We evaluate our proposed method on the ETH and UCY datasets, and the experimental results show our method outperforms comparative state-of-the-art methods.
引用
收藏
页数:19
相关论文
共 66 条
[1]  
Achaji L., 2022, ARXIV
[2]   Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].
Alahi, Alexandre ;
Goel, Kratarth ;
Ramanathan, Vignesh ;
Robicquet, Alexandre ;
Li Fei-Fei ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971
[3]  
[Anonymous], 2020, ARXIV
[4]   Discrete choice models of pedestrian walking behavior [J].
Antonini, Gianluca ;
Bierlaire, Michel ;
Weber, Mats .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2006, 40 (08) :667-687
[5]  
Awad G., 2018, P TREC VIDEO RETRIEV
[6]  
Bai HY, 2015, IEEE INT CONF ROBOT, P454, DOI 10.1109/ICRA.2015.7139219
[7]  
Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[8]  
Chen G, 2021, P IEEE CVF INT C COM, P15580
[9]  
Chen N., 2021, INT C LEARN REPR ICL
[10]  
Chung JY, 2015, PR MACH LEARN RES, V37, P2067