MGFormer: a multi-information-based GRU-transformer network for pedestrian trajectory prediction

被引:0
作者
Liu, Quankai [1 ]
Sang, Haifeng [1 ]
Wang, Jinyu [1 ]
Chen, Wangxing [1 ]
机构
[1] Shenyang Univ Technol, Sch Informat Sci & Engn, 111 Shenliao West Rd,Shenyang Econ & Technol Dev Z, Shenyang 110870, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
Pedestrian trajectory prediction; Multi-information fusion; Attention mechanism; Information evaluation;
D O I
10.1007/s11227-025-07304-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Pedestrian trajectory prediction plays a crucial role in enabling fully autonomous driving in urban environments, this paper proposes a GRU-Transformer network with multi-information integration (MGFormer) to predict pedestrian trajectories from a driving perspective. Firstly, MGFormer integrates pose and optical flow information with observed trajectory information. It uses skeleton sequence reorganization and local optical flow division techniques to reduce information distortion caused by occlusion. Subsequently, a cross-information fusion attention mechanism based on information evaluation is introduced. This mechanism comprehensively considers the importance of various information sources and the relevance of different features within each information source. Finally, MGFormer utilizes an innovative trajectory decoding network that combines GRU and Transformer models to enhance the effectiveness of decoding and predicting fused features. On the JAAD and PIE datasets, this method outperforms other approaches in predicting the displacement error within a 1.5 second, while also significantly reducing inference time compared to the latest model.
引用
收藏
页数:25
相关论文
共 47 条
[1]   Multi-scale pedestrian intent prediction using 3D joint information as spatio-temporal representation [J].
Ahmed, Sarfraz ;
Al Bazi, Ammar ;
Saha, Chitta ;
Rajbhandari, Sujan ;
Huda, M. Nazmul .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 225
[2]   Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].
Alahi, Alexandre ;
Goel, Kratarth ;
Ramanathan, Vignesh ;
Robicquet, Alexandre ;
Li Fei-Fei ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971
[3]   Predicting Vulnerable Road User Behavior With Transformer-Based Gumbel Distribution Networks [J].
Astuti, Lia ;
Lin, Yu-Chen ;
Chiu, Chui-Hong ;
Chen, Wen-Hui .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 :8043-8056
[4]  
Bai S., 2018, arXiv
[5]   Long-Term On-Board Prediction of People in Traffic Scenes under Uncertainty [J].
Bhattacharyya, Apratim ;
Fritz, Mario ;
Schiele, Bernt .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4194-4202
[6]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[7]   Future pedestrian location prediction in first-person videos for autonomous vehicles and social robots [J].
Chen, Kai ;
Zhu, Haihua ;
Tang, Dunbing ;
Zheng, Kun .
IMAGE AND VISION COMPUTING, 2023, 134
[8]   Goal-driven Self-Attentive Recurrent Networks for Trajectory Prediction [J].
Chiara, Luigi Filippo ;
Coscia, Pasquale ;
Das, Sourav ;
Calderara, Simone ;
Cucchiara, Rita ;
Ballan, Lamberto .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :2517-2526
[9]  
Dendorfer P, 2021, Img Proc Comp Vis Re, V12623, P405, DOI 10.1007/978-3-030-69532-3_25
[10]   Bio-LSTM: A Biomechanically Inspired Recurrent Neural Network for 3-D Pedestrian Pose and Gait Prediction [J].
Du, Xiaoxiao ;
Vasudevan, Ram ;
Johnson-Roberson, Matthew .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02) :1501-1508