Pedestrian Trajectory Prediction Method Based on Multi-information Fusion Network

被引：0

作者：

Gao, Song ^{[1
,2
]}

Zhou, Jianglin ^{[2
]}

Gao, Bolin ^{[1
,3
]}

Lu, Jian ^{[2
]}

Wang, He ^{[2
]}

Xu, Yueyun ^{[2
]}

机构：

[1] School of Vehicles and Mobility, Tsinghua University, Beijing

[2] National Innovation Center of Intelligent and Connected Vehicles, Beijing

来源：

Qiche Gongcheng/Automotive Engineering | 2024年 / 46卷 / 11期

关键词：

autonomous driving; ego-vehicle; multiple pedestrian information fusion network; pedestrian trajectory prediction;

D O I：

10.19562/j.chinasae.qcgc.2024.11.004

中图分类号：

学科分类号：

摘要：

With the continuous development of autonomous driving technology,accurately predicting the future trajectories of pedestrians has become a critical element in ensuring system safety and reliability. However, most existing studies on pedestrian trajectory prediction rely on fixed camera perspectives,which limits the compre⁃ hensive observation of pedestrian movement and thus makes them unsuitable for direct application to pedestrian tra⁃ jectory prediction under the ego-vehicle perspective in autonomous vehicles. To solve the problem,in this paper a pedestrian trajectory prediction method under the ego-vehicle perspective based on the Multi-Pedestrian Information Fusion Network(MPIFN)is proposed,which achieves accurate prediction of pedestrians' future trajectories by inte⁃ grating social information,local environmental information,and temporal information of pedestrians. In this paper, a Local Environmental Information Extraction Module that combines deformable convolution with traditional convo⁃ lutional and pooling operations is constructed,aiming to more effectively extract local information from complex en⁃ vironment. By dynamically adjusting the position of convolutional kernels,this module enhances the model’s adapt⁃ ability to irregular and complex shapes. Meanwhile,the pedestrian spatiotemporal information extraction module and multimodal feature fusion module are developed to facilitate comprehensive integration of social and environ⁃ mental information. The experimental results show that the proposed method achieves advanced performance on two ego-vehicle driving datasets,JAAD and PSI. Specifically,on the JAAD dataset,the Center Final Mean Squared Er⁃ ror(CF_MSE)is 4 063,and the Center Mean Squared Error(C_MSE)is 829. On the PSI dataset,the Average Root Mean Square Error(ARB)and Final Root Mean Square Error(FRB)also achieve outstanding performance with values of 18.08/29.21/44.98 and 25.27/54.62/93.09 for prediction horizons of 0.5 s,1.0 s,and 1.5 s,respec⁃ tively. © 2024 SAE-China. All rights reserved.

引用

页码：1973 / 1982

页数：9

共 29 条

[1]

MARCHETTI F, BECATTINI F, SEIDENARI L, Et al., Smemo: social memory for trajectory forecasting[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 46, 6, pp. 4410-4425, (2024)

[2]

SUN J, LI Y, CHAI L, Et al., Modality exploration,retrieval and adaptation for trajectory prediction[J], IEEE Transactions on Pat⁃ tern Analysis and Machine Intelligence, 45, 12, pp. 15051-15064, (2023)

[3]

SHI L, WANG L, LONG C, Et al., Representing multimodal be⁃ haviors with mean location for pedestrian trajectory prediction[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, 9, pp. 11184-11202, (2023)

[4]

YAO Y, ATKINS E, JOHNSON-ROBERSON M, Et al., BiTraP: bi-directional pedestrian trajectory prediction with multi-modal goal estimation[J], IEEE Robotics and Automation Letters, 6, 2, pp. 1463-1470, (2021)

[5]

GUO J H, HEI Z F, LUO Y G, Et al., Vehicle cut-in trajectory prediction based on deep learning in a human-machine mixed driving environment, Automotive Engineering, 44, 2, pp. 153-160, (2022)

[6]

GUO J H, XIAO B P, WANG J Y, Et al., Study on vehicle cut⁃in intention prediction based on residual BiLSTM network, Auto⁃ motive Engineering, 43, 7, pp. 971-977, (2021)

[7]

GODARD C, MAC AODHA O, BROSTOW G J., Unsupervised monocular depth estimation with left-right consistency[C], Pro⁃ ceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 270-279, (2017)

[8]

CHEN Y, SCHMID C, SMINCHISESCU C., Self-supervised learning with geometric constraints in monocular video:connect⁃ ing flow,depth,and camera[C], Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 7063-7072, (2019)

[9]

BHATTACHARYYA A, FRITZ M, SCHIELE B., Long-term on-board prediction of people in traffic scenes under uncertainty, Proceedings of the IEEE Conference on Computer Vision and Pat⁃ tern Recognition, pp. 4194-4202, (2018)

[10]

YAO Y, XU M, CHOI C, Et al., Egocentric vision-based future vehicle localization for intelligent driving assistance systems[C], 2019 International Conference on Robotics and Automation (ICRA), pp. 9711-9717, (2019)

← 1 2 3 →