IMGCN: interpretable masked graph convolution network for pedestrian trajectory prediction

被引：7

作者：

Chen, Wangxing ^{[1
]}

Sang, Haifeng ^{[1
]}

Wang, Jinyu ^{[1
]}

Zhao, Zishan ^{[1
]}

机构：

[1] Shenyang Univ Technol, Sch Informat Sci & Engn, Shenyang 110870, Liaoning, Peoples R China

来源：

TRANSPORTMETRICA B-TRANSPORT DYNAMICS | 2024年 / 12卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Pedestrian trajectory prediction; graph convolution network; view-distance mask module; motion offset mask module; temporal convolution networks; POLICIES; MODEL;

D O I：

10.1080/21680566.2024.2389896

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

Pedestrian trajectory prediction holds significant research value in various fields, such as autonomous driving, autonomous service robots, and human flow monitoring. Two key challenges in pedestrian trajectory prediction are the modeling of pedestrian social interactions and movement factors. Previous methods have not utilized interpretable information to explore complex situations when modeling social interactions. These methods also focus too much on temporal interactions at each moment when modeling movement factors and are therefore susceptible to slight motion changes. To solve the above problems, we propose an Interpretable Masked Graph Convolution Network (IMGCN) for pedestrian trajectory prediction. The IMGCN utilizes interpretable information such as the pedestrian view area, distance, and motion direction to intelligently mask interaction features, resulting in more precise modeling of social interaction and movement factors. Specifically, we design a spatial and a temporal branch to model pedestrians' social interaction and movement factors, respectively. Within the spatial branch, the view-distance mask module masks pedestrian social interaction by determining whether the pedestrian is within a certain distance and view area to achieve more accurate interaction modeling. In the temporal branch, the motion offset mask module masks pedestrian temporal interaction according to the offset degree of their motion direction to achieve accurate modeling of movement factors. Ultimately, the 2D Gaussian distribution parameters of future trajectory points are predicted by the temporal convolution networks for multi-modal trajectory prediction. On the ETH, UCY and SDD datasets, our proposed method outperforms the baseline models in terms of average displacement error and final displacement error. The code is publicly available at https://github.com/Chenwangxing/IMGCN_master.

引用

页数：21

共 57 条

[1] Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].

Alahi, Alexandre ;

Goel, Kratarth ;

Ramanathan, Vignesh ;

Robicquet, Alexandre ;

Li Fei-Fei ;

Savarese, Silvio .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971

[2] Social Ways: Learning Multi-Modal Distributions of Pedestrian Trajectories with GANs [J].

Amirian, Javad ;

Hayet, Jean-Bernard ;

Pettre, Julien .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :2964-2972

[3]

Bütepage J, 2018, IEEE INT CONF ROBOT, P4563, DOI 10.1109/ICRA.2018.8460651

[4] Human Trajectory Prediction via Counterfactual Analysis [J].

Chen, Guangyi ;

Li, Junlong ;

Lu, Jiwen ;

Zhou, Jie .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9804-9813

[5] Traj-MAE: Masked Autoencoders for Trajectory Prediction [J].

Chen, Hao ;

Wang, Jiaze ;

Shao, Kun ;

Liu, Furui ;

Hao, Jianye ;

Guan, Chenyong ;

Chen, Guangyong ;

Heng, Pheng-Ann .

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :8317-8328

[6] WTGCN: wavelet transform graph convolution network for pedestrian trajectory prediction [J].

Chen, Wangxing ;

Sang, Haifeng ;

Wang, Jinyu ;

Zhao, Zishan .

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (12) :5531-5548

[7] STIGCN: spatial-temporal interaction-aware graph convolution network for pedestrian trajectory prediction [J].

Chen, Wangxing ;

Sang, Haifeng ;

Wang, Jinyu ;

Zhao, Zishan .

JOURNAL OF SUPERCOMPUTING, 2024, 80 (08) :10695-10719

[8]

Chen WH, 2021, PR MACH LEARN RES, V157, P454

[9]

Cui HG, 2019, IEEE INT CONF ROBOT, P2090, DOI [10.1109/ICRA.2019.8793868, 10.1109/icra.2019.8793868]

[10]

Cunjun Yu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12357), P507, DOI 10.1007/978-3-030-58610-2_30

← 1 2 3 4 5 6 →