Pedestrian Graph plus : A Fast Pedestrian Crossing Prediction Model Based on Graph Convolutional Networks

被引:47
作者
Cadena, Pablo Rodrigo Gantier [1 ,2 ,3 ]
Qian, Yeqiang [4 ]
Wang, Chunxiang [1 ,2 ,3 ]
Yang, Ming [1 ,2 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[2] Minist Educ China, Key Lab Syst Control & Informat Proc, Shanghai 200240, Peoples R China
[3] Shanghai Engn Res Ctr Intelligent Control & Manag, Shanghai 200240, Peoples R China
[4] Shanghai Jiao Tong Univ, Univ Michigan Shanghai Jiao Tong Univ Joint Inst, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Predictive models; Data models; Computational modeling; Pose estimation; Task analysis; Real-time systems; Convolution; Pedestrian crossing prediction; pedestrian behavior; graph convolutional network;
D O I
10.1109/TITS.2022.3173537
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Estimating when pedestrians cross the street is essential for intelligent transportation systems. Accurate, real-time prediction is critical to ensure the safety of the most vulnerable road users while improving passenger comfort. In the present work, we developed a model called Pedestrian Graph +, an improvement of our previous work, Pedestrian Graph, which predicts pedestrian crossing action in urban areas based on a Graph Convolution Network. We integrated two convolutional modules in the new model that provide additional context information (cropped images, cropped segmentation maps, ego-vehicle velocity data) to the main Graph Convolutional module, thus increasing accuracy. Our model is faster and smaller than other state-of-the-art models, achieving equivalent accuracy. Our model is faster than state-of-the-art models, with an inference time of 6 ms (on a GTX 1080) and low memory consumption (0.3 MB). We tested our model on two datasets, Joint Attention in Autonomous Driving (JAAD) and Pedestrian Intention Estimation (PIE), achieving 86% and 89% accuracy, respectively. Another contribution of our work is the ability to dynamically process almost any input size in the time domain without significant loss of accuracy. It is possible due to the fully convolutional property of ConvNets. Our models and results are available at https://github.com/RodrigoGantier/Pedestrian_graph_plus.
引用
收藏
页码:21050 / 21061
页数:12
相关论文
共 61 条
[21]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
[22]  
Huang G., 2016, P IEEE C COMP VIS PA, P2261, DOI DOI 10.1109/CVPR.2017.243
[23]  
Ioffe Sergey, 2015, P MACHINE LEARNING R, V37, P448
[24]  
Kay W., 2017, The Kinetics Human Action Video Dataset
[25]  
Kipf T. N., 2017, ICLR, P1, DOI https://doi.org/10.48550/arXiv.1609.02907
[26]   Benchmark for Evaluating Pedestrian Action Prediction [J].
Kotseruba, Iuliia ;
Rasouli, Amir ;
Tsotsos, John K. .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :1257-1267
[27]  
Kotseruba I, 2020, IEEE INT VEH SYM, P1688, DOI [10.1109/IV47402.2020.9304591, 10.1109/iv47402.2020.9304591]
[28]  
Lee Jin Han, 2019, ARXIV1907
[29]   CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark [J].
Li, Jiefeng ;
Wang, Can ;
Zhu, Hao ;
Mao, Yihuan ;
Fang, Hao-Shu ;
Lu, Cewu .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10855-10864
[30]   Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction [J].
Liu, Bingbin ;
Adeli, Ehsan ;
Cao, Zhangjie ;
Lee, Kuan-Hui ;
Shenoi, Abhijeet ;
Gaidon, Adrien ;
Niebles, Juan Carlos .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :3485-3492