Multi-scale pedestrian intent prediction using 3D joint information as spatio-temporal representation

被引:10
作者
Ahmed, Sarfraz [1 ]
Al Bazi, Ammar [2 ]
Saha, Chitta [1 ]
Rajbhandari, Sujan [3 ]
Huda, M. Nazmul [4 ]
机构
[1] Coventry Univ, Sch Future Transport Engn, Priory St, Coventry CV1 5FB, W Midlands, England
[2] Coventry Univ, Sch Mech Engn, Priory St, Coventry CV1 5FB, W Midlands, England
[3] Bangor Univ, DSP Ctr Excellence, Sch Comp Sci & Elect Engn, Bangor LL57 2DG, Gwynedd, Wales
[4] Brunel Univ London, Dept Elect & Elect Engn, Kingston Lane, London UB8 3PH, England
关键词
LSTM; Intent prediction; Pose estimation; Tracking; Pedestrian detection; TRAJECTORY PREDICTION; MODEL;
D O I
10.1016/j.eswa.2023.120077
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There has been a rise of use of Autonomous Vehicles on public roads. With the predicted rise of road traffic accidents over the coming years, these vehicles must be capable of safely operate in the public domain. The field of pedestrian detection has significantly advanced in the last decade, providing high-level accuracy, with some technique reaching near-human level accuracy. However, there remains further work required for pedestrian intent prediction to reach human-level performance. One of the challenges facing current pedestrian intent predictors are the varying scales of pedestrians, particularly smaller pedestrians. This is because smaller pedestrians can blend into the background, making them difficult to detect, track or apply pose estimations techniques. Therefore, in this work, we present a novel intent prediction approach for multi-scale pedestrians using 2D pose estimation and a Long Short-term memory (LSTM) architecture. The pose estimator predicts keypoints for the pedestrian along the video frames. Based on the accumulation of these keypoints along the frames, spatio-temporal data is generated. This spatio-temporal data is fed to the LSTM for classifying the crossing behaviour of the pedestrians. We evaluate the performance of the proposed techniques on the popular Joint Attention in Autonomous Driving (JAAD) dataset and the new larger-scale Pedestrian Intention Estimation (PIE) dataset. Using data generalisation techniques, we show that the proposed technique outperformed the state-of-the-art techniques by up to 7%, reaching up to 94% accuracy while maintaining a comparable run-time of 6.1 ms.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Ahmed Sarfraz, 2019, Towards Autonomous Robotic Systems. 20th Annual Conference, TAROS 2019. Proceedings: Lecture Notes in Artificial Intelligence (LNAI 11650), P223, DOI 10.1007/978-3-030-25332-5_20
  • [2] Pedestrian and Cyclist Detection and Intent Estimation for Autonomous Vehicles: A Survey
    Ahmed, Sarfraz
    Huda, M. Nazmul
    Rajbhandari, Sujan
    Saha, Chitta
    Elshaw, Mark
    Kanarachos, Stratis
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (11):
  • [3] Benfold B., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3457, DOI 10.1109/CVPR.2011.5995667
  • [4] Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
  • [5] Bouhsain S. A., 2020, EUROPEAN ASS RES TRA
  • [6] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
    Carreira, Joao
    Zisserman, Andrew
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
  • [7] Pedestrian Trajectory Prediction in Heterogeneous Traffic Using Pose Keypoints-Based Convolutional Encoder-Decoder Network
    Chen, Kai
    Song, Xiao
    Ren, Xiaoxiang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) : 1764 - 1775
  • [8] HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
    Cheng, Bowen
    Xiao, Bin
    Wang, Jingdong
    Shi, Honghui
    Huang, Thomas S.
    Zhang, Lei
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5385 - 5394
  • [9] CMU, 2017, CMU GRAPH LAB MOT CA
  • [10] Pedestrian Detection: An Evaluation of the State of the Art
    Dollar, Piotr
    Wojek, Christian
    Schiele, Bernt
    Perona, Pietro
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) : 743 - 761