Multi-scale graph-transformer network for trajectory prediction of the autonomous vehicles

被引:8
作者
Singh, Divya [1 ]
Srivastava, Rajeev [1 ]
机构
[1] Banaras Hindu Univ, Dept Comp Sci & Engn, Indian Inst Technol, Varanasi 221005, Uttar Pradesh, India
关键词
Graph neural network; Transformer; Autonomous vehicles; Trajectory prediction;
D O I
10.1007/s11370-022-00422-w
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The accurate trajectory prediction is a crucial task for the autonomous vehicles that help to plan and fast decision making capability of the system to reach their destination in the complex road scenario with abiding by the traffic rules. For this, autonomous vehicles should have more attention to their goal without affecting the other's task and maintain their safety from road accidents. With this motivation, we proposed a multi-scale graph-transformer-based attention mechanism that provides the interaction between the road agents with different time instances, because from time to time, few new agents may enter the frame scene, and few may leave the frame scene. Each dynamic obstacles trajectory can be defined as state sequences within an interval of time, where spatial coordinates of dynamic obstacles represented by the each state under the world coordinate frame. We have presented graph-based Multi-scale spatial features with transformer network that achieves significant prediction results compared to other existing methods, and we provide an in-depth analysis of the trained weights for different highways scenarios with transformer and the Long-Short Term Memory. We evaluate our model with three publicly available datasets and achieve state-of-the-art performances as presented in the manuscript. The performance balance is more in favour of our model for sparser datasets compared to the dense datasets.
引用
收藏
页码:307 / 320
页数:14
相关论文
共 54 条
  • [11] Cunjun Yu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12357), P507, DOI 10.1007/978-3-030-58610-2_30
  • [12] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [13] Dirik M, 2020, J ENG RES-KUWAIT, V8, P95
  • [14] Design of Mobile Robot Control Infrastructure Based on Decision Trees and Adaptive Potential Area Methods
    Donmez, Emrah
    Kocamaz, Adnan Fatih
    [J]. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2020, 44 (01) : 431 - 448
  • [15] A Vision-Based Real-Time Mobile Robot Controller Design Based on Gaussian Function for Indoor Environment
    Donmez, Emrah
    Kocamaz, Adnan Fatih
    Dirik, Mahmut
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (12) : 7127 - 7142
  • [16] TPNet: Trajectory Proposal Network for Motion Prediction
    Fang, Liangji
    Jiang, Qinhong
    Shi, Jianping
    Zhou, Bolei
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6796 - 6805
  • [17] Fisac JF, 2019, P INT C ROBOTICS AUT, V2019
  • [18] Fragkiadaki K, 2015, P IEEE INT C COMPUTE, V2015
  • [19] Gambs S, 2012, P 1 WORK MEAS PRIV M, P5
  • [20] Gao JY, 2020, PROC CVPR IEEE, P11522, DOI 10.1109/CVPR42600.2020.01154