Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning

被引:0
|
作者
Chowdhury, Jayabrata [1 ]
Shivaraman, Venkataramanan [2 ]
Sundaram, Suresh [1 ]
Sujit, P. B. [2 ]
机构
[1] Indian Inst Sci, Bengaluru, India
[2] Indian Inst Sci Educ & Res, Bhopal, India
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10 | 2024年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advancements in motion planning for Autonomous Vehicles (AVs) show great promise in using expert driver behaviors in non-stationary driving environments. However, learning only through expert drivers needs more generalizability to recover from domain shifts and near-failure scenarios due to the dynamic behavior of traffic participants and weather conditions. A deep Graph-based Prediction and Planning Policy Network (GP3Net) framework is proposed for non-stationary environments that encodes the interactions between traffic participants with contextual information and provides a decision for safe maneuver for AV. A spatio-temporal graph models the interactions between traffic participants for predicting the future trajectories of those participants. The predicted trajectories are utilized to generate a future occupancy map around the AV with uncertainties embedded to anticipate the evolving non-stationary driving environments. Then the contextual information and future occupancy maps are input to the policy network of the GP3Net framework and trained using Proximal Policy Optimization (PPO) algorithm. The proposed GP3Net performance is evaluated on standard CARLA benchmarking scenarios with domain shifts of traffic patterns (urban, highway, and mixed). The results show that the GP3Net outperforms previous state-of-the-art imitation learning-based planning models for different towns. Further, in unseen new weather conditions, GP3Net completes the desired route with fewer traffic infractions. Finally, the results emphasize the advantage of including the prediction module to enhance safety measures in non-stationary environments.
引用
收藏
页码:11606 / 11614
页数:9
相关论文
共 14 条
  • [11] BAAO: Bayesian and Adam optimizer for fault prediction in self-driving software systems using deep learning-based hyperparameter tuning
    Sumedha Dangi
    Deepak Kumar
    Vipin Khurana
    International Journal of Information Technology, 2025, 17 (2) : 841 - 850
  • [12] Hierarchical Traffic Engineering in 3D Networks Using QoS-Aware Graph-Based Deep Reinforcement Learning
    Kolakowski, Robert
    Tomaszewski, Lechoslaw
    Tepinski, Rafal
    Kuklinski, Slawomir
    ELECTRONICS, 2025, 14 (05):
  • [13] MACNS: A generic graph neural network integrated deep reinforcement learning based multi-agent collaborative navigation system for dynamic trajectory planning
    Xiao, Ziren
    Li, Peisong
    Liu, Chang
    Gao, Honghao
    Wang, Xinheng
    INFORMATION FUSION, 2024, 105
  • [14] VizNav: A Modular Off-Policy Deep Reinforcement Learning Framework for Vision-Based Autonomous UAV Navigation in 3D Dynamic Environments
    Almahamid, Fadi
    Grolinger, Katarina
    DRONES, 2024, 8 (05)