Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning

被引:0
|
作者
Chowdhury, Jayabrata [1 ]
Shivaraman, Venkataramanan [2 ]
Sundaram, Suresh [1 ]
Sujit, P. B. [2 ]
机构
[1] Indian Inst Sci, Bengaluru, India
[2] Indian Inst Sci Educ & Res, Bhopal, India
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10 | 2024年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advancements in motion planning for Autonomous Vehicles (AVs) show great promise in using expert driver behaviors in non-stationary driving environments. However, learning only through expert drivers needs more generalizability to recover from domain shifts and near-failure scenarios due to the dynamic behavior of traffic participants and weather conditions. A deep Graph-based Prediction and Planning Policy Network (GP3Net) framework is proposed for non-stationary environments that encodes the interactions between traffic participants with contextual information and provides a decision for safe maneuver for AV. A spatio-temporal graph models the interactions between traffic participants for predicting the future trajectories of those participants. The predicted trajectories are utilized to generate a future occupancy map around the AV with uncertainties embedded to anticipate the evolving non-stationary driving environments. Then the contextual information and future occupancy maps are input to the policy network of the GP3Net framework and trained using Proximal Policy Optimization (PPO) algorithm. The proposed GP3Net performance is evaluated on standard CARLA benchmarking scenarios with domain shifts of traffic patterns (urban, highway, and mixed). The results show that the GP3Net outperforms previous state-of-the-art imitation learning-based planning models for different towns. Further, in unseen new weather conditions, GP3Net completes the desired route with fewer traffic infractions. Finally, the results emphasize the advantage of including the prediction module to enhance safety measures in non-stationary environments.
引用
收藏
页码:11606 / 11614
页数:9
相关论文
共 14 条
  • [1] Self-driving network and service coordination using deep reinforcement learning
    Schneider, Stefan
    Manzoor, Adnan
    Qarawlus, Haydar
    Schellenberg, Rafael
    Karl, Holger
    Khalili, Ramin
    Hecker, Artur
    16th International Conference on Network and Service Management, CNSM 2020, 2nd International Workshop on Analytics for Service and Application Management, AnServApp 2020 and 1st International Workshop on the Future Evolution of Internet Protocols, IPFuture 2020, 2020,
  • [2] Self-Driving Network and Service Coordination Using Deep Reinforcement Learning
    Schneider, Stefan
    Manzoor, Adnan
    Qarawlus, Haydar
    Schellenberg, Rafael
    Karl, Holger
    Khalili, Ramin
    Hecker, Artur
    2020 16TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM), 2020,
  • [3] Deep Reinforcement Learning based Planning for Urban Self-driving with Demonstration and Depth Completion
    Wang, Chuyao
    Aouf, Nabil
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 962 - 967
  • [4] Trajectory Prediction Using Graph-Based Deep Learning for Longitudinal Control of Autonomous Vehicles: A Proactive Approach for Autonomous Driving in Urban Dynamic Traffic Environments
    Yoon, Youngmin
    Yi, Kyongsu
    IEEE VEHICULAR TECHNOLOGY MAGAZINE, 2022, 17 (04): : 18 - 27
  • [5] Improving the learning of self-driving vehicles based on real driving behavior using deep neural network techniques
    Zaghari, Nayereh
    Fathy, Mahmood
    Jameii, Seyed Mahdi
    Sabokrou, Mohammad
    Shahverdy, Mohammad
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (04): : 3752 - 3794
  • [6] Improving the learning of self-driving vehicles based on real driving behavior using deep neural network techniques
    Nayereh Zaghari
    Mahmood Fathy
    Seyed Mahdi Jameii
    Mohammad Sabokrou
    Mohammad Shahverdy
    The Journal of Supercomputing, 2021, 77 : 3752 - 3794
  • [7] A Multi-sensing Input and Multi-constraint Reward Mechanism Based Deep Reinforcement Learning Method for Self-driving Policy Learning
    Wang, Zhongli
    Wang, Hao
    Cui, Xin
    Zheng, Chaochao
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2021, PT IV, 2021, 13016 : 691 - 701
  • [8] Ad Hoc-Obstacle Avoidance-Based Navigation System Using Deep Reinforcement Learning for Self-Driving Vehicles
    Manikandan, N. S.
    Kaliyaperumal, Ganesan
    Wang, Yong
    IEEE ACCESS, 2023, 11 : 92285 - 92297
  • [9] Deep Reinforcement Learning based control algorithms: Training and validation using the ROS Framework in CARLA Simulator for Self-Driving applications
    Perez-Gill, Oscar
    Barea, Rafael
    Lopez-Guillen, Elena
    Bergasa, Luis M.
    Gomez-Huelamo, Carlos
    Gutierrez, Rodrigo
    Diaz, Alejandro
    2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, : 1268 - 1273
  • [10] Petri-net-based dynamic scheduling of flexible manufacturing system via deep reinforcement learning with graph convolutional network
    Hu, Liang
    Liu, Zhenyu
    Hu, Weifei
    Wang, Yueyang
    Tan, Jianrong
    Wu, Fei
    JOURNAL OF MANUFACTURING SYSTEMS, 2020, 55 : 1 - 14