An actor-critic based learning method for decision-making and planning of autonomous vehicles

Cited by: 0
Authors
Can Xu
WanZhong Zhao
QingYun Chen
ChunYan Wang
Affiliation
[1] Nanjing University of Aeronautics and Astronautics, Department of Vehicle Engineering
Keywords
trajectory planning; decision-making; actor-critic; feature extraction; autonomous driving
DOI
Not available
Abstract
To improve the agility and applicability of trajectory planning algorithms for autonomous vehicles, this paper proposes a novel actor-critic based learning method for decision-making and planning in complex multi-vehicle traffic. The method plans the vehicle's path and speed jointly (coupled planning), which makes the trajectory more flexible. First, the mapping from the decided action to the planned trajectory is described by the end-point of the trajectory. Then, an actor-critic based learning method is built to learn an optimal policy for the decision process; the policy is updated along the gradient of the current policy's advantage. In this process, features of real traffic are extracted using time headway (TH) and speed distribution, and the reward function is built from safety, efficiency, and driving comfort. Furthermore, to improve convergence, the policy network is modularized into two parts, a lane-changing network and a lane-keeping network, which decide the optimal end-point among the path candidates and the speed candidates, respectively. Finally, a curved-road overtaking scenario and an interaction process with a human driver are tested to illustrate the feasibility and superiority of the method. The results show that the proposed method has better real-time performance and makes the planned coupled trajectory more continuous and smoother than the existing rule-based method.
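The ingredients named in the abstract (a time-headway feature, a reward combining safety, efficiency, and comfort, and a policy update driven by the advantage) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the reward weights, the 2 s headway threshold, and all function names below are hypothetical, and a tabular softmax policy over discrete end-point candidates stands in for the paper's policy networks.

```python
import math


def time_headway(gap_m, ego_speed_mps, eps=1e-6):
    """Time headway in seconds: gap to the leading vehicle / ego speed."""
    return gap_m / max(ego_speed_mps, eps)


def reward(th_s, speed_mps, target_speed_mps, jerk_mps3,
           w_safe=1.0, w_eff=0.5, w_comf=0.1, th_safe_s=2.0):
    """Hypothetical weighted reward (weights and threshold are assumptions)."""
    safety = -max(0.0, th_safe_s - th_s)      # penalize headway below threshold
    efficiency = -abs(target_speed_mps - speed_mps) / target_speed_mps
    comfort = -abs(jerk_mps3)                 # penalize harsh jerk
    return w_safe * safety + w_eff * efficiency + w_comf * comfort


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def actor_critic_step(logits, v_s, v_next, action, r,
                      gamma=0.99, lr_actor=0.1, lr_critic=0.1):
    """One-step advantage actor-critic update for a single state.

    logits: policy preferences over end-point candidates (assumed discrete)
    v_s, v_next: critic's value estimates for the current and next state
    Returns (new_logits, new_v_s, advantage).
    """
    # The one-step TD error serves as the advantage estimate.
    advantage = r + gamma * v_next - v_s
    probs = softmax(logits)
    # For a softmax policy, d log pi(a) / d logit_i = 1[i == a] - pi(i).
    new_logits = [
        z + lr_actor * advantage * ((1.0 if i == action else 0.0) - p)
        for i, (z, p) in enumerate(zip(logits, probs))
    ]
    # Critic moves its estimate of V(s) toward the TD target.
    new_v_s = v_s + lr_critic * advantage
    return new_logits, new_v_s, advantage


if __name__ == "__main__":
    th = time_headway(gap_m=30.0, ego_speed_mps=15.0)
    r = reward(th, speed_mps=14.0, target_speed_mps=15.0, jerk_mps3=0.5)
    logits, v, adv = actor_critic_step([0.0, 0.0, 0.0], 0.0, 0.0, action=1, r=r)
    print(th, r, adv, softmax(logits))
```

In this sketch a positive advantage raises the probability of the chosen end-point candidate and a negative one lowers it, which is the sense in which the policy follows the gradient of its advantage.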
Pages: 984-994
Page count: 10