Local Planning Strategy Based on Deep Reinforcement Learning Over Estimation Suppression

被引:0
作者
Han, Ling [1 ]
Wang, Yiren [1 ]
Chi, Ruifeng [1 ]
Fang, Ruoyu [1 ]
Liu, Guopeng [1 ]
Yi, Qiang [2 ]
Zhu, Changsheng [1 ]
机构
[1] Changchun Univ Technol, Sch Mechatron Engn, Changchun 130012, Peoples R China
[2] Indiana Univ Purdue Univ, Indianapolis, IN 46074 USA
关键词
Intelligent vehicle; Deep reinforcement learning; Deep Q-Network; Information entropy; AUTONOMOUS VEHICLES; TRACKING; MODEL; AVOIDANCE; FRAMEWORK;
D O I
10.1007/s12239-024-00076-w
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Local planning is a critical and difficult task for intelligent vehicles in dynamic transportation environments. In this paper, a new method Suppress Q Deep Q Network (SQDQN) combining traditional deep reinforcement learning Deep Q Network (DQN) with information entropy is proposed for local planning in automatic driving. In the proposed approach, local planning strategy in complex traffic environment established by the actor-critic network based on DQN, the method adopts the way of execution action-evaluation action-update network to explore the optimal local planning strategy. Proposed strategy does not rely on accurate modeling of the scene, so it is suitable for complex and changeable traffic scenes. At the same time, evaluate the update process and determine the update range by using information entropy to solve a common problem in the network that over expectation of actions damage the performance of strategies. Use this approach to improve strategic performance. The trained local planning strategy is evaluated in three simulation scenarios: overtaking, following, driving in hazardous situations. The results illustrate the advantages of the proposed SQDQN method in solving local planning problem.
引用
收藏
页码:837 / 848
页数:12
相关论文
共 27 条
  • [1] Deep Reinforcement Learning A brief survey
    Arulkumaran, Kai
    Deisenroth, Marc Peter
    Brundage, Miles
    Bharath, Anil Anthony
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
  • [2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [3] Local Path Planning for Off-oad Autonomous Driving With Avoidance of Static Obstacles
    Chu, Keonyup
    Lee, Minchae
    Sunwoo, Myoungho
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2012, 13 (04) : 1599 - 1616
  • [4] Situational Assessment for Intelligent Vehicles Based on Stochastic Model and Gaussian Distributions in Typical Traffic Scenarios
    Gao, Hongbo
    Zhu, Juping
    Zhang, Tong
    Xie, Guotao
    Kan, Zhen
    Hao, Zhengyuan
    Liu, Kang
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (03): : 1426 - 1436
  • [5] A REVIEW OF OPTIMAL MOTION PLANNING FOR UNMANNED VEHICLES
    Gao, Shuhua
    Huang, Sunan
    Xiang, Cheng
    Lee, Tong Heng
    [J]. JOURNAL OF MARINE SCIENCE AND TECHNOLOGY-TAIWAN, 2020, 28 (05): : 321 - 330
  • [6] Vehicle Detection and Tracking in Adverse Weather Using a Deep Learning Framework
    Hassaballah, M.
    Kenk, Mourad A.
    Muhammad, Khan
    Minaee, Shervin
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (07) : 4230 - 4242
  • [7] Hasselt H., 2010, ADV NEURAL INFORM PR, V23, DOI DOI 10.5555/2997046.2997187
  • [8] A Motion Planning and Tracking Framework for Autonomous Vehicles Based on Artificial Potential Field Elaborated Resistance Network Approach
    Huang, Yanjun
    Ding, Haitao
    Zhang, Yubiao
    Wang, Hong
    Cao, Dongpu
    Xu, Nan
    Hu, Chuan
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (02) : 1376 - 1386
  • [9] Target Vehicle Motion Prediction-Based Motion Planning Framework for Autonomous Driving in Uncontrolled Intersections
    Jeong, Yonghwan
    Yi, Kyongsu
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (01) : 168 - 177
  • [10] Modeling vehicle car-following behavior in congested traffic conditions based on different vehicle combinations
    Kong, Dewen
    List, George F.
    Guo, Xiucheng
    Wu, Dingxin
    [J]. TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2018, 10 (05): : 280 - 293