An Integrated Lateral and Longitudinal Decision-Making Model for Autonomous Driving Based on Deep Reinforcement Learning

被引:3
作者
Cui, Jianxun [1 ]
Zhao, Boyuan [1 ]
Qu, Mingcheng [2 ]
机构
[1] Harbin Inst Technol, Sch Transportat Sci & Engn, Harbin 150090, Peoples R China
[2] Harbin Inst Technol, Dept Software, Harbin 150001, Peoples R China
关键词
VEHICLES;
D O I
10.1155/2023/1513008
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Decision-making is an important component of autonomous driving perception, decision-making, planning, and control pipeline, which undertakes the task of how the ego vehicle makes high-level decision-making behaviors (such as lane change and car following) after sensing the environmental state, and then these high-level decision-making behaviors can be transmitted to the downstream planning and control module for specific low-level action execution. Based on the method of deep reinforcement learning (specifically, Deep Q network (DQN) and its variants), an integrated lateral and longitudinal decision-making model for autonomous driving is proposed in a multilane highway environment with both autonomous driving vehicle (ADV) and manual driving vehicle (MDV). The classic MOBIL and IDM models are used for the lateral and longitudinal decisions of MDV (i.e., lane changing and car following), while the lateral and longitudinal decisions of ADV are dominated by deep reinforcement learning models. In addition, this paper also uses the nonlinear kinematic bicycle model and two-point visual control model to realize the low-level control of both MDV and ADV. By setting a reasonable state, action, and reward function, this paper has carried out a large number of simulation experiments on the proposed autonomous driving decision-making model based on deep reinforcement learning in a three-lane road environment. The results show that under such scenario setting conditions, the deep reinforcement learning-based model proposed in this paper performs well in autonomous driving safety and travel efficiency. At the same time, when compared with the classical rule-based decision-making model (MOBIL&IDM), it is found that the model proposed in this paper can significantly achieve better results in episode rewards after stable training. In addition, through a large number of hyper-parameter tuning experiments, the performance of DQN, DDQN, and dueling DQN models, which are also deep reinforcement learning-based decision-making models, under different hyper-parametric configurations is compared and analyzed, which can provide a valuable reference for the specific scenario application of these models.
引用
收藏
页数:13
相关论文
共 42 条
[1]   Reachability-Based Decision-Making for Autonomous Driving: Theory and Experiments [J].
Ahn, Heejin ;
Berntorp, Karl ;
Inani, Pranav ;
Ram, Arjun Jagdish ;
Di Cairano, Stefano .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2021, 29 (05) :1907-1921
[2]  
Alizadeh A, 2019, IEEE INT C INTELL TR, P1399, DOI [10.1109/itsc.2019.8917192, 10.1109/ITSC.2019.8917192]
[3]  
Bai ZW, 2019, CHIN CONTR CONF, P8600, DOI [10.23919/ChiCC.2019.8866005, 10.23919/chicc.2019.8866005]
[4]  
Bergstra J., 2011, Advances in Neural Information Processing Systems, V24, P2546
[5]  
Bi H., 2016, Eurographics/ACM SIGGRAPH Symposium on Computer Animation, P149
[6]  
Buehler M, 2009, SPRINGER TRAC ADV RO, V56, P1, DOI 10.1007/978-3-642-03991-1
[7]   A comprehensive survey of multiagent reinforcement learning [J].
Busoniu, Lucian ;
Babuska, Robert ;
De Schutter, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172
[8]  
Chen JY, 2019, IEEE INT C INT ROBOT, P2884, DOI [10.1109/IROS40897.2019.8968225, 10.1109/iros40897.2019.8968225]
[9]  
Finn C, 2016, PR MACH LEARN RES, V48
[10]  
Fu JS, 2018, Arxiv, DOI [arXiv:1710.11248, 10.48550/arXiv.1710.11248]