A deep reinforcement learning based distributed control strategy for connected automated vehicles in mixed traffic platoon

被引:60
作者
Shi, Haotian
Chen, Danjue [1 ,2 ]
Zheng, Nan [3 ]
Wang, Xin [4 ]
Zhou, Yang [5 ]
Ran, Bin [1 ]
机构
[1] Univ Wisconsin, Dept Civil & Environm Engn, 1208 Engn Hall,1415 Engn Dr, Madison, WI 53706 USA
[2] Univ Massachusetts Lowell, Dept Civil & Environm Engn, 1 Univ Ave,Shal Hall 2000, Lowell, MA 01854 USA
[3] Monash Univ, Dept Civil Engn, 23 Coll Walk B60,Clayton Campus, Monash Clayton, Vic 3800, Australia
[4] Dept Ind Syst Engn, 3258 Mech Engn Bldg 1513 Univ Ave, Madison, WI 53706 USA
[5] Texas A&M Univ, Zachry Dept Civil & Environm Engn, 199 Spence St,DLEB 301,3136 TAMU, College Stn, TX 77843 USA
关键词
Mixed traffic environment; Distributed control; Deep reinforcement learning; Traffic oscillation dampening; Connected automated vehicle; MODEL-PREDICTIVE CONTROL; ROLLING HORIZON CONTROL; CAR-FOLLOWING MODEL; STRING STABILITY; SYSTEMS; VALIDATION; FRAMEWORK; WAVELET; IMPACT; CACC;
D O I
10.1016/j.trc.2023.104019
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
This paper proposes an innovative distributed longitudinal control strategy for connected auto-mated vehicles (CAVs) in the mixed traffic environment of CAV and human-driven vehicles (HDVs), incorporating high-dimensional platoon information. For mixed traffic, the traditional CAV control method focuses on microscopic trajectory information, which may not be efficient in handling the HDV stochasticity (e.g., long reaction time; various driving styles) and mixed traffic heterogeneities. Different from traditional methods, our method, for the first time, characterizes consecutive HDVs as a whole (i.e., AHDV) to reduce the HDV stochasticity and utilize its macroscopic features to control the following CAVs. The new control strategy takes advantage of platoon information to anticipate the disturbances and traffic features induced downstream under mixed traffic scenarios and greatly outperforms the traditional methods. In particular, the control algorithm is based on deep reinforcement learning (DRL) to fulfill car-following control efficiency and further address the stochasticity for the aggregated car following behavior by embedding it in the training environment. To better utilize the macroscopic traffic features, a general platoon of mixed traffic is categorized as a CAV-HDVs-CAV pattern and described by corresponding DRL states. The macroscopic traffic flow properties are built upon the Newell car-following model to capture the characteristics of aggregated HDVs' joint behaviors. Simulated experiments are conducted to validate our proposed strategy. The results demonstrate that the proposed control method has outstanding performances in terms of oscillation dampening, eco-driving, and generalization capability.
引用
收藏
页数:27
相关论文
共 72 条
[61]   Consensus and disturbance attenuation in multi-agent chains with nonlinear control and time delays [J].
Zhang, Linjun ;
Orosz, Gabor .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2017, 27 (05) :781-803
[62]   Motif-Based Design for Connected Vehicle Systems in Presence of Heterogeneous Connectivity Structures and Time Delays [J].
Zhang, Linjun ;
Orosz, Gabor .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (06) :1638-1651
[63]   Analysis of highway performance under mixed connected and regular vehicle environment [J].
Zhang Z. ;
Yang X. .
Journal of Intelligent and Connected Vehicles, 2021, 4 (02) :68-79
[64]   Analyzing the impact of automated vehicles on uncertainty and stability of the mixed traffic flow [J].
Zheng, Fangfang ;
Liu, Can ;
Liu, Xiaobo ;
Jabari, Saif Eddin ;
Lu, Liang .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 112 :203-219
[65]   Cooperative control strategies to stabilise the freeway mixed traffic stability and improve traffic throughput in an intelligent roadside system environment [J].
Zheng, Yuan ;
Zhang, Yu ;
Ran, Bin ;
Xu, Yueru ;
Qu, Xu .
IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (09) :1108-1115
[66]   On selecting an optimal wavelet for detecting singularities in traffic and vehicular data [J].
Zheng, Zuduo ;
Washington, Simon .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2012, 25 :18-33
[67]   Freeway traffic oscillations: Microscopic analysis of formations and propagations using Wavelet Transform [J].
Zheng, Zuduo ;
Ahn, Soyoung ;
Chen, Danjue ;
Laval, Jorge .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2011, 45 (09) :1378-1388
[68]   Stabilizing mixed vehicular platoons with connected automated vehicles: An H-infinity approach [J].
Zhou, Yang ;
Ahn, Soyoung ;
Wang, Meng ;
Hoogendoorn, Serge .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2020, 132 (152-170) :152-170
[69]   Distributed model predictive control approach for cooperative car-following with guaranteed local and string stability [J].
Zhou, Yang ;
Wang, Meng ;
Ahn, Soyoung .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2019, 128 :69-86
[70]   Rolling horizon stochastic optimal control strategy for ACC and CACC under uncertainty [J].
Zhou, Yang ;
Ahn, Soyoung ;
Chitturi, Madhav ;
Noyce, David A. .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2017, 83 :61-76