Hybrid Car-Following Strategy Based on Deep Deterministic Policy Gradient and Cooperative Adaptive Cruise Control

被引:38
作者
Yan, Ruidong [1 ]
Jiang, Rui [1 ]
Jia, Bin [1 ]
Huang, Jin [2 ]
Yang, Diange [2 ]
机构
[1] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing 100044, Peoples R China
[2] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Mathematical model; Differential equations; Cruise control; Training; Reinforcement learning; Adaptation models; Space exploration; Car-following; cooperative adaptive cruise control (CACC); deep deterministic policy gradient (DDPG); hybrid strategy;
D O I
10.1109/TASE.2021.3100709
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep deterministic policy gradient (DDPG)-based car-following strategy can break through the constraints of the differential equation model due to the ability of exploration on complex environments. However, the car-following performance of DDPG is usually degraded by unreasonable reward function design, insufficient training, and low sampling efficiency. In order to solve this kind of problem, a hybrid car-following strategy based on DDPG and cooperative adaptive cruise control (CACC) is proposed. First, the car-following process is modeled as the Markov decision process to calculate CACC and DDPG simultaneously at each frame. Given a current state, two actions are obtained from CACC and DDPG, respectively. Then, an optimal action, corresponding to the one offering a larger reward, is chosen as the output of the hybrid strategy. Meanwhile, a rule is designed to ensure that the change rate of acceleration is smaller than the desired value. Therefore, the proposed strategy not only guarantees the basic performance of car-following through CACC but also makes full use of the advantages of exploration on complex environments via DDPG. Finally, simulation results show that the car-following performance of the proposed strategy is improved compared with that of DDPG and CACC.
引用
收藏
页码:2816 / 2824
页数:9
相关论文
共 50 条
[31]   Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm [J].
Li, Ning ;
Tang, Jichuan ;
Li, Zhong-Xian ;
Gao, Xiuyu .
STRUCTURAL CONTROL & HEALTH MONITORING, 2022, 29 (10)
[32]   Leaderless Cooperative Adaptive Cruise Control Based on the Constant Time-Gap Spacing Policy [J].
Rezaee, Hamed ;
Parisini, Thomas ;
Polycarpou, Marios M. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (01) :659-666
[33]   Obstacle avoidance strategy for an autonomous surface vessel based on modified deep deterministic policy gradient [J].
Zhou, Chang ;
Wang, Yiting ;
Wang, Lei ;
He, Huacheng .
OCEAN ENGINEERING, 2022, 243
[34]   A Deep Reinforcement Learning Method based on Deterministic Policy Gradient for Multi-Agent Cooperative Competition [J].
Zuo, Xuan ;
Xue, Hui-Feng ;
Wang, Xiao-Yin ;
Du, Wan-Ru ;
Tian, Tao ;
Gao, Shan ;
Zhang, Pu .
CONTROL ENGINEERING AND APPLIED INFORMATICS, 2021, 23 (03) :88-98
[35]   Deep Deterministic Policy Gradient Algorithm based Lateral and Longitudinal Control for Autonomous Driving [J].
Zhu Gongsheng ;
Pei Chunmei ;
Ding Jiang ;
Shi Junfeng .
2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, :736-741
[36]   Reinforcement Learning-based Car-Following Control for Autonomous Vehicles with OTFS [J].
Liu, Yulin ;
Shi, Yuye ;
Zhang, Xiaoqi ;
Wu, Jun ;
Yang, Songyuan .
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
[37]   STUDY ON JOINT CONTROL OF PUMP AND RADIATOR IN PEMFC BASED ON DEEP DETERMINISTIC POLICY GRADIENT [J].
Zhao H. ;
Pan S. ;
Wu Y. ;
Ma L. ;
Lyu T. .
Taiyangneng Xuebao/Acta Energiae Solaris Sinica, 2024, 45 (06) :92-101
[38]   Deep Deterministic Policy Gradient Based on Double Network Prioritized Experience Replay [J].
Kang, Chaohai ;
Rong, Chuiting ;
Ren, Weijian ;
Huo, Fengcai ;
Liu, Pengyun .
IEEE ACCESS, 2021, 9 :60296-60308
[39]   Speed planning and energy management strategy of hybrid electric vehicles in a car-following scenario [J].
Hou, Shengyan ;
Chen, Hong ;
Zhang, Yu ;
Gao, Jinwu .
CONTROL THEORY AND TECHNOLOGY, 2022, 20 (02) :185-196
[40]   Speed planning and energy management strategy of hybrid electric vehicles in a car-following scenario [J].
Shengyan Hou ;
Hong Chen ;
Yu Zhang ;
Jinwu Gao .
Control Theory and Technology, 2022, 20 :185-196