Hybrid Car-Following Strategy Based on Deep Deterministic Policy Gradient and Cooperative Adaptive Cruise Control

被引：38

作者：

Yan, Ruidong ^{[1
]}

Jiang, Rui ^{[1
]}

Jia, Bin ^{[1
]}

Huang, Jin ^{[2
]}

Yang, Diange ^{[2
]}

机构：

[1] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing 100044, Peoples R China

[2] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2022年 / 19卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Mathematical model; Differential equations; Cruise control; Training; Reinforcement learning; Adaptation models; Space exploration; Car-following; cooperative adaptive cruise control (CACC); deep deterministic policy gradient (DDPG); hybrid strategy;

D O I：

10.1109/TASE.2021.3100709

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep deterministic policy gradient (DDPG)-based car-following strategy can break through the constraints of the differential equation model due to the ability of exploration on complex environments. However, the car-following performance of DDPG is usually degraded by unreasonable reward function design, insufficient training, and low sampling efficiency. In order to solve this kind of problem, a hybrid car-following strategy based on DDPG and cooperative adaptive cruise control (CACC) is proposed. First, the car-following process is modeled as the Markov decision process to calculate CACC and DDPG simultaneously at each frame. Given a current state, two actions are obtained from CACC and DDPG, respectively. Then, an optimal action, corresponding to the one offering a larger reward, is chosen as the output of the hybrid strategy. Meanwhile, a rule is designed to ensure that the change rate of acceleration is smaller than the desired value. Therefore, the proposed strategy not only guarantees the basic performance of car-following through CACC but also makes full use of the advantages of exploration on complex environments via DDPG. Finally, simulation results show that the car-following performance of the proposed strategy is improved compared with that of DDPG and CACC.

引用

页码：2816 / 2824

页数：9

共 50 条

[31] Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm [J].

Li, Ning ;

Tang, Jichuan ;

Li, Zhong-Xian ;

Gao, Xiuyu .

STRUCTURAL CONTROL & HEALTH MONITORING, 2022, 29 (10)

[32] Leaderless Cooperative Adaptive Cruise Control Based on the Constant Time-Gap Spacing Policy [J].

Rezaee, Hamed ;

Parisini, Thomas ;

Polycarpou, Marios M. .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (01) :659-666

[33] Obstacle avoidance strategy for an autonomous surface vessel based on modified deep deterministic policy gradient [J].

Zhou, Chang ;

Wang, Yiting ;

Wang, Lei ;

He, Huacheng .

OCEAN ENGINEERING, 2022, 243

[34] A Deep Reinforcement Learning Method based on Deterministic Policy Gradient for Multi-Agent Cooperative Competition [J].

Zuo, Xuan ;

Xue, Hui-Feng ;

Wang, Xiao-Yin ;

Du, Wan-Ru ;

Tian, Tao ;

Gao, Shan ;

Zhang, Pu .

CONTROL ENGINEERING AND APPLIED INFORMATICS, 2021, 23 (03) :88-98

[35] Deep Deterministic Policy Gradient Algorithm based Lateral and Longitudinal Control for Autonomous Driving [J].

Zhu Gongsheng ;

Pei Chunmei ;

Ding Jiang ;

Shi Junfeng .

2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, :736-741

[36] Reinforcement Learning-based Car-Following Control for Autonomous Vehicles with OTFS [J].

Liu, Yulin ;

Shi, Yuye ;

Zhang, Xiaoqi ;

Wu, Jun ;

Yang, Songyuan .

2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,

[37] STUDY ON JOINT CONTROL OF PUMP AND RADIATOR IN PEMFC BASED ON DEEP DETERMINISTIC POLICY GRADIENT [J].

Zhao H. ;

Pan S. ;

Wu Y. ;

Ma L. ;

Lyu T. .

Taiyangneng Xuebao/Acta Energiae Solaris Sinica, 2024, 45 (06) :92-101

[38] Deep Deterministic Policy Gradient Based on Double Network Prioritized Experience Replay [J].

Kang, Chaohai ;

Rong, Chuiting ;

Ren, Weijian ;

Huo, Fengcai ;

Liu, Pengyun .

IEEE ACCESS, 2021, 9 :60296-60308

[39] Speed planning and energy management strategy of hybrid electric vehicles in a car-following scenario [J].

Hou, Shengyan ;

Chen, Hong ;

Zhang, Yu ;

Gao, Jinwu .

CONTROL THEORY AND TECHNOLOGY, 2022, 20 (02) :185-196

[40] Speed planning and energy management strategy of hybrid electric vehicles in a car-following scenario [J].

Shengyan Hou ;

Hong Chen ;

Yu Zhang ;

Jinwu Gao .

Control Theory and Technology, 2022, 20 :185-196

← 1 2 3 4 5 →