Heuristic-Based Multi-Agent Deep Reinforcement Learning Approach for Coordinating Connected and Automated Vehicles at Non-Signalized Intersection

被引：0

作者：

Guo, Zihan ^{[1
,2
]}

Wu, Yan ^{[1
,2
]}

Wang, Lifang ^{[1
,2
]}

Zhang, Junzhi ^{[3
]}

机构：

[1] Chinese Acad Sci, Inst Elect Engn, Key Lab High Dens Electromagnet Power & Syst, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[3] Tsinghua Univ, Dept Automot Engn, Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024年 / 25卷 / 11期

关键词：

Heuristic algorithms; Deep reinforcement learning; Autonomous vehicles; Training; Delays; Transfer learning; Q-learning; Optimization; Merging; Game theory; Non-signalized intersection management; multi-agent deep reinforcement learning; zero-shot generalization; communication latency;

D O I：

10.1109/TITS.2024.3407760

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

One typical application of connected and automated vehicles (CAVs) is to coordinate multiple CAVs at a non-signalized intersection in mixed traffic, and it may take advantage of multi-agent deep reinforcement learning (MDRL) approaches to improve the overall coordination efficiency. This study proposes a heuristic-based MDRL algorithm (H-QMIX) developed based on a value-based MDRL algorithm, QMIX. This algorithm incorporates a heuristic-based action mask module to guide CAVs efficiently and safely through intersections, composed of a stimulative passing sequence and safety restrictions on CAVs' action space in the junction area. Compared with other MDRL algorithms (e.g., IPPO, QMIX), the H-QMIX algorithm demonstrates improved training performance in terms of safety and efficiency in two case studies, where the first requires all CAVs to affix their routes, and another allows CAVs to choose random routes. Concerning the model's generalization ability, the trained models with the maximal episodic return are then transferred to a more practical scenario with a certain vehicle-to-vehicle (V2V) communication delay in a zero-shot manner. The simulation results illustrate that H-QMIX is robust to a certain communication delay. The code for this paper is available at: https://github.com/flammingRaven/heuristic_based_qmix.

引用

页码：16235 / 16248

页数：14

共 50 条

[1] Coordination for Connected and Automated Vehicles at Non-Signalized Intersections: A Value Decomposition-Based Multiagent Deep Reinforcement Learning Approach
Guo, Zihan
Wu, Yan
Wang, Lifang
Zhang, Junzhi
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (03) : 3025 - 3034
[2] Comprehensive Automated Driving Maneuvers under a Non-Signalized Intersection Adopting Deep Reinforcement Learning
Quang-Duy Tran
Bae, Sang-Hoon
APPLIED SCIENCES-BASEL, 2022, 12 (19):
[3] A Comprehensive Survey on Multi-Agent Reinforcement Learning for Connected and Automated Vehicles
Yadav, Pamul
Mishra, Ashutosh
Kim, Shiho
SENSORS, 2023, 23 (10)
[4] Novel Edge Caching Approach Based on Multi-Agent Deep Reinforcement Learning for Internet of Vehicles
Zhang, Degan
Wang, Wenjing
Zhang, Jie
Zhang, Ting
Du, Jinyu
Yang, Chun
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (08) : 8324 - 8338
[5] Proximal Policy Optimization Through a Deep Reinforcement Learning Framework for Multiple Autonomous Vehicles at a Non-Signalized Intersection
Duy Quang Tran
Bae, Sang-Hoon
APPLIED SCIENCES-BASEL, 2020, 10 (16):
[6] Online parking assignment in an environment of partially connected vehicles: A multi-agent deep reinforcement learning approach
Zhang, Xinyuan
Zhao, Cong
Liao, Feixiong
Li, Xinghua
Du, Yuchuan
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 138
[7] Multi-level objective control of AVs at a saturated signalized intersection with multi-agent deep reinforcement learning approach
Lin, Wenfeng
Hu, Xiaowei
Wang, Jian
JOURNAL OF INTELLIGENT AND CONNECTED VEHICLES, 2023, 6 (04) : 250 - 263
[8] Cooperative On-Ramp Merging Control of Connected and Automated Vehicles: Distributed Multi-Agent Deep Reinforcement Learning Approach
Zhou, Shanxing
Zhuang, Weichao
Yin, Guodong
Liu, Haoji
Qiu, Chunlong
2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 402 - 408
[9] Longitudinal control of connected and automated vehicles among signalized intersections in mixed traffic flow with deep reinforcement learning approach
Liu, Chunyu
Sheng, Zihao
Chen, Sikai
Shi, Haotian
Ran, Bin
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2023, 629
[10] A Collaborative Control Scheme for Smart Vehicles Based on Multi-Agent Deep Reinforcement Learning
Shi, Liyan
Chen, Hairui
IEEE ACCESS, 2023, 11 : 96221 - 96234

← 1 2 3 4 5 →