Heuristic-Based Multi-Agent Deep Reinforcement Learning Approach for Coordinating Connected and Automated Vehicles at Non-Signalized Intersection

被引：0

作者：

Guo, Zihan ^{[1
,2
]}

Wu, Yan ^{[1
,2
]}

Wang, Lifang ^{[1
,2
]}

Zhang, Junzhi ^{[3
]}

机构：

[1] Chinese Acad Sci, Inst Elect Engn, Key Lab High Dens Electromagnet Power & Syst, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[3] Tsinghua Univ, Dept Automot Engn, Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024年 / 25卷 / 11期

关键词：

Heuristic algorithms; Deep reinforcement learning; Autonomous vehicles; Training; Delays; Transfer learning; Q-learning; Optimization; Merging; Game theory; Non-signalized intersection management; multi-agent deep reinforcement learning; zero-shot generalization; communication latency;

D O I：

10.1109/TITS.2024.3407760

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

One typical application of connected and automated vehicles (CAVs) is to coordinate multiple CAVs at a non-signalized intersection in mixed traffic, and it may take advantage of multi-agent deep reinforcement learning (MDRL) approaches to improve the overall coordination efficiency. This study proposes a heuristic-based MDRL algorithm (H-QMIX) developed based on a value-based MDRL algorithm, QMIX. This algorithm incorporates a heuristic-based action mask module to guide CAVs efficiently and safely through intersections, composed of a stimulative passing sequence and safety restrictions on CAVs' action space in the junction area. Compared with other MDRL algorithms (e.g., IPPO, QMIX), the H-QMIX algorithm demonstrates improved training performance in terms of safety and efficiency in two case studies, where the first requires all CAVs to affix their routes, and another allows CAVs to choose random routes. Concerning the model's generalization ability, the trained models with the maximal episodic return are then transferred to a more practical scenario with a certain vehicle-to-vehicle (V2V) communication delay in a zero-shot manner. The simulation results illustrate that H-QMIX is robust to a certain communication delay. The code for this paper is available at: https://github.com/flammingRaven/heuristic_based_qmix.

引用

页码：16235 / 16248

页数：14

共 50 条

[31] A multi-agent deep reinforcement learning approach for traffic signal coordination
Hu, Ta-Yin
Li, Zhuo-Yu
IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 (08) : 1428 - 1444
[32] Multi-agent reinforcement learning for cooperative lane changing of connected and autonomous vehicles in mixed traffic
Zhou W.
Chen D.
Yan J.
Li Z.
Yin H.
Ge W.
Autonomous Intelligent Systems, 2022, 2 (01):
[33] Heterogeneous multi-agent deep reinforcement learning for eco-driving of hybrid electric tracked vehicles: A heuristic training framework
Su, Qicong
Huang, Ruchen
He, Hongwen
JOURNAL OF POWER SOURCES, 2024, 601
[34] Distributed Task Offloading based on Multi-Agent Deep Reinforcement Learning
Hu, Shucheng
Ren, Tao
Niu, Jianwei
Hu, Zheyuan
Xing, Guoliang
2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 575 - 583
[35] Distributed interference coordination based on multi-agent deep reinforcement learning
Liu T.
Luo Y.
Yang C.
Tongxin Xuebao/Journal on Communications, 2020, 41 (07): : 38 - 48
[36] Multi-Agent Deep Reinforcement Learning Based Distributed Resource Allocation
Urmonov, Odilbek
Kim, HyungWon
2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
[37] UAV Swarm Confrontation Based on Multi-agent Deep Reinforcement Learning
Wang, Zhi
Liu, Fan
Guo, Jing
Hong, Chen
Chen, Ming
Wang, Ershen
Zhao, Yunbo
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4996 - 5001
[38] Network slicing for vehicular communications: a multi-agent deep reinforcement learning approach
Mlika, Zoubeir
Cherkaoui, Soumaya
ANNALS OF TELECOMMUNICATIONS, 2021, 76 (9-10) : 665 - 683
[39] MADDPGViz: a visual analytics approach to understand multi-agent deep reinforcement learning
Xiaoying Shi
Jiaming Zhang
Ziyi Liang
Dewen Seng
Journal of Visualization, 2023, 26 : 1189 - 1205
[40] Network slicing for vehicular communications: a multi-agent deep reinforcement learning approach
Zoubeir Mlika
Soumaya Cherkaoui
Annals of Telecommunications, 2021, 76 : 665 - 683

← 1 2 3 4 5 →