Heuristic-Based Multi-Agent Deep Reinforcement Learning Approach for Coordinating Connected and Automated Vehicles at Non-Signalized Intersection

被引:0
|
作者
Guo, Zihan [1 ,2 ]
Wu, Yan [1 ,2 ]
Wang, Lifang [1 ,2 ]
Zhang, Junzhi [3 ]
机构
[1] Chinese Acad Sci, Inst Elect Engn, Key Lab High Dens Electromagnet Power & Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Tsinghua Univ, Dept Automot Engn, Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China
关键词
Heuristic algorithms; Deep reinforcement learning; Autonomous vehicles; Training; Delays; Transfer learning; Q-learning; Optimization; Merging; Game theory; Non-signalized intersection management; multi-agent deep reinforcement learning; zero-shot generalization; communication latency;
D O I
10.1109/TITS.2024.3407760
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
One typical application of connected and automated vehicles (CAVs) is to coordinate multiple CAVs at a non-signalized intersection in mixed traffic, and it may take advantage of multi-agent deep reinforcement learning (MDRL) approaches to improve the overall coordination efficiency. This study proposes a heuristic-based MDRL algorithm (H-QMIX) developed based on a value-based MDRL algorithm, QMIX. This algorithm incorporates a heuristic-based action mask module to guide CAVs efficiently and safely through intersections, composed of a stimulative passing sequence and safety restrictions on CAVs' action space in the junction area. Compared with other MDRL algorithms (e.g., IPPO, QMIX), the H-QMIX algorithm demonstrates improved training performance in terms of safety and efficiency in two case studies, where the first requires all CAVs to affix their routes, and another allows CAVs to choose random routes. Concerning the model's generalization ability, the trained models with the maximal episodic return are then transferred to a more practical scenario with a certain vehicle-to-vehicle (V2V) communication delay in a zero-shot manner. The simulation results illustrate that H-QMIX is robust to a certain communication delay. The code for this paper is available at: https://github.com/flammingRaven/heuristic_based_qmix.
引用
收藏
页码:16235 / 16248
页数:14
相关论文
共 50 条
  • [31] A multi-agent deep reinforcement learning approach for traffic signal coordination
    Hu, Ta-Yin
    Li, Zhuo-Yu
    IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 (08) : 1428 - 1444
  • [32] Multi-agent reinforcement learning for cooperative lane changing of connected and autonomous vehicles in mixed traffic
    Zhou W.
    Chen D.
    Yan J.
    Li Z.
    Yin H.
    Ge W.
    Autonomous Intelligent Systems, 2022, 2 (01):
  • [33] Heterogeneous multi-agent deep reinforcement learning for eco-driving of hybrid electric tracked vehicles: A heuristic training framework
    Su, Qicong
    Huang, Ruchen
    He, Hongwen
    JOURNAL OF POWER SOURCES, 2024, 601
  • [34] Distributed Task Offloading based on Multi-Agent Deep Reinforcement Learning
    Hu, Shucheng
    Ren, Tao
    Niu, Jianwei
    Hu, Zheyuan
    Xing, Guoliang
    2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 575 - 583
  • [35] Distributed interference coordination based on multi-agent deep reinforcement learning
    Liu T.
    Luo Y.
    Yang C.
    Tongxin Xuebao/Journal on Communications, 2020, 41 (07): : 38 - 48
  • [36] Multi-Agent Deep Reinforcement Learning Based Distributed Resource Allocation
    Urmonov, Odilbek
    Kim, HyungWon
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [37] UAV Swarm Confrontation Based on Multi-agent Deep Reinforcement Learning
    Wang, Zhi
    Liu, Fan
    Guo, Jing
    Hong, Chen
    Chen, Ming
    Wang, Ershen
    Zhao, Yunbo
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4996 - 5001
  • [38] Network slicing for vehicular communications: a multi-agent deep reinforcement learning approach
    Mlika, Zoubeir
    Cherkaoui, Soumaya
    ANNALS OF TELECOMMUNICATIONS, 2021, 76 (9-10) : 665 - 683
  • [39] MADDPGViz: a visual analytics approach to understand multi-agent deep reinforcement learning
    Xiaoying Shi
    Jiaming Zhang
    Ziyi Liang
    Dewen Seng
    Journal of Visualization, 2023, 26 : 1189 - 1205
  • [40] Network slicing for vehicular communications: a multi-agent deep reinforcement learning approach
    Zoubeir Mlika
    Soumaya Cherkaoui
    Annals of Telecommunications, 2021, 76 : 665 - 683