Heuristic-Based Multi-Agent Deep Reinforcement Learning Approach for Coordinating Connected and Automated Vehicles at Non-Signalized Intersection

Cited by: 0
Authors
Guo, Zihan [1 ,2 ]
Wu, Yan [1 ,2 ]
Wang, Lifang [1 ,2 ]
Zhang, Junzhi [3 ]
Affiliations
[1] Chinese Acad Sci, Inst Elect Engn, Key Lab High Dens Electromagnet Power & Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Tsinghua Univ, Dept Automot Engn, Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China
Keywords
Heuristic algorithms; Deep reinforcement learning; Autonomous vehicles; Training; Delays; Transfer learning; Q-learning; Optimization; Merging; Game theory; Non-signalized intersection management; multi-agent deep reinforcement learning; zero-shot generalization; communication latency;
DOI
10.1109/TITS.2024.3407760
Chinese Library Classification number
TU [Architectural Science];
Discipline classification code
0813 ;
Abstract
One typical application of connected and automated vehicles (CAVs) is coordinating multiple CAVs at a non-signalized intersection in mixed traffic, a task that can benefit from multi-agent deep reinforcement learning (MDRL) approaches to improve overall coordination efficiency. This study proposes a heuristic-based MDRL algorithm (H-QMIX) built on QMIX, a value-based MDRL algorithm. H-QMIX incorporates a heuristic-based action mask module, composed of a stimulative passing sequence and safety restrictions on the CAVs' action space in the junction area, to guide CAVs efficiently and safely through intersections. Compared with other MDRL algorithms (e.g., IPPO, QMIX), H-QMIX demonstrates improved training performance in terms of safety and efficiency in two case studies: the first requires all CAVs to follow fixed routes, and the second allows CAVs to choose random routes. To assess the model's generalization ability, the trained models with the maximal episodic return are then transferred in a zero-shot manner to a more practical scenario with a certain vehicle-to-vehicle (V2V) communication delay. The simulation results indicate that H-QMIX remains robust under a certain level of communication delay. The code for this paper is available at: https://github.com/flammingRaven/heuristic_based_qmix.
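The action-mask idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (their code is in the repository linked above); it only shows the general technique of masking disallowed actions before greedy selection over per-agent Q-values. The function names, the `has_priority` flag, and the rule that a vehicle without priority may only stop are all assumptions made for illustration.

```python
import math


def masked_greedy_action(q_values, action_mask):
    """Pick the highest-valued action among those the mask allows.

    q_values: list of floats, one per discrete action.
    action_mask: list of bools, True where the action is allowed.
    """
    if len(q_values) != len(action_mask):
        raise ValueError("q_values and action_mask must have equal length")
    if not any(action_mask):
        raise ValueError("at least one action must be allowed")
    # Disallowed actions get -inf so they can never win the argmax.
    masked = [q if ok else -math.inf for q, ok in zip(q_values, action_mask)]
    return max(range(len(masked)), key=masked.__getitem__)


def junction_mask(n_actions, has_priority, stop_action=0):
    """Hypothetical safety rule: a vehicle without priority in the
    passing sequence may only choose the stop action."""
    if has_priority:
        return [True] * n_actions
    return [i == stop_action for i in range(n_actions)]


# Illustrative Q-values for three actions: stop, cruise, accelerate.
q = [0.1, 0.9, 0.4]
first_in_sequence = masked_greedy_action(q, junction_mask(3, has_priority=True))
must_yield = masked_greedy_action(q, junction_mask(3, has_priority=False))
```

With these illustrative values, the vehicle holding priority selects action 1, while the yielding vehicle is restricted to action 0 regardless of its Q-values.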
Pages: 16235-16248
Page count: 14
Related papers
50 in total
  • [1] Coordination for Connected and Automated Vehicles at Non-Signalized Intersections: A Value Decomposition-Based Multiagent Deep Reinforcement Learning Approach
    Guo, Zihan
    Wu, Yan
    Wang, Lifang
    Zhang, Junzhi
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (03) : 3025 - 3034
  • [2] Comprehensive Automated Driving Maneuvers under a Non-Signalized Intersection Adopting Deep Reinforcement Learning
    Quang-Duy Tran
    Bae, Sang-Hoon
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [3] A Comprehensive Survey on Multi-Agent Reinforcement Learning for Connected and Automated Vehicles
    Yadav, Pamul
    Mishra, Ashutosh
    Kim, Shiho
    SENSORS, 2023, 23 (10)
  • [4] Novel Edge Caching Approach Based on Multi-Agent Deep Reinforcement Learning for Internet of Vehicles
    Zhang, Degan
    Wang, Wenjing
    Zhang, Jie
    Zhang, Ting
    Du, Jinyu
    Yang, Chun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (08) : 8324 - 8338
  • [5] Proximal Policy Optimization Through a Deep Reinforcement Learning Framework for Multiple Autonomous Vehicles at a Non-Signalized Intersection
    Duy Quang Tran
    Bae, Sang-Hoon
    APPLIED SCIENCES-BASEL, 2020, 10 (16):
  • [6] Online parking assignment in an environment of partially connected vehicles: A multi-agent deep reinforcement learning approach
    Zhang, Xinyuan
    Zhao, Cong
    Liao, Feixiong
    Li, Xinghua
    Du, Yuchuan
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 138
  • [7] Multi-level objective control of AVs at a saturated signalized intersection with multi-agent deep reinforcement learning approach
    Lin, Wenfeng
    Hu, Xiaowei
    Wang, Jian
    JOURNAL OF INTELLIGENT AND CONNECTED VEHICLES, 2023, 6 (04) : 250 - 263
  • [8] Cooperative On-Ramp Merging Control of Connected and Automated Vehicles: Distributed Multi-Agent Deep Reinforcement Learning Approach
    Zhou, Shanxing
    Zhuang, Weichao
    Yin, Guodong
    Liu, Haoji
    Qiu, Chunlong
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 402 - 408
  • [9] Longitudinal control of connected and automated vehicles among signalized intersections in mixed traffic flow with deep reinforcement learning approach
    Liu, Chunyu
    Sheng, Zihao
    Chen, Sikai
    Shi, Haotian
    Ran, Bin
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2023, 629
  • [10] A Collaborative Control Scheme for Smart Vehicles Based on Multi-Agent Deep Reinforcement Learning
    Shi, Liyan
    Chen, Hairui
    IEEE ACCESS, 2023, 11 : 96221 - 96234