CVDMARL: A Communication-Enhanced Value Decomposition Multi-Agent Reinforcement Learning Traffic Signal Control Method

被引:5
作者
Chang, Ande [1 ]
Ji, Yuting [2 ]
Wang, Chunguang [3 ]
Bie, Yiming [2 ]
机构
[1] Criminal Invest Police Univ China, Coll Forens Sci, Shenyang 110035, Peoples R China
[2] Jilin Univ, Sch Transportat, Changchun 130022, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Aerosp Engn, State Key Lab Strength & Vibrat Mech Struct, Xian 710049, Peoples R China
关键词
traffic signal control; deep reinforcement learning; multi-agent reinforcement learning; communication; traffic congestion; PREDICTION;
D O I
10.3390/su16052160
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Effective traffic signal control (TSC) plays an important role in reducing vehicle emissions and improving the sustainability of the transportation system. Recently, the feasibility of using multi-agent reinforcement learning technology for TSC has been widely verified. However, the process of mapping road network states onto actions has encountered many challenges, due to the limited communication between agents and the partial observability of the traffic environment. To address this problem, this paper proposes a communication-enhancement value decomposition, multi-agent reinforcement learning TSC method (CVDMARL). The model combines two communication methods: implicit and explicit communication, decouples the complex relationships among the multi-signal agents through the centralized-training and decentralized-execution paradigm, and uses a modified deep network to realize the mining and selective transmission of traffic flow features. We compare and analyze CVDMARL with six different baseline methods based on real datasets. The results show that compared to the optimal method MN_Light, among the baseline methods, CVDMARL's queue length during peak hours was reduced by 9.12%, the waiting time was reduced by 7.67%, and the convergence algebra was reduced by 7.97%. While enriching the information content, it also reduces communication overhead and has better control effects, providing a new idea for solving the collaborative control problem of multi-signalized intersections.
引用
收藏
页数:17
相关论文
共 47 条
[31]  
Wang Jianhao., 2020, arXiv
[32]   Meta-learning based spatial-temporal graph attention network for traffic signal control [J].
Wang, Min ;
Wu, Libing ;
Li, Man ;
Wu, Dan ;
Shi, Xiaochuan ;
Ma, Chao .
KNOWLEDGE-BASED SYSTEMS, 2022, 250
[33]   Traffic Signal Control With Reinforcement Learning Based on Region-Aware Cooperative Strategy [J].
Wang, Min ;
Wu, Libing ;
Li, Jianxin ;
He, Liu .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) :6774-6785
[34]   Adaptive Traffic Signal Control for large-scale scenario with Cooperative Group-based Multi-agent reinforcement learning [J].
Wang, Tong ;
Cao, Jiahua ;
Hussain, Azhar .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 125
[35]   Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning [J].
Wang, Xiaoqiang ;
Ke, Liangjun ;
Qiao, Zhimin ;
Chai, Xinghua .
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (01) :174-187
[36]   Multi-Agent Deep Reinforcement Learning for Urban Traffic Light Control in Vehicular Networks [J].
Wu, Tong ;
Zhou, Pan ;
Liu, Kai ;
Yuan, Yali ;
Wang, Xiumin ;
Huang, Huawei ;
Wu, Dapeng Oliver .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (08) :8243-8256
[37]   AGNP: Network-wide short-term probabilistic traffic speed prediction and imputation [J].
Xu, Meng ;
Di, Yining ;
Ding, Hongxing ;
Zhu, Zheng ;
Chen, Xiqun ;
Yang, Hai .
COMMUNICATIONS IN TRANSPORTATION RESEARCH, 2023, 3
[38]   Graph cooperation deep reinforcement learning for ecological urban traffic signal control [J].
Yan, Liping ;
Zhu, Lulong ;
Song, Kai ;
Yuan, Zhaohui ;
Yan, Yunjuan ;
Tang, Yue ;
Peng, Chan .
APPLIED INTELLIGENCE, 2023, 53 (06) :6248-6265
[39]   Hierarchical graph multi-agent reinforcement learning for traffic signal control [J].
Yang, Shantian .
INFORMATION SCIENCES, 2023, 634 :55-72
[40]   Intelligent vehicle pedestrian light (IVPL): A deep reinforcement learning approach for traffic signal control [J].
Yazdani, Mobin ;
Sarvi, Majid ;
Bagloee, Saeed Asadi ;
Nassir, Neema ;
Price, Jeff ;
Parineh, Hossein .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 149