Learning Decentralized Traffic Signal Controllers With Multi-Agent Graph Reinforcement Learning

被引:1
作者
Zhang, Yao [1 ]
Yu, Zhiwen [2 ]
Zhang, Jun [3 ]
Wang, Liang [1 ]
Luan, Tom H. [4 ]
Guo, Bin [1 ]
Yuen, Chau [5 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710060, Peoples R China
[2] Harbin Engn Univ, Harbin 150009, Heilongjiang, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong 999077, Peoples R China
[4] Xidian Univ, Sch Cyber Engn, Xian 710071, Peoples R China
[5] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
基金
中国国家自然科学基金;
关键词
Graph learning; intelligent transportation systems; MARL; traffic signal control; NETWORK;
D O I
10.1109/TMC.2023.3332081
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcement Learning (MARL) is a promising solution. Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents. In this paper, we design a new decentralized control architecture with improved environmental observability to capture the spatial-temporal correlation. Specifically, we first develop a topology-aware information aggregation strategy to extract correlation-related information from unstructured data gathered in the road network. Particularly, we transfer the road network topology into a graph shift operator by forming a diffusion process on the topology, which subsequently facilitates the construction of graph signals. A diffusion convolution module is developed, forming a new MARL algorithm, which endows agents with the capabilities of graph learning. Extensive experiments based on both synthetic and real-world datasets verify that our proposal outperforms existing decentralized algorithms.
引用
收藏
页码:7180 / 7195
页数:16
相关论文
共 37 条
[1]  
[Anonymous], 1968, SIAM J. Control, V6, P131
[2]  
[Anonymous], 2021, Car parc in China from 2011 to 2021
[3]  
[Anonymous], 2012, Int. J.Adv. Syst. Meas., V5, P128
[4]   Adaptive traffic signal control with actor-critic methods in a real-world traffic network with different traffic disruption events [J].
Aslani, Mohammad ;
Mesgari, Mohammad Saadi ;
Wiering, Marco .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2017, 85 :732-752
[5]   Genetic algorithm solution for the stochastic equilibrium transportation networks under congestion [J].
Ceylan, H ;
Bell, MGH .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2005, 39 (02) :169-185
[6]   Discrete Signal Processing on Graphs: Sampling Theory [J].
Chen, Siheng ;
Varma, Rohan ;
Sandryhaila, Aliaksei ;
Kovacevic, Jelena .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (24) :6510-6523
[7]  
Cho KYHY, 2014, Arxiv, DOI [arXiv:1409.1259, DOI 10.48550/ARXIV.1409.1259]
[8]   Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control [J].
Chu, Tianshu ;
Wang, Jie ;
Codeca, Lara ;
Li, Zhaojian .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) :1086-1095
[9]  
Chung JY, 2014, Arxiv, DOI [arXiv:1412.3555, DOI 10.48550/ARXIV.1412.3555]
[10]  
Codeca L., 2018, EPiC Series in Engineering, V2, P43, DOI [10.29007/1zt5,, DOI 10.29007/1ZT5]