Learning Decentralized Traffic Signal Controllers With Multi-Agent Graph Reinforcement Learning

被引：1

作者：

Zhang, Yao ^{[1
]}

Yu, Zhiwen ^{[2
]}

Zhang, Jun ^{[3
]}

Wang, Liang ^{[1
]}

Luan, Tom H. ^{[4
]}

Guo, Bin ^{[1
]}

Yuen, Chau ^{[5
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710060, Peoples R China

[2] Harbin Engn Univ, Harbin 150009, Heilongjiang, Peoples R China

[3] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong 999077, Peoples R China

[4] Xidian Univ, Sch Cyber Engn, Xian 710071, Peoples R China

[5] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Graph learning; intelligent transportation systems; MARL; traffic signal control; NETWORK;

D O I：

10.1109/TMC.2023.3332081

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcement Learning (MARL) is a promising solution. Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents. In this paper, we design a new decentralized control architecture with improved environmental observability to capture the spatial-temporal correlation. Specifically, we first develop a topology-aware information aggregation strategy to extract correlation-related information from unstructured data gathered in the road network. Particularly, we transfer the road network topology into a graph shift operator by forming a diffusion process on the topology, which subsequently facilitates the construction of graph signals. A diffusion convolution module is developed, forming a new MARL algorithm, which endows agents with the capabilities of graph learning. Extensive experiments based on both synthetic and real-world datasets verify that our proposal outperforms existing decentralized algorithms.

引用

页码：7180 / 7195

页数：16

共 37 条

[1]

[Anonymous], 1968, SIAM J. Control, V6, P131

[2]

[Anonymous], 2021, Car parc in China from 2011 to 2021

[3]

[Anonymous], 2012, Int. J.Adv. Syst. Meas., V5, P128

[4] Adaptive traffic signal control with actor-critic methods in a real-world traffic network with different traffic disruption events [J].

Aslani, Mohammad ;

Mesgari, Mohammad Saadi ;

Wiering, Marco .

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2017, 85 :732-752

[5] Genetic algorithm solution for the stochastic equilibrium transportation networks under congestion [J].

Ceylan, H ;

Bell, MGH .

TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2005, 39 (02) :169-185

[6] Discrete Signal Processing on Graphs: Sampling Theory [J].

Chen, Siheng ;

Varma, Rohan ;

Sandryhaila, Aliaksei ;

Kovacevic, Jelena .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (24) :6510-6523

[7]

Cho KYHY, 2014, Arxiv, DOI [arXiv:1409.1259, DOI 10.48550/ARXIV.1409.1259]

[8] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control [J].

Chu, Tianshu ;

Wang, Jie ;

Codeca, Lara ;

Li, Zhaojian .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) :1086-1095

[9]

Chung JY, 2014, Arxiv, DOI [arXiv:1412.3555, DOI 10.48550/ARXIV.1412.3555]

[10]

Codeca L., 2018, EPiC Series in Engineering, V2, P43, DOI [10.29007/1zt5,, DOI 10.29007/1ZT5]

← 1 2 3 4 →