Model-Based Graph Reinforcement Learning for Inductive Traffic Signal Control

被引：3

作者：

Devailly, Francois-Xavier ^{[1
]}

Larocque, Denis ^{[1
]}

Charlin, Laurent ^{[1
]}

机构：

[1] HEC Montreal, Dept Decis Sci, Montreal, PQ H3T 2A7, Canada

来源：

IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS | 2024年 / 5卷

基金：

加拿大自然科学与工程研究理事会;

关键词：

Adaptive traffic signal control; transfer learning; multi-agent reinforcement learning; joint action modeling; model-based reinforcement learning; graph neural networks; NETWORK; GO;

D O I：

10.1109/OJITS.2024.3376583

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We introduce MuJAM, an adaptive traffic signal control method which leverages model-based reinforcement learning to 1) extend recent generalization efforts (to road network architectures and traffic distributions) further by allowing a generalization to the controllers' constraints (cyclic and acyclic policies), 2) improve performance and data efficiency over related model-free approaches, and 3) enable explicit coordination at scale for the first time. In a zero-shot transfer setting involving both road networks and traffic settings never experienced during training, and in a larger transfer experiment involving the control of 3,971 traffic signal controllers in Manhattan, we show that MuJAM, using both cyclic and acyclic constraints, outperforms domain-specific baselines as well as a recent transferable approach.

引用

页码：238 / 250

页数：13

共 55 条

[1] Abdoos M, 2011, IEEE INT C INTELL TR, P1580, DOI 10.1109/ITSC.2011.6083114
[2] Reinforcement learning-based multi-agent system for network traffic signal control
Arel, I.
Liu, C.
Urbanik, T.
Kohls, A. G.
[J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) : 128 - 135
[3] Traffic signal optimization through discrete and continuous reinforcement learning with robustness analysis in downtown Tehran
Aslani, Mohammad
Seipel, Stefan
Mesgari, Mohammad Saadi
Wiering, Marco
[J]. ADVANCED ENGINEERING INFORMATICS, 2018, 38 : 639 - 655
[4] Adaptive traffic signal control with actor-critic methods in a real-world traffic network with different traffic disruption events
Aslani, Mohammad
Mesgari, Mohammad Saadi
Wiering, Marco
[J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2017, 85 : 732 - 752
[5] Urban traffic signal control using reinforcement learning agents
Balaji, P. G.
German, X.
Srinivasan, D.
[J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (03) : 177 - 188
[6] Real-World Carbon Dioxide Impacts of Traffic Congestion
Barth, Matthew
Boriboonsomsin, Kanok
[J]. TRANSPORTATION RESEARCH RECORD, 2008, 2058 (2058) : 163 - 171
[7] Bracknell U.K., 1990, Scats-a Traffic Responsive Method of Controlling UrbanTraffic: Sales Information Brochure
[8] Casas N, 2017, Arxiv, DOI arXiv:1703.09035
[9] Chen CC, 2020, AAAI CONF ARTIF INTE, V34, P3414
[10] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
Chu, Tianshu
Wang, Jie
Codeca, Lara
Li, Zhaojian
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095

← 1 2 3 4 5 6 →