Model-Based Graph Reinforcement Learning for Inductive Traffic Signal Control

被引:3
作者
Devailly, Francois-Xavier [1 ]
Larocque, Denis [1 ]
Charlin, Laurent [1 ]
机构
[1] HEC Montreal, Dept Decis Sci, Montreal, PQ H3T 2A7, Canada
来源
IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS | 2024年 / 5卷
基金
加拿大自然科学与工程研究理事会;
关键词
Adaptive traffic signal control; transfer learning; multi-agent reinforcement learning; joint action modeling; model-based reinforcement learning; graph neural networks; NETWORK; GO;
D O I
10.1109/OJITS.2024.3376583
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce MuJAM, an adaptive traffic signal control method which leverages model-based reinforcement learning to 1) extend recent generalization efforts (to road network architectures and traffic distributions) further by allowing a generalization to the controllers' constraints (cyclic and acyclic policies), 2) improve performance and data efficiency over related model-free approaches, and 3) enable explicit coordination at scale for the first time. In a zero-shot transfer setting involving both road networks and traffic settings never experienced during training, and in a larger transfer experiment involving the control of 3,971 traffic signal controllers in Manhattan, we show that MuJAM, using both cyclic and acyclic constraints, outperforms domain-specific baselines as well as a recent transferable approach.
引用
收藏
页码:238 / 250
页数:13
相关论文
共 55 条
  • [1] Abdoos M, 2011, IEEE INT C INTELL TR, P1580, DOI 10.1109/ITSC.2011.6083114
  • [2] Reinforcement learning-based multi-agent system for network traffic signal control
    Arel, I.
    Liu, C.
    Urbanik, T.
    Kohls, A. G.
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) : 128 - 135
  • [3] Traffic signal optimization through discrete and continuous reinforcement learning with robustness analysis in downtown Tehran
    Aslani, Mohammad
    Seipel, Stefan
    Mesgari, Mohammad Saadi
    Wiering, Marco
    [J]. ADVANCED ENGINEERING INFORMATICS, 2018, 38 : 639 - 655
  • [4] Adaptive traffic signal control with actor-critic methods in a real-world traffic network with different traffic disruption events
    Aslani, Mohammad
    Mesgari, Mohammad Saadi
    Wiering, Marco
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2017, 85 : 732 - 752
  • [5] Urban traffic signal control using reinforcement learning agents
    Balaji, P. G.
    German, X.
    Srinivasan, D.
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (03) : 177 - 188
  • [6] Real-World Carbon Dioxide Impacts of Traffic Congestion
    Barth, Matthew
    Boriboonsomsin, Kanok
    [J]. TRANSPORTATION RESEARCH RECORD, 2008, 2058 (2058) : 163 - 171
  • [7] Bracknell U.K., 1990, Scats-a Traffic Responsive Method of Controlling UrbanTraffic: Sales Information Brochure
  • [8] Casas N, 2017, Arxiv, DOI arXiv:1703.09035
  • [9] Chen CC, 2020, AAAI CONF ARTIF INTE, V34, P3414
  • [10] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095