PressLight: Learning Max Pressure Control to Coordinate Traffic Signals in Arterial Network

被引:210
作者
Wei, Hua [1 ]
Chen, Chacha [2 ]
Zheng, Guanjie [1 ]
Wu, Kan [1 ]
Gayah, Vikash [1 ]
Xu, Kai [3 ]
Li, Zhenhui [1 ]
机构
[1] Penn State Univ, University Pk, PA 16802 USA
[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[3] Shanghai Tianrang Intelligent Technol Co Ltd, Shanghai, Peoples R China
来源
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2019年
基金
美国国家科学基金会;
关键词
Deep reinforcement learning; traffic signal control; multi-agent system; MULTIAGENT SYSTEM;
D O I
10.1145/3292500.3330949
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traffic signal control is essential for transportation efficiency in road networks. It has been a challenging problem because of the complexity in traffic dynamics. Conventional transportation research suffers from the incompetency to adapt to dynamic traffic situations. Recent studies propose to use reinforcement learning (RL) to search for more efficient traffic signal plans. However, most existing RL-based studies design the key elements-reward and state - in a heuristic way. This results in highly sensitive performances and a long learning process. To avoid the heuristic design of RL elements, we propose to connect RL with recent studies in transportation research. Our method is inspired by the state-of-the-art method max pressure (MP) in the transportation field. The reward design of our method is well supported by the theory in MP, which can be proved to be maximizing the throughput of the traffic network, i.e., minimizing the overall network travel time. We also show that our concise state representation can fully support the optimization of the proposed reward function. Through comprehensive experiments, we demonstrate that our method outperforms both conventional transportation approaches and existing learning-based methods.
引用
收藏
页码:1290 / 1298
页数:9
相关论文
共 32 条
  • [1] Holonic multi-agent system for traffic signals control
    Abdoos, Monireh
    Mozayani, Nasser
    Bazzan, Ana L. C.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (5-6) : 1575 - 1587
  • [2] Reinforcement learning for True Adaptive traffic signal control
    Abdulhai, B
    Pringle, R
    Karakoulas, GJ
    [J]. JOURNAL OF TRANSPORTATION ENGINEERING, 2003, 129 (03) : 278 - 285
  • [3] [Anonymous], ARXIV170408883
  • [4] Reinforcement learning-based multi-agent system for network traffic signal control
    Arel, I.
    Liu, C.
    Urbanik, T.
    Kohls, A. G.
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) : 128 - 135
  • [5] The real-time urban traffic control system CRONOS:: Algorithm and experiments
    Boillot, Florence
    Midenet, Sophie
    Pierrelee, Jean-Claude
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2006, 14 (01) : 18 - 38
  • [6] Distributed learning and multi-objectivity in traffic light control
    Brys, Tim
    Pham, Tong T.
    Taylor, Matthew E.
    [J]. CONNECTION SCIENCE, 2014, 26 (01) : 65 - 83
  • [7] Casas Noe, 2017, arXiv:1703.09035
  • [8] da Silva A.B. C., 2006, Conference on Autonomous Agents and Multiagent Systems (AAMAS), P80
  • [9] El-Tantawy S., 2010, 2010 13th International IEEE Conference on Intelligent Transportation Systems (ITSC 2010), P665, DOI 10.1109/ITSC.2010.5625066
  • [10] Multiagent Reinforcement Learning for Integrated Network of Adaptive Traffic Signal Controllers (MARLIN-ATSC): Methodology and Large-Scale Application on Downtown Toronto
    El-Tantawy, Samah
    Abdulhai, Baher
    Abdelgawad, Hossam
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2013, 14 (03) : 1140 - 1150