PressLight: Learning Max Pressure Control to Coordinate Traffic Signals in Arterial Network

被引：245

作者：

Wei, Hua ^{[1
]}

Chen, Chacha ^{[2
]}

Zheng, Guanjie ^{[1
]}

Wu, Kan ^{[1
]}

Gayah, Vikash ^{[1
]}

Xu, Kai ^{[3
]}

Li, Zhenhui ^{[1
]}

机构：

[1] Penn State Univ, University Pk, PA 16802 USA

[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

[3] Shanghai Tianrang Intelligent Technol Co Ltd, Shanghai, Peoples R China

来源：

KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2019年

基金：

美国国家科学基金会;

关键词：

Deep reinforcement learning; traffic signal control; multi-agent system; MULTIAGENT SYSTEM;

D O I：

10.1145/3292500.3330949

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Traffic signal control is essential for transportation efficiency in road networks. It has been a challenging problem because of the complexity in traffic dynamics. Conventional transportation research suffers from the incompetency to adapt to dynamic traffic situations. Recent studies propose to use reinforcement learning (RL) to search for more efficient traffic signal plans. However, most existing RL-based studies design the key elements-reward and state - in a heuristic way. This results in highly sensitive performances and a long learning process. To avoid the heuristic design of RL elements, we propose to connect RL with recent studies in transportation research. Our method is inspired by the state-of-the-art method max pressure (MP) in the transportation field. The reward design of our method is well supported by the theory in MP, which can be proved to be maximizing the throughput of the traffic network, i.e., minimizing the overall network travel time. We also show that our concise state representation can fully support the optimization of the proposed reward function. Through comprehensive experiments, we demonstrate that our method outperforms both conventional transportation approaches and existing learning-based methods.

引用

页码：1290 / 1298

页数：9

共 32 条

[1] Holonic multi-agent system for traffic signals control [J].