A Deep Reinforcement Learning Network for Traffic Light Cycle Control

被引：391

作者：

Liang, Xiaoyuan ^{[1
]}

Du, Xunsheng ^{[2
]}

Wang, Guiling ^{[1
]}

Han, Zhu ^{[2
,3
]}

机构：

[1] New Jersey Inst Technol, Dept Comp Sci, Newark, NJ 07102 USA

[2] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77004 USA

[3] Kyung Hee Univ, Dept Comp Sci & Engn, Seoul, South Korea

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2019年 / 68卷 / 02期

基金：

美国国家科学基金会;

关键词：

Reinforcement learning; deep learning; traffic light control; vehicular network; MULTIAGENT SYSTEM; OPTIMIZATION; GAME; GO;

D O I：

10.1109/TVT.2018.2890726

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Existing inefficient traffic light cycle control causes numerous problems, such as long delay and waste of energy. To improve efficiency, taking real-time traffic information as an input and dynamically adjusting the traffic light duration accordingly is a must. Existing works either split the traffic signal into equal duration or only leverage limited traffic information. In this paper, we study how to decide the traffic signal duration based on the collected data from different sensors. We propose a deep reinforcement learning model to control the traffic light cycle. In the model, we quantify the complex traffic scenario as states by collecting traffic data and dividing the whole intersection into small grids. The duration changes of a traffic light are the actions, which are modeled as a high-dimension Markov decision process. The reward is the cumulative waiting time difference between two cycles. To solve the model, a convolutional neural network is employed to map states to rewards. The proposed model incorporates multiple optimization elements to improve the performance, such as dueling network, target network, double Q-learning network, and prioritized experience replay. We evaluate our model via simulation on a Simulation of Urban MObility simulator. Simulation results show the efficiency of our model in controlling traffic lights.

引用

页码：1243 / 1253

页数：11

共 33 条

[1]

Abadi M., 2015, P 12 USENIX S OPERAT

[2] Holonic multi-agent system for traffic signals control [J].

Abdoos, Monireh ;

Mozayani, Nasser ;

Bazzan, Ana L. C. .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (5-6) :1575-1587

[3]

[Anonymous], 2016, P 4 INT C LEARN REPR

[4] Reinforcement learning-based multi-agent system for network traffic signal control [J].

Arel, I. ;

Liu, C. ;

Urbanik, T. ;

Kohls, A. G. .

IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) :128-135

[5] Urban traffic signal control using reinforcement learning agents [J].

Balaji, P. G. ;

German, X. ;

Srinivasan, D. .

IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (03) :177-188

[6]

Casas N., 2017, DEEP DETERMINI UNPUB

[7]

CHIU S, 1993, SECOND IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, P1371, DOI 10.1109/FUZZY.1993.327593

[8] Design of Reinforcement Learning Parameters for Seamless Application of Adaptive Traffic Signal Control [J].

El-Tantawy, Samah ;

Abdulhai, Baher ;

Abdelgawad, Hossam .

JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 18 (03) :227-245

[9] State-of-the-Art Deep Learning: Evolving Machine Intelligence Toward Tomorrow's Intelligent Network Traffic Control Systems [J].

Fadlullah, Zubair Md. ;

Tang, Fengxiao ;

Mao, Bomin ;

Kato, Nei ;

Akashi, Osamu ;

Inoue, Takeru ;

Mizutani, Kimihiro .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2017, 19 (04) :2432-2455

[10]

Gao J., 2017, ADAPTIVE TRAFF UNPUB

← 1 2 3 4 →