Hierarchical multi-agent control of traffic lights based on collective learning

被引:46
作者
Jin, Junchen [1 ]
Ma, Xiaoliang [1 ,2 ]
机构
[1] KTH Royal Inst Technol, Dept Transport Sci, Syst Simulat & Control S2CLab, Tekn Ringen 10, S-10044 Stockholm, Sweden
[2] iTekn Solut, Stockholm, Sweden
关键词
Hierarchical model of traffic system; Multi-agent traffic light control; Decentralized system; Learning-based control; Collective machine learning; SIGNAL CONTROL-SYSTEM;
D O I
10.1016/j.engappai.2017.10.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increasing traffic congestion poses significant challenges for urban planning and management in metropolitan areas around the world. One way to tackle the problem is to resort to the emerging technologies in artificial intelligence. Traffic light control is one of the most traditional and important instruments for urban traffic management. The present study proposes a traffic light control system enabled by a hierarchical multi-agent modeling framework in a decentralized manner. in the framework, a traffic network is decomposed into regions represented by region agents. Each region consists of intersections, modeled by intersection agents who coordinate with neighboring intersection agents through communication. For each intersection, a collection of turning movement agents operate individually and implement optimal actions according to local control policies. By employing a reinforcement learning algorithm for each turning movement agent, the intersection controllers are enabled with the capability to make their timing decisions in a complex and dynamic environment. In addition, the traffic light control operates with an advanced phase composition process dynamically combining compatible turning movements. Moreover, the collective operations performed by the agents in a road network are further coordinated by varying priority settings for relevant turning movements. A case study was carried out by simulations to evaluate the performance of the proposed control system while comparing it with an optimized vehicle-actuated control system. The results show that the proposed traffic light system, after a collective machine learning process, not only improves the local signal operations at individual intersections but also enhances the traffic performance at the regional level through coordination of specific turning movements. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:236 / 248
页数:13
相关论文
共 31 条
[1]   Hierarchical control of traffic signals using Q-learning with tile coding [J].
Abdoos, Monireh ;
Mozayani, Nasser ;
Bazzan, Ana L. C. .
APPLIED INTELLIGENCE, 2014, 40 (02) :201-213
[2]   Holonic multi-agent system for traffic signals control [J].
Abdoos, Monireh ;
Mozayani, Nasser ;
Bazzan, Ana L. C. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (5-6) :1575-1587
[3]  
[Anonymous], 2013, ADV APPL SELF ORG SY, DOI [10.1007/978-1-4471-5113-5_3, DOI 10.1007/978-1-4471-5113-5_3]
[4]  
[Anonymous], 2012, International journal on advances in systems and measurements
[5]   Reinforcement learning-based multi-agent system for network traffic signal control [J].
Arel, I. ;
Liu, C. ;
Urbanik, T. ;
Kohls, A. G. .
IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) :128-135
[6]   Urban traffic signal control using reinforcement learning agents [J].
Balaji, P. G. ;
German, X. ;
Srinivasan, D. .
IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (03) :177-188
[7]  
Barto A., 1998, Reinforcement Learning: an Introduction
[8]   Learning in groups of traffic signals [J].
Bazzan, Ana L. C. ;
de Oliveira, Denise ;
da Silva, Bruno C. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2010, 23 (04) :560-568
[9]   The real-time urban traffic control system CRONOS:: Algorithm and experiments [J].
Boillot, Florence ;
Midenet, Sophie ;
Pierrelee, Jean-Claude .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2006, 14 (01) :18-38
[10]   Adaptive traffic signal control using approximate dynamic programming [J].
Cai, Chen ;
Wong, Chi Kwong ;
Heydecker, Benjamin G. .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2009, 17 (05) :456-474