Hierarchical control of traffic signals using Q-learning with tile coding

被引：59

作者：

Abdoos, Monireh ^{[1
]}

Mozayani, Nasser ^{[1
]}

Bazzan, Ana L. C. ^{[2
]}

机构：

[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran

[2] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil

来源：

APPLIED INTELLIGENCE | 2014年 / 40卷 / 02期

关键词：

Multi-agent systems; Hierarchical control; Traffic signals; Q-learning; Tile coding; MULTIAGENT SYSTEMS; AGENT TECHNOLOGY; REINFORCEMENT;

D O I：

10.1007/s10489-013-0455-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-agent systems are rapidly growing as powerful tools for Intelligent Transportation Systems (ITS). It is desirable that traffic signals control, as a part of ITS, is performed in a distributed model. Therefore agent-based technologies can be efficiently used for traffic signals control. For traffic networks which are composed of multiple intersections, distributed control achieves better results in comparison to centralized methods. Hierarchical structures are useful to decompose the network into multiple sub-networks and provide a mechanism for distributed control of the traffic signals. In this paper, a two-level hierarchical control of traffic signals based on Q-learning is presented. Traffic signal controllers, located at intersections, can be seen as autonomous agents in the first level (at the bottom of the hierarchy) which use Q-learning to learn a control policy. The network is divided into some regions where an agent is assigned to control each region at the second level (top of the hierarchy). Due to the combinational explosion in the number of states and actions, i.e. features, the use of Q-learning is impractical. Therefore, in the top level, tile coding is used as a linear function approximation method. A network composed of 9 intersections arranged in a 3 x 3 grid is used for the simulation. Experimental results show that the proposed hierarchical control improves the Q-learning efficiency of the bottom level agents. The impact of the parameters used in tile coding is also analyzed.

引用

页码：201 / 213

页数：13

共 30 条

[1]

Abdoos Monireh, 2012, Agent and Multi-Agent Systems. Technologies and Applications. Proceedings 6th KES International Conference, KES-AMSTA 2012, P379, DOI 10.1007/978-3-642-30947-2_42

[2]

Abdoos M, 2011, IEEE INT C INTELL TR, P1580, DOI 10.1109/ITSC.2011.6083114

[3]

[Anonymous], 2000, P MACHINE LEARNING

[4] Reinforcement learning-based multi-agent system for network traffic signal control [J].

Arel, I. ;

Liu, C. ;

Urbanik, T. ;

Kohls, A. G. .

IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) :128-135

[5] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control [J].

Bazzan, Ana L. C. .

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) :342-375

[6]

BIELLI M, 1994, ARTIFICIAL INTELLIGE

[7] An automated signalized junction controller that learns strategies by temporal difference reinforcement learning [J].

Box, Simon ;

Waterson, Ben .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (01) :652-659

[8] An automated signalized junction controller that learns strategies from a human expert [J].

Box, Simon ;

Waterson, Ben .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2012, 25 (01) :107-118

[9]

Cai CQ, 2007, PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, P25

[10] A Review of the Applications of Agent Technology in Traffic and Transportation Systems [J].

Chen, Bo ;

Cheng, Harry H. .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2010, 11 (02) :485-497

← 1 2 3 →