A group-based traffic signal control with adaptive learning ability

被引:41
作者
Jin, Junchen [1 ]
Ma, Xiaoliang [1 ,2 ]
机构
[1] KTH Royal Inst Technol, Syst Simulat & Control S2CLab, Tekn Ringen 10, S-10044 Stockholm, Sweden
[2] iTekn Solut, Stockholm, Sweden
关键词
Adaptive traffic signal system; Stochastic optimal control; Group-based phasing; Reinforcement learning; Multiple-step backups; CONTROL-SYSTEM;
D O I
10.1016/j.engappai.2017.07.022
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Group-based control is an advanced traffic signal strategy capable of dynamically generating phase sequences at intersections. Combined with the phasing scheme, vehicle actuated timing is often adopted to respond to the detected traffic. However, the parameters of a signal controller are often predetermined in practice, and the control performance may suffer from deterioration when dealing with highly fluctuating traffic demand. This study proposes a group-based signal control approach capable of making decisions based on its understanding of traffic conditions at the intersection level. In particular, the control problem is formulated using a framework of stochastic optimal control for multi-agent system in which each signal group is modeled as an intelligent agent. The agents learn how to react to traffic environment and make optimal timing decisions according to the perceived system states. Reinforcement learning, enhanced by multiple-step backups, is employed as the kernel of the intelligent control algorithm, where each agent updates its knowledge on-line based on a sequence of states during the process. In addition, the proposed system is designated to be compatible with the prevailing signal system. A case study was carried out in a simulation environment to compare the proposed control approach with a benchmark controller used in practice, group-based vehicle actuated (GBVA) controller, whose parameters were off-line optimized using a genetic algorithm. Simulation results show that the proposed adaptive group-based control system outperforms the optimized GBVA control system mainly because of its real-time adaptive learning capacity in response to the changes in traffic demand. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:282 / 293
页数:12
相关论文
共 22 条
  • [1] Holonic multi-agent system for traffic signals control
    Abdoos, Monireh
    Mozayani, Nasser
    Bazzan, Ana L. C.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (5-6) : 1575 - 1587
  • [2] [Anonymous], 2000, P MACHINE LEARNING
  • [3] [Anonymous], 2012, International journal on advances in systems and measurements
  • [4] [Anonymous], 2017, ARXIV
  • [5] Type-2 fuzzy logic based urban traffic management
    Balaji, P. G.
    Srinivasan, D.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (01) : 12 - 22
  • [6] Barto A., 1998, Reinforcement Learning: an Introduction
  • [7] Learning in groups of traffic signals
    Bazzan, Ana L. C.
    de Oliveira, Denise
    da Silva, Bruno C.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2010, 23 (04) : 560 - 568
  • [8] The real-time urban traffic control system CRONOS:: Algorithm and experiments
    Boillot, Florence
    Midenet, Sophie
    Pierrelee, Jean-Claude
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2006, 14 (01) : 18 - 38
  • [9] Adaptive traffic signal control using approximate dynamic programming
    Cai, Chen
    Wong, Chi Kwong
    Heydecker, Benjamin G.
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2009, 17 (05) : 456 - 474
  • [10] Multiagent Reinforcement Learning for Integrated Network of Adaptive Traffic Signal Controllers (MARLIN-ATSC): Methodology and Large-Scale Application on Downtown Toronto
    El-Tantawy, Samah
    Abdulhai, Baher
    Abdelgawad, Hossam
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2013, 14 (03) : 1140 - 1150