Continuous residual reinforcement learning for traffic signal control optimization

被引:15
作者
Aslani, Mohainmad [1 ]
Seipel, Stefan [1 ,2 ]
Wiering, Marco [3 ]
机构
[1] Univ Gavle, Dept Ind Dev IT & Land Management, Gavle, Sweden
[2] Uppsala Univ, Dept Informat Technol, Div Visual Informat & Interact, Uppsala, Sweden
[3] Univ Groningen, Inst Artificial Intelligence & Cognit Engn, Groningen, Netherlands
关键词
continuous state reinforcement learning; adaptive traffic signal control; microscopic traffic simulation; ALGORITHMS; SYSTEM; MODEL;
D O I
10.1139/cjce-2017-0408
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Traffic signal control can be naturally regarded as a reinforcement learning problem. Unfortunately, it is one of the most difficult classes of reinforcement learning problems owing to its large state space. A straightforward approach to address this challenge is to control traffic signals based on continuous reinforcement learning. Although they have been successful in traffic signal control, they may become unstable and fail to converge to near-optimal solutions. We develop adaptive traffic signal controllers based on continuous residual reinforcement learning (CRL-TSC) that is more stable. The effect of three feature functions is empirically investigated in a microscopic traffic simulation. Furthermore, the effects of departing streets, more actions, and the use of the spatial distribution of the vehicles on the performance of CRL-TSCs are assessed. The results show that the best setup of the CRL-TSC leads to saving average travel time by 15% in comparison to an optimized fixed-time controller.
引用
收藏
页码:690 / 702
页数:13
相关论文
共 48 条
[1]   Hierarchical control of traffic signals using Q-learning with tile coding [J].
Abdoos, Monireh ;
Mozayani, Nasser ;
Bazzan, Ana L. C. .
APPLIED INTELLIGENCE, 2014, 40 (02) :201-213
[2]   Holonic multi-agent system for traffic signals control [J].
Abdoos, Monireh ;
Mozayani, Nasser ;
Bazzan, Ana L. C. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (5-6) :1575-1587
[3]  
Abdoos M, 2011, IEEE INT C INTELL TR, P1580, DOI 10.1109/ITSC.2011.6083114
[4]  
Albus J. S., 1975, Transactions of the ASME. Series G, Journal of Dynamic Systems, Measurement and Control, V97, P220, DOI 10.1115/1.3426922
[5]  
[Anonymous], 2000, P MACHINE LEARNING
[6]  
[Anonymous], METH CALC TRANSP EM
[7]  
[Anonymous], 1999, Reinforcement learning through gradient descent
[8]  
[Anonymous], 2015, Reinforcement Learning: An Introduction
[9]  
[Anonymous], 2016, Neural Networks for Applied Sciences and Engineering: From Fundamentals to Complex Pattern Recognition
[10]  
[Anonymous], 2010, Algorithms for Reinforcement Learning