Traffic Light Control Using Hierarchical Reinforcement Learning and Options Framework

被引:12
作者
Borges, Dimitrius F. [1 ]
Leite, Joao Paulo R. R. [1 ]
Moreira, Edmilson M. [1 ]
Carpinteiro, Otavio A. S. [1 ]
机构
[1] Univ Fed Itajuba, Inst Syst Engn & Informat Technol, BR-1303 Itajuba, MG, Brazil
关键词
Reinforcement learning; Vehicle dynamics; Tools; Mathematical model; Meters; Green products; Adaptation models; Intelligent systems; machine learning; reinforcement learning; simulation; traffic control; SIGNAL CONTROL; INTELLIGENCE;
D O I
10.1109/ACCESS.2021.3096666
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The number of vehicles worldwide has grown rapidly over the past decade, impacting how urban traffic is managed. Traffic light control is a well-known problem and, although an increasing number of technologies are used to solve it, it still poses challenges and opportunities, especially when considering the inefficiency of the popular fixed-time traffic controllers. This study aims to apply Hierarchical Reinforcement Learning (HRL) and Options Framework to control a signalized vehicular intersection and compare its performance with that of a fixed-time traffic controller, configured using the Webster Method. HRL combines the ability to learn and make decisions while taking observations from the environment in real-time. These capabilities bring a significant adaptive power to a highly dynamic problem. The test scenarios were built using the SUMO simulation tool. According to our results, HRL presents better performance than those of its own isolated sub-policies and the fixed-time model, indicating a simple and efficient alternative.
引用
收藏
页码:99155 / 99165
页数:11
相关论文
共 37 条
[1]  
Aggarwal C. C., 2015, DATA MINING
[2]  
[Anonymous], 1966, 56 ROAD RES
[3]  
[Anonymous], 2020, TRAFF CONTR INT TRAFF CONTR INT
[4]  
[Anonymous], 1984, MAN SEM MAN SEM
[5]   Traffic-signal control reinforcement learning approach for continuous-time Markov games [J].
Aragon-Gomez, Roman ;
Clempner, Julio B. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 89
[6]  
Barto AG, 2003, DISCRETE EVENT DYN S, V13, P343
[7]   ARTIFICIAL-INTELLIGENCE TECHNIQUES FOR URBAN TRAFFIC CONTROL [J].
BIELLI, M ;
AMBROSINO, G ;
BOERO, M ;
MASTRETTA, M .
TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 1991, 25 (05) :319-325
[8]  
Borges Dimitrius F., 2021, ITNG 2021 18th International Conference on Information Technology-New Generations. Advances in Intelligent Systems and Computing (AISC 1346), P11, DOI 10.1007/978-3-030-70416-2_2
[9]   Adaptive traffic signal control based on bio-neural network [J].
Castro, Guilherme B. ;
Hirakawa, Andre R. ;
Martini, Jose S. C. .
8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 :1182-1187
[10]  
Dayan P., 1993, Advances in neural information processing systems, P271