Traffic Light Control Using Hierarchical Reinforcement Learning and Options Framework

被引：12

作者：

Borges, Dimitrius F. ^{[1
]}

Leite, Joao Paulo R. R. ^{[1
]}

Moreira, Edmilson M. ^{[1
]}

Carpinteiro, Otavio A. S. ^{[1
]}

机构：

[1] Univ Fed Itajuba, Inst Syst Engn & Informat Technol, BR-1303 Itajuba, MG, Brazil

来源：

IEEE ACCESS | 2021年 / 9卷

关键词：

Reinforcement learning; Vehicle dynamics; Tools; Mathematical model; Meters; Green products; Adaptation models; Intelligent systems; machine learning; reinforcement learning; simulation; traffic control; SIGNAL CONTROL; INTELLIGENCE;

D O I：

10.1109/ACCESS.2021.3096666

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The number of vehicles worldwide has grown rapidly over the past decade, impacting how urban traffic is managed. Traffic light control is a well-known problem and, although an increasing number of technologies are used to solve it, it still poses challenges and opportunities, especially when considering the inefficiency of the popular fixed-time traffic controllers. This study aims to apply Hierarchical Reinforcement Learning (HRL) and Options Framework to control a signalized vehicular intersection and compare its performance with that of a fixed-time traffic controller, configured using the Webster Method. HRL combines the ability to learn and make decisions while taking observations from the environment in real-time. These capabilities bring a significant adaptive power to a highly dynamic problem. The test scenarios were built using the SUMO simulation tool. According to our results, HRL presents better performance than those of its own isolated sub-policies and the fixed-time model, indicating a simple and efficient alternative.

引用

页码：99155 / 99165

页数：11

共 37 条

[1]

Aggarwal C. C., 2015, DATA MINING

[2]

[Anonymous], 1966, 56 ROAD RES

[3]

[Anonymous], 2020, TRAFF CONTR INT TRAFF CONTR INT

[4]

[Anonymous], 1984, MAN SEM MAN SEM

[5] Traffic-signal control reinforcement learning approach for continuous-time Markov games [J].

Aragon-Gomez, Roman ;

Clempner, Julio B. .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 89

[6]

Barto AG, 2003, DISCRETE EVENT DYN S, V13, P343

[7] ARTIFICIAL-INTELLIGENCE TECHNIQUES FOR URBAN TRAFFIC CONTROL [J].

BIELLI, M ;

AMBROSINO, G ;

BOERO, M ;

MASTRETTA, M .

TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 1991, 25 (05) :319-325

[8]

Borges Dimitrius F., 2021, ITNG 2021 18th International Conference on Information Technology-New Generations. Advances in Intelligent Systems and Computing (AISC 1346), P11, DOI 10.1007/978-3-030-70416-2_2

[9] Adaptive traffic signal control based on bio-neural network [J].

Castro, Guilherme B. ;

Hirakawa, Andre R. ;

Martini, Jose S. C. .

8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 :1182-1187

[10]

Dayan P., 1993, Advances in neural information processing systems, P271

← 1 2 3 4 →