Hierarchical traffic signal optimization using reinforcement learning and traffic prediction with long-short term memory

被引：50

作者：

Abdoos, Monireh ^{[1
]}

Bazzan, Ana L. C. ^{[2
]}

机构：

[1] Shahid Beheshti Univ, Fac Comp Sci & Engn, Tehran, Iran

[2] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2021年 / 171卷

关键词：

Traffic signal control; Hierarchical multi-agent system; Reinforcement learning; Traffic prediction; Long short-term memory (LSTM); NETWORKS;

D O I：

10.1016/j.eswa.2021.114580

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-agent systems can be used for modelling large-scale distributed systems in real world applications. In intelligent transportation system (ITS), many interacting entities influence the performance of the system. As part of ITS, traffic signal control can be modelled using a multi-agent system. In this paper, a hierarchical multi-agent system including two levels is employed to control traffic signals. Each traffic signal is controlled by an agent that sits in the physical level, i.e., in the first level. For the other levels, the traffic network is divided into a number of regions, each controlled by a region controller agent. The first level agents utilize reinforcement learning to find the best policy, while they send their local information to the above level agents. The local information is used to train a long short-term memory (LSTM) neural network for traffic status prediction. The agents in the above level can control the traffic signals by finding the best joint policy using the predicted traffic information. Experimental results show the effectiveness of the proposed method in a traffic network including 16 intersections.

引用

页数：9

共 34 条

[1]

Abdoos Monireh, 2012, Agent and Multi-Agent Systems. Technologies and Applications. Proceedings 6th KES International Conference, KES-AMSTA 2012, P379, DOI 10.1007/978-3-642-30947-2_42

[2]

Abdoos M., 2020, IEEE INTELL SYST

[3] Hierarchical control of traffic signals using Q-learning with tile coding [J].