Hierarchical traffic signal optimization using reinforcement learning and traffic prediction with long-short term memory

被引:43
作者
Abdoos, Monireh [1 ]
Bazzan, Ana L. C. [2 ]
机构
[1] Shahid Beheshti Univ, Fac Comp Sci & Engn, Tehran, Iran
[2] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
关键词
Traffic signal control; Hierarchical multi-agent system; Reinforcement learning; Traffic prediction; Long short-term memory (LSTM); NETWORKS;
D O I
10.1016/j.eswa.2021.114580
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-agent systems can be used for modelling large-scale distributed systems in real world applications. In intelligent transportation system (ITS), many interacting entities influence the performance of the system. As part of ITS, traffic signal control can be modelled using a multi-agent system. In this paper, a hierarchical multi-agent system including two levels is employed to control traffic signals. Each traffic signal is controlled by an agent that sits in the physical level, i.e., in the first level. For the other levels, the traffic network is divided into a number of regions, each controlled by a region controller agent. The first level agents utilize reinforcement learning to find the best policy, while they send their local information to the above level agents. The local information is used to train a long short-term memory (LSTM) neural network for traffic status prediction. The agents in the above level can control the traffic signals by finding the best joint policy using the predicted traffic information. Experimental results show the effectiveness of the proposed method in a traffic network including 16 intersections.
引用
收藏
页数:9
相关论文
共 34 条
  • [1] Abdoos Monireh, 2012, Agent and Multi-Agent Systems. Technologies and Applications. Proceedings 6th KES International Conference, KES-AMSTA 2012, P379, DOI 10.1007/978-3-642-30947-2_42
  • [2] Abdoos M., 2020, IEEE INTELL SYST
  • [3] Hierarchical control of traffic signals using Q-learning with tile coding
    Abdoos, Monireh
    Mozayani, Nasser
    Bazzan, Ana L. C.
    [J]. APPLIED INTELLIGENCE, 2014, 40 (02) : 201 - 213
  • [4] Abdoos M, 2011, IEEE INT C INTELL TR, P1580, DOI 10.1109/ITSC.2011.6083114
  • [5] Long short-term memory
    Hochreiter, S
    Schmidhuber, J
    [J]. NEURAL COMPUTATION, 1997, 9 (08) : 1735 - 1780
  • [6] Continuous residual reinforcement learning for traffic signal control optimization
    Aslani, Mohainmad
    Seipel, Stefan
    Wiering, Marco
    [J]. CANADIAN JOURNAL OF CIVIL ENGINEERING, 2018, 45 (08) : 690 - 702
  • [7] Multi-Agent System in Urban Traffic Signal Control
    Balaji, P. G.
    Srinivasan, D.
    [J]. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2010, 5 (04) : 43 - 51
  • [8] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Bazzan, Ana L. C.
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) : 342 - 375
  • [9] Chen YY, 2016, 2016 IEEE 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), P132, DOI 10.1109/ITSC.2016.7795543
  • [10] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095