Extensible Hierarchical Multi-Agent Reinforcement-Learning Algorithm in Traffic Signal Control

被引:0
作者
Zhao, Pengqian [1 ]
Yuan, Yuyu [1 ]
Guo, Ting [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Natl Pilot Software Engn Sch, Key Lab Trustworthy Distributed Comp & Serv,Minist, Beijing 100876, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期
关键词
reinforcement learning; multi-agent system; traffic signal control; hierarchical reinforcement learning; LEVEL;
D O I
10.3390/app122412783
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Reinforcement-learning (RL) algorithms have made great achievements in many scenarios. However, in large-scale traffic signal control (TSC) scenarios, RL still falls into local optima when controlling multiple signal lights. To solve this problem, we propose a novel goal-based multi-agent hierarchical model (GMHM). Specifically, we divide the traffic environment into several regions. The region contains a virtual manager and several workers who control the traffic lights. The manager assigns goals to each worker by observing the environment, and the worker makes decisions according to the environment state and the goal. For the worker, we adapted the goal-based multi-agent deep deterministic policy gradient (MADDPG) algorithm combined with hierarchical reinforcement learning. In this way, we simplify tasks and allow agents to cooperate more efficiently. We carried out experiments on both grid traffic scenarios and real-world scenarios in the SUMO simulator. The experimental results show the performance advantages of our algorithm compared with state-of-the-art algorithms.
引用
收藏
页数:14
相关论文
共 37 条
  • [1] Reinforcement Learning based Recommender Systems: A Survey
    Afsar, M. Mehdi
    Crump, Trafford
    Far, Behrouz
    [J]. ACM COMPUTING SURVEYS, 2023, 55 (07)
  • [2] Andrychowicz M., 2017, Proceedings of the Advances in Neural Information Processing Systems, V30, P1
  • [3] Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726
  • [4] Chen CC, 2020, AAAI CONF ARTIF INTE, V34, P3414
  • [5] Interpretable End-to-End Urban Autonomous Driving With Latent Deep Reinforcement Learning
    Chen, Jianyu
    Li, Shengbo Eben
    Tomizuka, Masayoshi
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) : 5068 - 5078
  • [6] Chen L., 2021, Advances in neural information processing systems, V15084, P15097
  • [7] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
  • [8] de Witt Christian Schroeder, 2020, arXiv
  • [9] Deep-Reinforcement-Learning-Based Autonomous Voltage Control for Power Grid Operations
    Duan, Jiajun
    Shi, Di
    Diao, Ruisheng
    Li, Haifeng
    Wang, Zhiwei
    Zhang, Bei
    Bian, Desong
    Yi, Zhehan
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (01) : 814 - 817
  • [10] Hunt P.B., 1982, Traffic Engineering Control, V23, P190