Extensible Hierarchical Multi-Agent Reinforcement-Learning Algorithm in Traffic Signal Control

被引:1
作者
Zhao, Pengqian [1 ]
Yuan, Yuyu [1 ]
Guo, Ting [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Natl Pilot Software Engn Sch, Key Lab Trustworthy Distributed Comp & Serv,Minist, Beijing 100876, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期
关键词
reinforcement learning; multi-agent system; traffic signal control; hierarchical reinforcement learning; LEVEL;
D O I
10.3390/app122412783
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Reinforcement-learning (RL) algorithms have made great achievements in many scenarios. However, in large-scale traffic signal control (TSC) scenarios, RL still falls into local optima when controlling multiple signal lights. To solve this problem, we propose a novel goal-based multi-agent hierarchical model (GMHM). Specifically, we divide the traffic environment into several regions. The region contains a virtual manager and several workers who control the traffic lights. The manager assigns goals to each worker by observing the environment, and the worker makes decisions according to the environment state and the goal. For the worker, we adapted the goal-based multi-agent deep deterministic policy gradient (MADDPG) algorithm combined with hierarchical reinforcement learning. In this way, we simplify tasks and allow agents to cooperate more efficiently. We carried out experiments on both grid traffic scenarios and real-world scenarios in the SUMO simulator. The experimental results show the performance advantages of our algorithm compared with state-of-the-art algorithms.
引用
收藏
页数:14
相关论文
共 37 条
[1]   Reinforcement Learning based Recommender Systems: A Survey [J].
Afsar, M. Mehdi ;
Crump, Trafford ;
Far, Behrouz .
ACM COMPUTING SURVEYS, 2023, 55 (07)
[2]  
Andrychowicz M., 2017, P ADV NEUR INF PROC
[3]  
Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726
[4]  
Chen CC, 2020, AAAI CONF ARTIF INTE, V34, P3414
[5]   Interpretable End-to-End Urban Autonomous Driving With Latent Deep Reinforcement Learning [J].
Chen, Jianyu ;
Li, Shengbo Eben ;
Tomizuka, Masayoshi .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) :5068-5078
[6]  
Chen L., 2021, Advances in neural information processing systems, V34, p15084 15097, DOI 10.2139/ssrn.3971444
[7]   Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control [J].
Chu, Tianshu ;
Wang, Jie ;
Codeca, Lara ;
Li, Zhaojian .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) :1086-1095
[8]  
de Witt C. S., 2020, ARXIV
[9]   Deep-Reinforcement-Learning-Based Autonomous Voltage Control for Power Grid Operations [J].
Duan, Jiajun ;
Shi, Di ;
Diao, Ruisheng ;
Li, Haifeng ;
Wang, Zhiwei ;
Zhang, Bei ;
Bian, Desong ;
Yi, Zhehan .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (01) :814-817
[10]  
Hunt P., 1982, Traffic Eng Control, V23, P190