Traffic signal control using a cooperative EWMA-based multi-agent reinforcement learning

Cited by: 6

Authors:

Qiao, Zhimin [1]

Ke, Liangjun [1]

Wang, Xiaoqiang [1]

Affiliations:

[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, Xian 710049, Peoples R China

Source:

APPLIED INTELLIGENCE | 2023 / Vol. 53 / Issue 04

Funding:

National Natural Science Foundation of China;

Keywords:

Mean-field; Traffic signal control; TD3; Multi-agent reinforcement learning; NETWORK; ALGORITHM; COORDINATION;

DOI:

10.1007/s10489-022-03643-9

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In contemporary urban areas, traffic signal control remains enormously difficult. Multi-agent reinforcement learning (MARL) is a promising way to solve this problem. However, most MARL algorithms cannot effectively transfer learned strategies when the number of agents increases or decreases. This paper proposes a new MARL algorithm, cooperative dynamic delay updating twin delayed deep deterministic policy gradient based on the exponentially weighted moving average (CoTD3-EWMA), to address this limitation. By introducing mean-field theory, the algorithm implicitly models the interaction between agents and the environment, which reduces the dimension of the action space and improves the scalability of the algorithm. In addition, we propose a dynamic delay updating method based on the exponentially weighted moving average (EWMA), which mitigates the Q-value overestimation problem of the traditional TD3 algorithm. Moreover, a joint reward allocation mechanism and a state sharing mechanism are proposed to improve the global strategy learning ability and robustness of the agents. Simulation results show that the new algorithm outperforms the current state-of-the-art algorithms, effectively reducing vehicle delay time and improving the traffic efficiency of the traffic network.
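The abstract names two generic building blocks, the exponentially weighted moving average and a mean-field summary of other agents, without giving their exact form. As a rough illustration only, the sketch below shows the textbook EWMA recursion and an element-wise mean over neighbor actions. The function names `ewma` and `mean_neighbor_action`, the smoothing factor `beta`, and the list-based action representation are all assumptions made for this example; they are not taken from the paper and do not reproduce CoTD3-EWMA.

```python
from typing import Iterable, List, Optional, Sequence


def ewma(values: Iterable[float], beta: float = 0.9) -> List[float]:
    """Textbook EWMA: v_t = beta * v_{t-1} + (1 - beta) * x_t.

    The first observation seeds the average (an assumption of this sketch).
    """
    if not 0.0 <= beta < 1.0:
        raise ValueError("beta must be in [0, 1)")
    smoothed: List[float] = []
    avg: Optional[float] = None
    for x in values:
        avg = x if avg is None else beta * avg + (1.0 - beta) * x
        smoothed.append(avg)
    return smoothed


def mean_neighbor_action(neighbor_actions: Sequence[Sequence[float]]) -> List[float]:
    """Element-wise mean of neighbor action vectors (hypothetical helper).

    Replacing N separate neighbor actions with one mean vector keeps the
    input size fixed regardless of how many neighbors there are.
    """
    if not neighbor_actions:
        raise ValueError("at least one neighbor action is required")
    n = len(neighbor_actions)
    dim = len(neighbor_actions[0])
    return [sum(a[i] for a in neighbor_actions) / n for i in range(dim)]
```

Because the mean vector has the same length for any number of neighbors, a network that consumes it would not need to change shape when agents are added or removed, which is one plausible reading of the scalability claim in the abstract.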


Pages: 4483 - 4498

Page count: 16

