Traffic signal control using a cooperative EWMA-based multi-agent reinforcement learning

Cited by: 6
Authors
Qiao, Zhimin [1]
Ke, Liangjun [1]
Wang, Xiaoqiang [1]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, Xian 710049, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Mean-field; Traffic signal control; TD3; Multi-agent reinforcement learning; NETWORK; ALGORITHM; COORDINATION;
DOI
10.1007/s10489-022-03643-9
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Traffic signal control in contemporary cities remains enormously difficult. Multi-agent reinforcement learning (MARL) is a promising way to solve this problem. However, most MARL algorithms cannot effectively transfer learned strategies when the number of agents increases or decreases. This paper proposes a new MARL algorithm, cooperative dynamic delay updating twin delayed deep deterministic policy gradient based on the exponentially weighted moving average (CoTD3-EWMA), to address this limitation. By introducing mean-field theory, the algorithm implicitly models the interaction between agents and the environment, which reduces the dimension of the action space and improves the scalability of the algorithm. In addition, we propose a dynamic delay updating method based on the exponentially weighted moving average (EWMA), which mitigates the Q-value overestimation problem of the traditional TD3 algorithm. Moreover, a joint reward allocation mechanism and a state sharing mechanism are proposed to improve the global strategy learning ability and robustness of the agents. Simulation results show that the new algorithm outperforms current state-of-the-art algorithms, effectively reducing vehicle delay time and improving the efficiency of the traffic network.
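The abstract names two building blocks, the exponentially weighted moving average and a mean-field summary of other agents' actions, without giving their formulas. The Python sketch below shows only the textbook form of each, as an illustration rather than the paper's method: the function names `ewma` and `mean_neighbor_action`, the smoothing factor `alpha`, and the choice to seed the average with the first observation are all assumptions, and how CoTD3-EWMA actually uses these quantities to set the update delay is not stated in this record.

```python
from typing import Iterable, List


def ewma(values: Iterable[float], alpha: float) -> List[float]:
    """Textbook EWMA: s_t = alpha * x_t + (1 - alpha) * s_{t-1}.

    Hypothetical helper; the paper's exact smoothing rule is not given here.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    smoothed: List[float] = []
    for x in values:
        if not smoothed:
            # Assumption: seed the average with the first observation.
            smoothed.append(float(x))
        else:
            smoothed.append(alpha * x + (1.0 - alpha) * smoothed[-1])
    return smoothed


def mean_neighbor_action(neighbor_actions: List[List[float]]) -> List[float]:
    """Generic mean-field summary: elementwise average of neighbors' actions.

    Replacing the joint action of all neighbors with this single vector keeps
    the input size fixed no matter how many neighbors an agent has.
    """
    if not neighbor_actions:
        raise ValueError("need at least one neighbor action")
    n = len(neighbor_actions)
    return [sum(component) / n for component in zip(*neighbor_actions)]
```

For example, `ewma([1.0, 3.0], alpha=0.5)` smooths the second value halfway toward the first, and `mean_neighbor_action` returns a vector of the same length whether it is given two neighbors or twenty, which is the property the abstract credits for reducing the action-space dimension.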
Pages: 4483 - 4498
Number of pages: 16
Related papers
50 in total
  • [21] XLight: An interpretable multi-agent reinforcement learning approach for traffic signal control
    Cai, Sibin
    Fang, Jie
    Xu, Mengyun
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 273
  • [22] Multi-Agent Reinforcement Learning for Traffic Signal Control: Algorithms and Robustness Analysis
    Wu, Chunliang
    Ma, Zhenliang
    Kim, Inhi
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [23] PyTSC: A Unified Platform for Multi-Agent Reinforcement Learning in Traffic Signal Control
    Bokade, Rohit
    Jin, Xiaoning
    SENSORS, 2025, 25 (05)
  • [24] Traffic signal priority control based on shared experience multi-agent deep reinforcement learning
    Wang, Zhiwen
    Yang, Kangkang
    Li, Long
    Lu, Yanrong
    Tao, Yufei
    IET INTELLIGENT TRANSPORT SYSTEMS, 2023, 17 (07) : 1363 - 1379
  • [25] Adaptive Traffic Signal Control for large-scale scenario with Cooperative Group-based Multi-agent reinforcement learning
    Wang, Tong
    Cao, Jiahua
    Hussain, Azhar
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 125
  • [26] Cooperative multi-agent system for production control using reinforcement learning
    Dittrich, Marc-Andre
    Fohlmeister, Silas
    CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2020, 69 (01) : 389 - 392
  • [27] Learning Multi-Intersection Traffic Signal Control via Coevolutionary Multi-Agent Reinforcement Learning
    Chen, Wubing
    Yang, Shangdong
    Li, Wenbin
    Hu, Yujing
    Liu, Xiao
    Gao, Yang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 15947 - 15963
  • [28] A Meta Multi-agent Reinforcement Learning Algorithm for Multi-intersection Traffic Signal Control
    Yang, Shantian
    Yang, Bo
    2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 18 - 25
  • [29] Distributed Signal Control of Multi-agent Reinforcement Learning Based on Game
    Qu Z.-W.
    Pan Z.-T.
    Chen Y.-H.
    Li H.-T.
    Wang X.
    Chen, Yong-Heng (cyh@jlu.edu.cn), 1600, Science Press (20) : 76 - 82 and 100
  • [30] Multi-agent Cooperative Search based on Reinforcement Learning
    Sun, Yinjiang
    Zhang, Rui
    Liang, Wenbao
    Xu, Cheng
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 891 - 896