Dynamic traffic signal control using mean field multi-agent reinforcement learning in large scale road-networks

被引:3
作者
Hu, Tianfeng [1 ]
Hu, Zhiqun [1 ]
Lu, Zhaoming [1 ]
Wen, Xiangming [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing Key Lab Network Syst Architecture & Conver, Beijing, Peoples R China
关键词
cooperative communication; learning (artificial intelligence); road traffic control; signalling; traffic control; OPTIMIZATION;
D O I
10.1049/itr2.12364
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multi-agent reinforcement learning has played an increasingly important role in intelligent traffic signal control due to its self-learning ability. However, existing algorithms only focus on signal timing mechanism design while ignoring the exponential growth of the joint action dimension as the number of intersections increases, which will ultimately face the learning difficulty. In this paper, traditional traffic methods are introduced into MARL to flexibly determine the phase and duration of each intersection. The proposed MARL algorithm based on mean field theory has the ability to convert a large number of agents to approximately binary interaction, which can effectively reduce the dimension of joint action space in multi-agent environment and learn in a robust process. Besides, to improve the performance of traditional traffic methods, the recurrent neural network (RNN) and an improved Webster's formula with revised parameters are combined to dynamically determine the phase duration according to the historical volume of traffic flow. The simulation results indicate that the proposed algorithm shows superior scalability compared to baseline methods and has great potential to be applied in the large scale road-networks scenario.
引用
收藏
页码:1715 / 1728
页数:14
相关论文
共 50 条
  • [21] Consensus Multi-Agent Reinforcement Learning for Volt-VAR Control in Power Distribution Networks
    Gao, Yuanqi
    Wang, Wei
    Yu, Nanpeng
    IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (04) : 3594 - 3604
  • [22] Collaborative Intelligent Reflecting Surface Networks With Multi-Agent Reinforcement Learning
    Zhang, Jie
    Li, Jun
    Zhang, Yijin
    Wu, Qingqing
    Wu, Xiongwei
    Shu, Feng
    Jin, Shi
    Chen, Wen
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (03) : 532 - 545
  • [23] Resilient adaptive optimal control of distributed multi-agent systems using reinforcement learning
    Moghadam, Rohollah
    Modares, Hamidreza
    IET CONTROL THEORY AND APPLICATIONS, 2018, 12 (16) : 2165 - 2174
  • [24] Multi-Agent Deep Reinforcement Learning for HVAC Control in Commercial Buildings
    Yu, Liang
    Sun, Yi
    Xu, Zhanbo
    Shen, Chao
    Yue, Dong
    Jiang, Tao
    Guan, Xiaohong
    IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (01) : 407 - 419
  • [25] Iterative learning approach for traffic signal control of urban road networks
    Yan, Fei
    Tian, Fuli
    Shi, Zhongke
    IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (04) : 466 - 475
  • [26] Smart City: Application of Multi-agent Reinforcement Learning Systems in Adaptive Traffic Management
    Ahmadi, Kamilia
    Allan, Vicki H.
    2021 IEEE INTERNATIONAL SMART CITIES CONFERENCE (ISC2), 2021,
  • [27] Multi-Agent Optimal Control for Central Chiller Plants Using Reinforcement Learning and Game Theory
    Qiu, Shunian
    Li, Zhenhai
    Pang, Zhihong
    Li, Zhengwei
    Tao, Yinying
    SYSTEMS, 2023, 11 (03):
  • [28] Dynamic power allocation in IIoT based on multi-agent deep reinforcement learning
    Li, Fenglei
    Liu, Zhixin
    Zhang, Xinzhe
    Yang, Yi
    NEUROCOMPUTING, 2022, 505 : 10 - 18
  • [29] A Multi-Agent Reinforcement Learning Approach to the Dynamic Job Shop Scheduling Problem
    Inal, Ali Firat
    Sel, Cagri
    Aktepe, Adnan
    Turker, Ahmet Kursad
    Ersoz, Suleyman
    SUSTAINABILITY, 2023, 15 (10)
  • [30] Multi-Agent Deep Reinforcement Learning Based Downlink Beamforming in Heterogeneous Networks
    Zhang, Zitian
    Hou, Jinbo
    Chu, Xiaoli
    Zhou, Haibo
    Wei, Guiyi
    Zhang, Jie
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (06) : 4247 - 4263