Multi-Agent Deep Reinforcement Learning for Urban Traffic Light Control in Vehicular Networks

被引:166
|
作者
Wu, Tong [1 ]
Zhou, Pan [2 ]
Liu, Kai [3 ]
Yuan, Yali [4 ]
Wang, Xiumin [5 ]
Huang, Huawei [6 ]
Wu, Dapeng Oliver [7 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun Engn, Wuhan 430074, Peoples R China
[2] Huazhong Univ Sci & Technol, Hubei Engn Res Ctr Big Data Secur, Sch Cyber Sci & Engn, Wuhan 430074, Peoples R China
[3] Chongqing Univ, Coll Comp Sci, Chongqing 400040, Peoples R China
[4] Gottingen Univ, Inst Comp Sci, D-37077 Gottingen, Germany
[5] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[6] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
[7] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
基金
中国国家自然科学基金;
关键词
Traffic light control; deep reinforcement learning; deep deterministic policy gradient algorithm; markov decision process; vehicular network;
D O I
10.1109/TVT.2020.2997896
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As urban traffic condition is diverse and complicated, applying reinforcement learning to reduce traffic congestion becomes one of the hot and promising topics. Especially, how to coordinate the traffic light controllers of multiple intersections is a key challenge for multi-agent reinforcement learning (MARL). Most existing MARL studies are based on traditional Q-learning, but unstable environment leads to poor learning in the complicated and dynamic traffic scenarios. In this paper, we propose a novel multi-agent recurrent deep deterministic policy gradient (MARDDPG) algorithm based on deep deterministic policy gradient (DDPG) algorithm for traffic light control (TLC) in vehiclar networks. Specifically, the centralized learning in each critic network enables each agent to estimate the policies of other agents in the decision-making process and each agent can coordinate with each other, alleviating the problem of poor learning performance caused by environmental instability. The decentralized execution enables each agent to make decisions independently. We share parameters in actor networks to speed up the training process and reduce the memory footprint. The addition of LSTM is beneficial to alleviate the instability of the environment caused by partial observable state. We utilize surveillance cameras and vehicular networks to collect status information for each intersection. Unlike previous work, we have not only considered the vehicle but also considered the pedestrians waiting to pass through the intersection. Moreover, we also set different priorities for buses and ordinary vehicles. The experimental results in a vehicular network show that our method can run stably in various scenarios and coordinate multiple intersections, which significantly reduces vehicle congestion and pedestrian congestion.
引用
收藏
页码:8243 / 8256
页数:14
相关论文
共 50 条
  • [1] Urban Traffic Control Using Distributed Multi-agent Deep Reinforcement Learning
    Kitagawa, Shunya
    Moustafa, Ahmed
    Ito, Takayuki
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 337 - 349
  • [2] Adaptive Multi-Agent Deep Mixed Reinforcement Learning for Traffic Light Control
    Li, Lulu
    Zhu, Ruijie
    Wu, Shuning
    Ding, Wenting
    Xu, Mingliang
    Lu, Jiwen
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (02) : 1803 - 1816
  • [3] Application of Traffic Light Control in Oversaturated Urban Network Using Multi-Agent Deep Reinforcement Learning
    Ei Mon, Ei
    Ochiai, Hideya
    Aswakul, Chaodit
    IEEE ACCESS, 2024, 12 : 82384 - 82395
  • [4] A traffic light control method based on multi-agent deep reinforcement learning algorithm
    Liu, Dongjiang
    Li, Leixiao
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [5] A traffic light control method based on multi-agent deep reinforcement learning algorithm
    Dongjiang Liu
    Leixiao Li
    Scientific Reports, 13
  • [6] Multi-Agent Deep Reinforcement Learning in Vehicular OCC
    Islam, Amirul
    Musavian, Leila
    Thomos, Nikolaos
    2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
  • [7] Multi-agent deep reinforcement learning with traffic flow for traffic signal control
    Hou, Liang
    Huang, Dailin
    Cao, Jie
    Ma, Jialin
    JOURNAL OF CONTROL AND DECISION, 2025, 12 (01) : 81 - 92
  • [8] Multi-Agent Deep Reinforcement Learning with Clustering and Information Sharing for Traffic Light Cooperative Control
    Du T.
    Wang B.
    Cheng H.
    Luo L.
    Zeng N.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (02): : 538 - 545
  • [9] Multi-agent deep reinforcement learning with actor-attention-critic for traffic light control
    Wang, Bin
    He, ZhengKun
    Sheng, JinFang
    Liu, YingXian
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024, 238 (09) : 2880 - 2888
  • [10] Multi-agent broad reinforcement learning for intelligent traffic light control
    Zhu, Ruijie
    Li, Lulu
    Wu, Shuning
    Lv, Pei
    Li, Yafei
    Xu, Mingliang
    INFORMATION SCIENCES, 2023, 619 : 509 - 525