Multi-Agent Deep Reinforcement Learning for Urban Traffic Light Control in Vehicular Networks

被引：166

作者：

Wu, Tong ^{[1
]}

Zhou, Pan ^{[2
]}

Liu, Kai ^{[3
]}

Yuan, Yali ^{[4
]}

Wang, Xiumin ^{[5
]}

Huang, Huawei ^{[6
]}

Wu, Dapeng Oliver ^{[7
]}

机构：

[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun Engn, Wuhan 430074, Peoples R China

[2] Huazhong Univ Sci & Technol, Hubei Engn Res Ctr Big Data Secur, Sch Cyber Sci & Engn, Wuhan 430074, Peoples R China

[3] Chongqing Univ, Coll Comp Sci, Chongqing 400040, Peoples R China

[4] Gottingen Univ, Inst Comp Sci, D-37077 Gottingen, Germany

[5] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China

[6] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China

[7] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2020年 / 69卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Traffic light control; deep reinforcement learning; deep deterministic policy gradient algorithm; markov decision process; vehicular network;

D O I：

10.1109/TVT.2020.2997896

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

As urban traffic condition is diverse and complicated, applying reinforcement learning to reduce traffic congestion becomes one of the hot and promising topics. Especially, how to coordinate the traffic light controllers of multiple intersections is a key challenge for multi-agent reinforcement learning (MARL). Most existing MARL studies are based on traditional Q-learning, but unstable environment leads to poor learning in the complicated and dynamic traffic scenarios. In this paper, we propose a novel multi-agent recurrent deep deterministic policy gradient (MARDDPG) algorithm based on deep deterministic policy gradient (DDPG) algorithm for traffic light control (TLC) in vehiclar networks. Specifically, the centralized learning in each critic network enables each agent to estimate the policies of other agents in the decision-making process and each agent can coordinate with each other, alleviating the problem of poor learning performance caused by environmental instability. The decentralized execution enables each agent to make decisions independently. We share parameters in actor networks to speed up the training process and reduce the memory footprint. The addition of LSTM is beneficial to alleviate the instability of the environment caused by partial observable state. We utilize surveillance cameras and vehicular networks to collect status information for each intersection. Unlike previous work, we have not only considered the vehicle but also considered the pedestrians waiting to pass through the intersection. Moreover, we also set different priorities for buses and ordinary vehicles. The experimental results in a vehicular network show that our method can run stably in various scenarios and coordinate multiple intersections, which significantly reduces vehicle congestion and pedestrian congestion.

引用

页码：8243 / 8256

页数：14

共 50 条

[1] Urban Traffic Control Using Distributed Multi-agent Deep Reinforcement Learning
Kitagawa, Shunya
Moustafa, Ahmed
Ito, Takayuki
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 337 - 349
[2] Adaptive Multi-Agent Deep Mixed Reinforcement Learning for Traffic Light Control
Li, Lulu
Zhu, Ruijie
Wu, Shuning
Ding, Wenting
Xu, Mingliang
Lu, Jiwen
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (02) : 1803 - 1816
[3] Application of Traffic Light Control in Oversaturated Urban Network Using Multi-Agent Deep Reinforcement Learning
Ei Mon, Ei
Ochiai, Hideya
Aswakul, Chaodit
IEEE ACCESS, 2024, 12 : 82384 - 82395
[4] A traffic light control method based on multi-agent deep reinforcement learning algorithm
Liu, Dongjiang
Li, Leixiao
SCIENTIFIC REPORTS, 2023, 13 (01)
[5] A traffic light control method based on multi-agent deep reinforcement learning algorithm
Dongjiang Liu
Leixiao Li
Scientific Reports, 13
[6] Multi-Agent Deep Reinforcement Learning in Vehicular OCC
Islam, Amirul
Musavian, Leila
Thomos, Nikolaos
2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
[7] Multi-agent deep reinforcement learning with traffic flow for traffic signal control
Hou, Liang
Huang, Dailin
Cao, Jie
Ma, Jialin
JOURNAL OF CONTROL AND DECISION, 2025, 12 (01) : 81 - 92
[8] Multi-Agent Deep Reinforcement Learning with Clustering and Information Sharing for Traffic Light Cooperative Control
Du T.
Wang B.
Cheng H.
Luo L.
Zeng N.
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (02): : 538 - 545
[9] Multi-agent deep reinforcement learning with actor-attention-critic for traffic light control
Wang, Bin
He, ZhengKun
Sheng, JinFang
Liu, YingXian
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024, 238 (09) : 2880 - 2888
[10] Multi-agent broad reinforcement learning for intelligent traffic light control
Zhu, Ruijie
Li, Lulu
Wu, Shuning
Lv, Pei
Li, Yafei
Xu, Mingliang
INFORMATION SCIENCES, 2023, 619 : 509 - 525

← 1 2 3 4 5 →