Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning

被引：92

作者：

Wang, Xiaoqiang ^{[1
,2
]}

Ke, Liangjun ^{[1
]}

Qiao, Zhimin ^{[1
]}

Chai, Xinghua ^{[2
]}

机构：

[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, State Key Lab Mfg Syst Engn, Xian 710049, Peoples R China

[2] CETC Key Lab Aerosp Informat Applicat, Shijiazhuang 050081, Hebei, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2021年 / 51卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Double estimators; mean-field approximation; multiagent reinforcement learning (MARL); traffic signal control (TSC); NETWORK; COORDINATION; OPTIMIZATION;

D O I：

10.1109/TCYB.2020.3015811

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Finding the optimal signal timing strategy is a difficult task for the problem of large-scale traffic signal control (TSC). Multiagent reinforcement learning (MARL) is a promising method to solve this problem. However, there is still room for improvement in extending to large-scale problems and modeling the behaviors of other agents for each individual agent. In this article, a new MARL, called cooperative double Q-learning (Co-DQL), is proposed, which has several prominent features. It uses a highly scalable independent double Q-learning method based on double estimators and the upper confidence bound (UCB) policy, which can eliminate the over-estimation problem existing in traditional independent Q-learning while ensuring exploration. It uses mean-field approximation to model the interaction among agents, thereby making agents learn a better cooperative strategy. In order to improve the stability and robustness of the learning process, we introduce a new reward allocation mechanism and a local state sharing method. In addition, we analyze the convergence properties of the proposed algorithm. Co-DQL is applied to TSC and tested on various traffic flow scenarios of TSC simulators. The results show that Co-DQL outperforms the state-of-the-art decentralized MARL algorithms in terms of multiple traffic metrics.

引用

页码：174 / 187

页数：14

共 50 条

[1] Large-Scale Traffic Grid Signal Control with Regional Reinforcement Learning
Chu, Tianshu
Qu, Shuhui
Wang, Jie
2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 815 - 820
[2] Large-Scale Traffic Grid Signal Control Using Decentralized Fuzzy Reinforcement Learning
Tan, Tian
Chu, TianShu
Peng, Bo
Wang, Jie
PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 1, 2018, 15 : 652 - 662
[3] Cooperative Deep Reinforcement Learning for Large-Scale Traffic Grid Signal Control
Tan, Tian
Bao, Feng
Deng, Yue
Jin, Alex
Dai, Qionghai
Wang, Jie
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2687 - 2700
[4] Large-Scale Traffic Signal Control Using Constrained Network Partition and Adaptive Deep Reinforcement Learning
Gu, Hankang
Wang, Shangbo
Ma, Xiaoguang
Jia, Dongyao
Mao, Guoqiang
Lim, Eng Gee
Wong, Cheuk Pong Ryan
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (07) : 7619 - 7632
[5] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
Chu, Tianshu
Wang, Jie
Codeca, Lara
Li, Zhaojian
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
[6] Engineering A Large-Scale Traffic Signal Control: A Multi-Agent Reinforcement Learning Approach
Chen, Yue
Li, Changle
Yue, Wenwei
Zhang, Hehe
Mao, Guoqiang
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
[7] Toward A Thousand Lights: Decentralized Deep Reinforcement Learning for Large-Scale Traffic Signal Control
Chen, Chacha
Wei, Hua
Xu, Nan
Zheng, Guanjie
Yang, Ming
Xiong, Yuanhao
Xu, Kai
Li, Zhenhui
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3414 - 3421
[8] GPLight: Grouped Multi-agent Reinforcement Learning for Large-scale Traffic Signal Control
Liu, Yilin
Luo, Guiyang
Yuan, Quan
Li, Jinglin
Jin, Lei
Chen, Bo
Pan, Rui
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 199 - 207
[9] Implementing Traffic Signal Optimal Control by Multiagent Reinforcement Learning
Song, Jiong
Jin, Zhao
Zhu, WenJun
2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 2578 - 2582
[10] Large-scale traffic control using autonomous vehicles and decentralized deep reinforcement learning
Maske, Harshal
Chu, Tianshu
Kalabic, Uros
2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 3816 - 3821

← 1 2 3 4 5 →