Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning

被引:92
|
作者
Wang, Xiaoqiang [1 ,2 ]
Ke, Liangjun [1 ]
Qiao, Zhimin [1 ]
Chai, Xinghua [2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, State Key Lab Mfg Syst Engn, Xian 710049, Peoples R China
[2] CETC Key Lab Aerosp Informat Applicat, Shijiazhuang 050081, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Double estimators; mean-field approximation; multiagent reinforcement learning (MARL); traffic signal control (TSC); NETWORK; COORDINATION; OPTIMIZATION;
D O I
10.1109/TCYB.2020.3015811
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Finding the optimal signal timing strategy is a difficult task for the problem of large-scale traffic signal control (TSC). Multiagent reinforcement learning (MARL) is a promising method to solve this problem. However, there is still room for improvement in extending to large-scale problems and modeling the behaviors of other agents for each individual agent. In this article, a new MARL, called cooperative double Q-learning (Co-DQL), is proposed, which has several prominent features. It uses a highly scalable independent double Q-learning method based on double estimators and the upper confidence bound (UCB) policy, which can eliminate the over-estimation problem existing in traditional independent Q-learning while ensuring exploration. It uses mean-field approximation to model the interaction among agents, thereby making agents learn a better cooperative strategy. In order to improve the stability and robustness of the learning process, we introduce a new reward allocation mechanism and a local state sharing method. In addition, we analyze the convergence properties of the proposed algorithm. Co-DQL is applied to TSC and tested on various traffic flow scenarios of TSC simulators. The results show that Co-DQL outperforms the state-of-the-art decentralized MARL algorithms in terms of multiple traffic metrics.
引用
收藏
页码:174 / 187
页数:14
相关论文
共 50 条
  • [1] Large-Scale Traffic Grid Signal Control with Regional Reinforcement Learning
    Chu, Tianshu
    Qu, Shuhui
    Wang, Jie
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 815 - 820
  • [2] Large-Scale Traffic Grid Signal Control Using Decentralized Fuzzy Reinforcement Learning
    Tan, Tian
    Chu, TianShu
    Peng, Bo
    Wang, Jie
    PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 1, 2018, 15 : 652 - 662
  • [3] Cooperative Deep Reinforcement Learning for Large-Scale Traffic Grid Signal Control
    Tan, Tian
    Bao, Feng
    Deng, Yue
    Jin, Alex
    Dai, Qionghai
    Wang, Jie
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2687 - 2700
  • [4] Large-Scale Traffic Signal Control Using Constrained Network Partition and Adaptive Deep Reinforcement Learning
    Gu, Hankang
    Wang, Shangbo
    Ma, Xiaoguang
    Jia, Dongyao
    Mao, Guoqiang
    Lim, Eng Gee
    Wong, Cheuk Pong Ryan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (07) : 7619 - 7632
  • [5] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
  • [6] Engineering A Large-Scale Traffic Signal Control: A Multi-Agent Reinforcement Learning Approach
    Chen, Yue
    Li, Changle
    Yue, Wenwei
    Zhang, Hehe
    Mao, Guoqiang
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
  • [7] Toward A Thousand Lights: Decentralized Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chen, Chacha
    Wei, Hua
    Xu, Nan
    Zheng, Guanjie
    Yang, Ming
    Xiong, Yuanhao
    Xu, Kai
    Li, Zhenhui
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3414 - 3421
  • [8] GPLight: Grouped Multi-agent Reinforcement Learning for Large-scale Traffic Signal Control
    Liu, Yilin
    Luo, Guiyang
    Yuan, Quan
    Li, Jinglin
    Jin, Lei
    Chen, Bo
    Pan, Rui
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 199 - 207
  • [9] Implementing Traffic Signal Optimal Control by Multiagent Reinforcement Learning
    Song, Jiong
    Jin, Zhao
    Zhu, WenJun
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 2578 - 2582
  • [10] Large-scale traffic control using autonomous vehicles and decentralized deep reinforcement learning
    Maske, Harshal
    Chu, Tianshu
    Kalabic, Uros
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 3816 - 3821