Network-Wide Traffic Signal Control Based on MARL With Hierarchical Nash-Stackelberg Game Model

被引:0
作者
Shen, Hui [1 ,2 ]
Zhao, Hongxia [3 ]
Zhang, Zundong [4 ]
Yang, Xun [4 ]
Song, Yutong [4 ]
Liu, Xiaoming [4 ]
机构
[1] North China Univ Technol, Sch Elect & Control Engn, Beijing 100037, Peoples R China
[2] Beijing Municipal Traff Management Bur, Beijing 100037, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[4] North China Univ Technol, Beijing Key Lab Urban Rd Traff Intelligent Technol, Beijing 100144, Peoples R China
关键词
Games; Roads; Approximation algorithms; Q-learning; Multi-agent systems; Process control; Optimization; Reinforcement learning; Traffic control; Network-wide traffic signal control; hierarchical game model; multi-agent reinforcement learning; MULTIAGENT SYSTEMS; REINFORCEMENT;
D O I
10.1109/ACCESS.2023.3345448
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Network-wide traffic signal control is an important means of relieving urban congestion, reducing traffic accidents, and improving traffic efficiency. However, solving the problem of computational complexity caused by multi-intersection games is challenging. To address this issue, we propose a Nash-Stackelberg hierarchical game model that considers the importance of different intersections in the road network and the game relationships between intersections. The model takes into account traffic control strategies between and within sub-areas of the road network, with important intersections in the two sub-areas as the game subject at the upper layer and secondary intersections as the game subject at the lower layer. Furthermore, we propose two reinforcement learning algorithms (NSHG-QL and NSHG-DQN) based on the Nash-Stackelberg hierarchical game model to realize coordinated control of traffic signals in urban areas. Experimental results show that, compared to basic game model solving algorithms, NSHG-QL and NSHG-DQN algorithms can reduce the average travel time and time loss of vehicles at intersections, increase average speed and road occupancy, and coordinate secondary intersections to make optimal strategy selections based on satisfying the upper-layer game between important intersections. Moreover, the multi-agent reinforcement learning algorithms based on this hierarchical game model can significantly improve learning performance and convergence.
引用
收藏
页码:145085 / 145100
页数:16
相关论文
共 48 条
[1]   A Novel Decentralized Game-Theoretic Adaptive Traffic Signal Controller: Large-Scale Testing [J].
Abdelghaffar, Hossam M. ;
Rakha, Hesham A. .
SENSORS, 2019, 19 (10)
[3]   Signalized Intersection Control in Mixed Autonomous and Regular Vehicles Traffic Environment-A Critical Review Focusing on Future Control [J].
Al-Turki, Mohammed ;
Ratrout, Nedal T. ;
Rahman, Syed Masiur ;
Assi, Khaled J. .
IEEE ACCESS, 2022, 10 :16942-16951
[4]   AlphaStar: An Evolutionary Computation Perspective [J].
Arulkumaran, Kai ;
Cully, Antoine ;
Togelius, Julian .
PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, :314-315
[5]   The Hanabi challenge: A new frontier for AI research [J].
Bard, Nolan ;
Foerster, Jakob N. ;
Chandar, Sarath ;
Burch, Neil ;
Lanctot, Marc ;
Song, H. Francis ;
Parisotto, Emilio ;
Dumoulin, Vincent ;
Moitra, Subhodeep ;
Hughes, Edward ;
Dunning, Iain ;
Mourad, Shibl ;
Larochelle, Hugo ;
Bellemare, Marc G. ;
Bowling, Michael .
ARTIFICIAL INTELLIGENCE, 2020, 280
[6]   Opportunities for multiagent systems and multiagent reinforcement learning in traffic control [J].
Bazzan, Ana L. C. .
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) :342-375
[7]  
Bianchi R. A., 2012, P 11 INT C AUT AG MU, P1395
[8]   A comprehensive survey of multiagent reinforcement learning [J].
Busoniu, Lucian ;
Babuska, Robert ;
De Schutter, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172
[9]  
Busoniu L, 2010, STUD COMPUT INTELL, V310, P183
[10]   Solving traffic queues at controlled-signalized intersections in continuous-time Markov games [J].
Castillo Gonzalez, Rodrigo ;
Clempner, Julio B. ;
Poznyak, Alexander S. .
MATHEMATICS AND COMPUTERS IN SIMULATION, 2019, 166 :283-297