Multi-Agent Attention Double Actor-Critic Framework for Intelligent Traffic Light Control in Urban Scenarios With Hybrid Traffic

Cited by: 15

Authors:

Liu, Bingyi [1,2]

Han, Weizhen [1,2]

Wang, Enshu [3]

Xiong, Shengwu [1,2]

Wu, Libing [4]

Wang, Qian [4]

Wang, Jianping [5]

Qiao, Chunming

Affiliations:

[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China

[2] Wuhan Univ Technol, Sanya Sci & Educ Innovat Pk, Sanya 542024, Peoples R China

[3] SUNY Buffalo, Buffalo, NY 14260 USA

[4] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430072, Peoples R China

[5] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024 / Vol. 23 / No. 01

Funding:

National Natural Science Foundation of China;

Keywords:

Graph attention networks; multi-agent reinforcement learning; options framework; traffic light control;

DOI:

10.1109/TMC.2022.3233879

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

In real-world urban environments, hybrid and disordered traffic brings new challenges for the intelligent traffic light control system (ITLCS). Apart from coordinating traffic flows around intersections, the ITLCS is responsible for ensuring that high-priority vehicles pass through intersections quickly. To this end, we formulate the multi-intersection decision-making problem as a Semi-Markov game and propose a multi-agent attention double actor-critic (MAADAC) framework to solve this game, integrating the options framework with graph attention networks (GATs). Specifically, the options framework empowers agents to learn to make a long sequence of satisfactory decisions, such as keeping a reasonable phase for a short period to ensure that high-priority vehicles pass through intersections quickly. In addition, we adopt GATs to capture the graph-structured mutual influences among agents. We set up a simulator based on real-world city road networks and conduct extensive experiments to evaluate the performance of MAADAC. The experimental results show that, compared with several state-of-the-art approaches, MAADAC reduces high-priority vehicles' waiting time by 18.16%-38.14%, depending on vehicle density, in real-world urban scenarios. Our framework can also guarantee the passing efficiency of high-priority vehicles under various traffic conditions as the proportion of high-priority vehicles changes.


Pages: 660-672

Page count: 13

