Multi-Agent Attention Double Actor-Critic Framework for Intelligent Traffic Light Control in Urban Scenarios With Hybrid Traffic

被引:15
作者
Liu, Bingyi [1 ,2 ]
Han, Weizhen [1 ,2 ]
Wang, Enshu [3 ]
Xiong, Shengwu [1 ,2 ]
Wu, Libing [4 ]
Wang, Qian [4 ]
Wang, Jianping [5 ]
Qiao, Chunming
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[2] Wuhan Univ Technol, Sanya Sci & Educ Innovat Pk, Sanya 542024, Peoples R China
[3] SUNY Buffalo, Buffalo, NY 14260 USA
[4] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430072, Peoples R China
[5] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph attention networks; multi-agent reinforcement learning; options framework; traffic light control;
D O I
10.1109/TMC.2022.3233879
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In real-world urban environments, hybrid and disorder traffic brings new challenges for the intelligent traffic light control system (ITLCS). Apart from coordinating traffic flows around intersections, the ITLCS is responsive to ensuring high priority vehicles pass through intersections quickly. To this end, we formulate the multiple intersections' decision-making problem as a Semi-Markov game and propose a multi-agent attention double actor-critic (MAADAC) framework to solve this game, integrating the options framework with graph attention networks (GATs). Specifically, the options framework empowers agents to learn to make a long sequence of satisfactory decisions, such as keeping a reasonable phase for a short period to ensure high priority vehicles pass through intersections quickly. Besides, we adopt GATs to capture graph-structure mutual influences among agents. We set up a simulator based on real-world city road networks and conduct extensive experiments to evaluate the performance of MAADAC. The experimental results show that MAADAC can reduce high priority vehicles' waiting time in the interval of 18.16%-38.14% versus the density of vehicles in real-world urban scenarios over several state-of-the-art approaches. Also, our framework can guarantee the passing efficiency of high priority vehicles under various traffic conditions with the change in the proportion of high priority vehicles.
引用
收藏
页码:660 / 672
页数:13
相关论文
共 34 条
  • [1] Asaduzzaman K., 2017, IEEE 86 VEH TECHNOL, P1
  • [2] Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726
  • [3] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
  • [4] Cools SB., 2013, ADV APPL SELF ORG SY, P45, DOI DOI 10.1007/978-1-4471-5113-5_3
  • [5] Hardinda Krishna Priawan, 2020, 2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), P113, DOI 10.1109/ICAIIC48513.2020.9065242
  • [6] Holzleitner M., 2021, T LARGE SCALE DATA A, P105
  • [7] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [8] Jiang J C, 2020, P 8 INT C LEARN REPR
  • [9] Krajzewicz D., 2012, INT J ADV SYST MEAS, V3, P128
  • [10] Deep Reinforcement Learning-Based Traffic Light Scheduling Framework for SDN-Enabled Smart Transportation System
    Kumar, Neetesh
    Mittal, Sarthak
    Garg, Vaibhav
    Kumar, Neeraj
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (03) : 2411 - 2421