A multi-agent reinforcement learning based approach for intelligent traffic signal control

被引:0
作者
Benhamza, Karima [1 ]
Seridi, Hamid [1 ]
Agguini, Meriem [2 ]
Bentagine, Amel [2 ]
机构
[1] Univ 08 May 1945, LabSTIC, Guelma, Algeria
[2] Univ 08 May 1945, Comp Sci Dept, Guelma, Algeria
关键词
Multi-agent reinforcement learning; Traffic signal control; Congestion management; Mutual reward; Replay memory; Urban traffic optimization; OPTIMIZATION; INTERSECTION;
D O I
10.1007/s12530-024-09622-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study addresses the intricate challenges of urban traffic congestion by presenting a novel Multi-Agent Reinforcement Learning (MARL) approach. In response to the critical need for adaptive traffic management solutions in multiple intersections networks, the proposed model integrates a dual-tiered reward system, encompassing personalized and mutual rewards, alongside the incorporation of the replay memory concept. This integration aims to thoroughly evaluate traffic light contributions and enhance traffic signal control strategies. The carefully defined parameters within the reward function are closely aligned with overarching system objectives, specifically targeting the minimization of congestion, delays, and emergency response times. Through simulated scenarios featuring diverse traffic conditions, the proposed MARL model exhibits promising outcomes, demonstrating remarkable adaptability and efficiency even under high traffic volumes. Comparative results with traditional methods, including fixed-time, actuated control, green wave strategy and distributed multi-agent Q-learning method, show the model's robust learning and adaptive capabilities, resulting in substantial reductions in queue lengths and delays. These results validate the model's effectiveness in optimizing traffic flow and efficiently managing congestion, thus making a significant contribution to the advancement of intelligent traffic systems.
引用
收藏
页码:2383 / 2397
页数:15
相关论文
共 63 条
  • [31] BRANCH-AND-BOUND METHODS - A SURVEY
    LAWLER, EL
    WOOD, DE
    [J]. OPERATIONS RESEARCH, 1966, 14 (04) : 699 - +
  • [32] Li T, 2008, PROCEEDINGS OF THE 11TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, P286, DOI 10.1109/ITSC.2008.4732718
  • [33] Li YZ, 2017, ADV NEUR IN, V30
  • [34] Liang X., 2018, ARXIV
  • [35] A Deep Reinforcement Learning Network for Traffic Light Cycle Control
    Liang, Xiaoyuan
    Du, Xunsheng
    Wang, Guiling
    Han, Zhu
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) : 1243 - 1253
  • [36] Lopez PA, 2018, IEEE INT C INTELL TR, P2575, DOI 10.1109/ITSC.2018.8569938
  • [37] MILLER AJ, 1963, OPER RES QUART, V14, P373, DOI 10.2307/3006800
  • [38] Rhodes to Intelligent Transportation Systems
    Mirchandani, P
    Wang, FY
    [J]. IEEE INTELLIGENT SYSTEMS, 2005, 20 (01): : 10 - 15
  • [39] Review of road traffic control strategies
    Papageorgiou, M
    Diakaki, C
    Dinopoulou, V
    Kotsialos, A
    Wang, YB
    [J]. PROCEEDINGS OF THE IEEE, 2003, 91 (12) : 2043 - 2067
  • [40] Papageorgiou M, 2007, HBK OPERAT RES MANAG, V14, P715, DOI 10.1016/S0927-0507(06)14011-6