eMARLIN plus : Addressing Partial Observability to Promote Traffic Signal Coordination by Leveraging Historical Information

被引:1
作者
Wang, Xiaoyu [1 ]
Taitler, Ayal [2 ]
Smirnov, Ilia [1 ]
Sanner, Scott [3 ]
Abdulhai, Baher [1 ]
机构
[1] Univ Toronto, Dept Civil & Mineral Engn, Toronto, ON M5G 2C4, Canada
[2] Ben Gurion Univ Negev, Dept Ind Engn & Man agement, IL-8410501 Beer Sheva, Israel
[3] Univ Toronto, Dept Mech & Ind Engn, Toronto, ON M5G 2C4, Canada
关键词
Training; Collaboration; Observability; Decision making; Decentralized control; Cameras; Optimization; Delays; Vehicle dynamics; Uncertainty; Adaptive traffic signal control; Markov decision processes; partial observability; state factorization; multi-agent reinforcement learning;
D O I
10.1109/TITS.2024.3462951
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In Adaptive Traffic Signal Control (ATSC) systems, real-time responsiveness relies on sensor data for signal timing adjustments. However, limitations in sensor capabilities result in an incomplete representation of the true system state. Hence, practical controllers can only access restricted dynamical features within specific detection areas that lead to partial observability, where identical observations may correspond to different system dynamics, hindering optimal decision-making. To address these challenges, we explore the existence and sources of partial observability in ATSC, formulating it within the framework of Markov decision processes. The global ATSC problem is factorized and decoupled to reveal structural properties in underlying system dynamics. This enhanced understanding reveals the dominant information that should be considered by decentralized controllers and guides the derivation of eMARLIN + . Experimental validation on synthetic and real-world scenarios demonstrates eMARLIN + 's effectiveness in enhancing agent-level coordination and surpassing strong baselines in minimizing travel delay. Additional diagnostic analysis of our learned controller further validates the effectiveness of our information-sharing scheme.
引用
收藏
页码:21380 / 21392
页数:13
相关论文
共 2 条
  • [1] Integrating Multimodality and Partial Observability Solutions Into Decentralized Multiagent Reinforcement Learning Adaptive Traffic Signal Control
    Othman, Kareem
    Wang, Xiaoyu
    Shalaby, Amer
    Abdulhai, Baher
    IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 6 : 322 - 334
  • [2] A real-time network-level traffic signal control methodology with partial connected vehicle information
    Bin Al Islam, S. M. A.
    Hajbabaie, Ali
    Aziz, H. M. Abdul
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 121