Decentralized multi-region perimeter control in complex urban environments using reinforcement and imitation learning

被引：0

作者：

Kampitakis, Emmanouil ^{[1
]}

Vlahogianni, Eleni I. ^{[1
]}

机构：

[1] Natl Tech Univ Athens, 5 Iroon Polytech Str,Zografou Campus, GR-15773 Athens, Greece

来源：

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES | 2025年 / 178卷

关键词：

Multi-region perimeter control; Reinforcement learning; Imitation learning; Queue balancing; Decentralization; FUNDAMENTAL DIAGRAM; NETWORKS;

D O I：

10.1016/j.trc.2025.105253

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

Multi-region Perimeter Control is a promising traffic control approach, effectively regulating vehicle flow between congested regions to improve travel efficiency and prevent network oversaturation. Recent advancements in AI and deep reinforcement learning (DRL) have led researchers to explore the application of DRL for perimeter control applications. This study introduces two complementary methods for multi-region perimeter control. The first is a centralized and model-free DRL controller that operates without requiring prior knowledge of a macroscopic fundamental diagram (MFD). The second one involves decentralization at two different levels, i.e., region- and boundary-level, achieved by using multi-agent imitation learning aiming to decentralize the policy of the centralized controller. Both approaches are augmented by a queue balancing framework and operate in a hierarchical manner. The queue balancing framework is responsible for the translation of ordered inflows to optimal green time durations based on relative queues to account for the heterogeneity found in real-world networks. The proposed controllers, trained directly in a realistic large-scale microsimulation network, are evaluated and compared against two widely used perimeter control methods found in literature, i. e., a multivariable Proportional-Integral (PI) feedback regulator and a model predictive control approach. Results show that both proposed methods outperform the baselines in terms of efficiency. Furthermore, generalization tests under various synthetic demand scenarios, including scenarios affected by signal noise, demonstrate the robustness of the learned policies. These findings highlight the potential of imitation learning to effectively decentralize DRL-based perimeter control while maintaining high performance levels.

引用

页数：23

共 53 条

[1]

Abdeen Mohammad A. R., 2023, Personal and Ubiquitous Computing, P1747, DOI [10.1007/s00779-023-01738-9, 10.1007/s00779-023-01738-9]

[2] Perimeter and boundary flow control in multi-reservoir heterogeneous networks [J].

Aboudolas, Konstantinos ;

Geroliminis, Nikolas .

TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2013, 55 :265-281

[3] Identification of optimal locations of adaptive traffic signal control using heuristic methods [J].

Ahmed, Tanveer ;

Liu, Hao ;

Gayah, Vikash V. .

INTERNATIONAL JOURNAL OF TRANSPORTATION SCIENCE AND TECHNOLOGY, 2024, 13 :122-136

[4]

Bynum Michael L., 2021, Pyomo-Optimization Modeling in Python, V67, DOI DOI 10.1007/978-3-030-68928-5

[5] Data efficient reinforcement learning and adaptive optimal perimeter control of network traffic dynamics [J].

Chen, C. ;

Huang, Y. P. ;

Lam, W. H. K. ;

Pan, T. L. ;

Hsu, S. C. ;

Sumalee, A. ;

Zhong, R. X. .

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 142

[6] An Iterative Adaptive Dynamic Programming Approach for Macroscopic Fundamental Diagram-Based Perimeter Control and Route Guidance [J].

Chen, Can ;

Geroliminis, Nikolas ;

Zhong, Renxin .

TRANSPORTATION SCIENCE, 2024, 58 (04) :896-918

[7] Tracking perimeter control for two-region macroscopic traffic dynamics: An adaptive dynamic programming approach [J].

Chen, Can ;

Huang, Yunping ;

Zhang, Hongwei ;

Hsu, Shu-Chien ;

Zhong, Renxin .

2024 IEEE 27TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2024, :1342-1347

[8]

Cplex I.I., 2009, International Business Machines Corporation, V46, P157

[9] An analytical approximation for the macroscopic fundamental diagram of urban traffic [J].

Daganzo, Carlos F. ;

Geroliminis, Nikolas .

TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2008, 42 (09) :771-781

[10] Urban gridlock: Macroscopic modeling and mitigation approaches [J].

Daganzo, Carlos F. .

TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2007, 41 (01) :49-62

← 1 2 3 4 5 6 →