Adaptive network traffic control with approximate dynamic programming based on a non-homogeneous Poisson demand model

被引:21
作者
Chen, Siqi [1 ,2 ]
Lu, Xing [1 ,2 ,3 ]
机构
[1] Beijing Jiaotong Univ, Dept Math, Beijing 100044, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian, Peoples R China
[3] Beijing Jiaotong Univ, Beijing Lab Natl Econ Secur Early warning Engn, Beijing, Peoples R China
关键词
Non-homogeneous Poisson process; cell transmission method; traffic signal control; approximate dynamic programming; temporal difference learning;
D O I
10.1080/21680566.2024.2336029
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
In this study, we develop a stochastic dynamic traffic-flow model subject to practical restrictions under the non-homogeneous Poisson vehicle arrival process. Using the cell transmission strategy, we establish traffic dynamics between two intersections. We also discuss simulating the random demand of source links from an estimated intensity function. Additionally, we propose an algorithm to optimize time interval division for aggregated data, aiming to enhance estimation performance. We explore applying our traffic flow model to the adaptive traffic network management problem, which is formulated as a Markov decision process. Leveraging approximate dynamic programming with recursive least squares-temporal difference learning, we achieve adaptive optimal policies. To validate our approach, we conduct a series of numerical experiments with random demands. The results of non-homogeneous Poisson demand conducted using random numbers and a real-word dataset indicate high efficiency with the piecewise constant, I-SMOOTH, and MNO-PQRS estimators. Compared to the Webster and Max-pressure control systems, our proposed approximate dynamic programming-based model exhibits superior stability and applicability.
引用
收藏
页数:30
相关论文
共 51 条
[1]   Discharge control policy based on density and speed for deep Q-learning adaptive traffic signal [J].
Ahmed, Muaid Abdulkareem Alnazir ;
Khoo, Hooi Ling ;
Ng, Oon-Ee .
TRANSPORTMETRICA B-TRANSPORT DYNAMICS, 2023, 11 (01) :1707-1726
[2]   Eligibility traces and forgetting factor in recursive least-squares-based temporal difference [J].
Baldi, Simone ;
Zhang, Zichen ;
Liu, Di .
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2022, 36 (02) :334-353
[3]   Throughput properties and optimal locations for limited deployment of Max-pressure controls [J].
Barman, Simanta ;
Levin, Michael W. .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 150
[4]  
Bowman Casey N., 2018, WINTER SIMULATION C
[5]   Technical update: Least-squares temporal difference learning [J].
Boyan, JA .
MACHINE LEARNING, 2002, 49 (2-3) :233-246
[6]   Adaptive traffic signal control using approximate dynamic programming [J].
Cai, Chen ;
Wong, Chi Kwong ;
Heydecker, Benjamin G. .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2009, 17 (05) :456-474
[7]   MNO-PQRS: Max Nonnegativity Ordering-Piecewise-Quadratic Rate Smoothing [J].
Chen, Huifen ;
Schmeiser, Bruce W. .
ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2017, 27 (03)
[8]   I-SMOOTH: Iteratively Smoothing Mean-Constrained and Nonnegative Piecewise-Constant Functions [J].
Chen, Huifen ;
Schmeiser, Bruce .
INFORMS JOURNAL ON COMPUTING, 2013, 25 (03) :432-445
[9]  
Chen Huifen, 2019, WINTER SIMULATION C
[10]  
Chen Huifen, 2018, WINTER SIMULATION C