Threshold Tuning Using Stochastic Optimization for Graded Signal Control

被引：31

作者：

Prashanth, L. A. ^{[1
]}

Bhatnagar, Shalabh ^{[1
]}

机构：

[1] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore 560012, Karnataka, India

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2012年 / 61卷 / 09期

关键词：

Deterministic perturbation sequences; intelligent transportation systems; simultaneous perturbation stochastic approximation (SPSA); stochastic optimization; threshold tuning; traffic signal control; TRAFFIC SIGNALS; REAL-TIME; APPROXIMATION; SYSTEM; NETWORKS;

D O I：

10.1109/TVT.2012.2209904

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Adaptive control of traffic lights is a key component of any intelligent transportation system. Many real-time traffic light control (TLC) algorithms are based on graded thresholds, because precise information about the traffic congestion in the road network is hard to obtain in practice. For example, using thresholds L-1 and L-2, we could mark the congestion level on a particular lane as "low," "medium," or "high" based on whether the queue length on the lane is below L-1, between L-1 and L-2, or above L-2, respectively. However, the TLC algorithms that were proposed in the literature incorporate fixed values for the thresholds, which, in general, are not optimal for all traffic conditions. In this paper, we present an algorithm based on stochastic optimization to tune the thresholds that are associated with a TLC algorithm for optimal performance. We also propose the following three novel TLC algorithms: 1) a full-state Q-learning algorithm with state aggregation, 2) a Q-learning algorithm with function approximation that involves an enhanced feature selection scheme, and 3) a priority-based TLC scheme. All these algorithms are threshold based. Next, we combine the threshold-tuning algorithm with the three aforementioned algorithms. Such a combination results in several interesting consequences. For example, in the case of Q-learning with full-state representation, our threshold-tuning algorithm suggests an optimal way of clustering states to reduce the cardinality of the state space, and in the case of the Q-learning algorithm with function approximation, our (threshold-tuning) algorithm provides a novel feature adaptation scheme to obtain an "optimal" selection of features. Our tuning algorithm is an incremental-update online scheme with proven convergence to the optimal values of thresholds. Moreover, the additional computational effort that is required because of the integration of the tuning scheme in any of the graded-threshold-based TLC algorithms is minimal. Simulation results show a significant gain in performance when our threshold-tuning algorithm is used in conjunction with various TLC algorithms compared to the original TLC algorithms without tuning and with fixed thresholds.

引用

页码：3865 / 3880

页数：16

共 50 条

[31] Multi-stage stochastic program to optimize signal timings under coordinated adaptive control
Ma, Wanjing
An, Kun
Lo, Hong K.
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2016, 72 : 342 - 359
[32] Adaptive Neural Stochastic Control With Lipschitz Constant Optimization
Geng, Lian
Qu, Qingyu
Ran, Maopeng
Liu, Kexin
Lu, Jinhu
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, 71 (07) : 3294 - 3306
[33] A TWO-TIME-SCALE STOCHASTIC OPTIMIZATION FRAMEWORK WITH APPLICATIONS IN CONTROL AND REINFORCEMENT LEARNING
Zeng, Sihan
Doan, Thinh T.
Romberg, Justin
SIAM JOURNAL ON OPTIMIZATION, 2024, 34 (01) : 946 - 976
[34] Medical image registration using stochastic optimization
Mohamed, Waleed
Ben Hamza, A.
OPTICS AND LASERS IN ENGINEERING, 2010, 48 (12) : 1213 - 1223
[35] Stochastic Online Optimization using Kalman Recursion
de Vilmarest, Joseph
Wintenberger, Olivier
JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
[36] System Optimization Using a Parallel Stochastic Approach
Zaplatilek, Karel
Leuchter, Jan
ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2013, 13 (02) : 73 - 76
[37] Optimal Threshold Levels in Stochastic Fluid Models via Simulation-based Optimization
Gül Gürkan
Fikri Karaesmen
Özge Özdemir
Discrete Event Dynamic Systems, 2007, 17 : 53 - 97
[38] Optimal threshold levels in stochastic fluid models via simulation-based optimization
Gurkan, Gul
Karaesmen, Fikri
Ozdemir, Ozge
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2007, 17 (01): : 53 - 97
[39] Signal adaptive cooperative control of two adjacent traffic intersections using a two-stage algorithm
Zou, Yuanyang
Liu, Renhuai
Li, Ya
Ma, Yingshuang
Wang, Guoxin
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 174
[40] Cooperative Method of Traffic Signal Optimization and Speed Control of Connected Vehicles at Isolated Intersections
Xu, Biao
Ban, Xuegang Jeff
Bian, Yougang
Li, Wan
Wang, Jianqiang
Li, Shengbo Eben
Li, Keqiang
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (04) : 1390 - 1403

← 1 2 3 4 5 →