Threshold Tuning Using Stochastic Optimization for Graded Signal Control

被引:31
|
作者
Prashanth, L. A. [1 ]
Bhatnagar, Shalabh [1 ]
机构
[1] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore 560012, Karnataka, India
关键词
Deterministic perturbation sequences; intelligent transportation systems; simultaneous perturbation stochastic approximation (SPSA); stochastic optimization; threshold tuning; traffic signal control; TRAFFIC SIGNALS; REAL-TIME; APPROXIMATION; SYSTEM; NETWORKS;
D O I
10.1109/TVT.2012.2209904
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Adaptive control of traffic lights is a key component of any intelligent transportation system. Many real-time traffic light control (TLC) algorithms are based on graded thresholds, because precise information about the traffic congestion in the road network is hard to obtain in practice. For example, using thresholds L-1 and L-2, we could mark the congestion level on a particular lane as "low," "medium," or "high" based on whether the queue length on the lane is below L-1, between L-1 and L-2, or above L-2, respectively. However, the TLC algorithms that were proposed in the literature incorporate fixed values for the thresholds, which, in general, are not optimal for all traffic conditions. In this paper, we present an algorithm based on stochastic optimization to tune the thresholds that are associated with a TLC algorithm for optimal performance. We also propose the following three novel TLC algorithms: 1) a full-state Q-learning algorithm with state aggregation, 2) a Q-learning algorithm with function approximation that involves an enhanced feature selection scheme, and 3) a priority-based TLC scheme. All these algorithms are threshold based. Next, we combine the threshold-tuning algorithm with the three aforementioned algorithms. Such a combination results in several interesting consequences. For example, in the case of Q-learning with full-state representation, our threshold-tuning algorithm suggests an optimal way of clustering states to reduce the cardinality of the state space, and in the case of the Q-learning algorithm with function approximation, our (threshold-tuning) algorithm provides a novel feature adaptation scheme to obtain an "optimal" selection of features. Our tuning algorithm is an incremental-update online scheme with proven convergence to the optimal values of thresholds. Moreover, the additional computational effort that is required because of the integration of the tuning scheme in any of the graded-threshold-based TLC algorithms is minimal. Simulation results show a significant gain in performance when our threshold-tuning algorithm is used in conjunction with various TLC algorithms compared to the original TLC algorithms without tuning and with fixed thresholds.
引用
收藏
页码:3865 / 3880
页数:16
相关论文
共 50 条
  • [31] Multi-stage stochastic program to optimize signal timings under coordinated adaptive control
    Ma, Wanjing
    An, Kun
    Lo, Hong K.
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2016, 72 : 342 - 359
  • [32] Adaptive Neural Stochastic Control With Lipschitz Constant Optimization
    Geng, Lian
    Qu, Qingyu
    Ran, Maopeng
    Liu, Kexin
    Lu, Jinhu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, 71 (07) : 3294 - 3306
  • [33] A TWO-TIME-SCALE STOCHASTIC OPTIMIZATION FRAMEWORK WITH APPLICATIONS IN CONTROL AND REINFORCEMENT LEARNING
    Zeng, Sihan
    Doan, Thinh T.
    Romberg, Justin
    SIAM JOURNAL ON OPTIMIZATION, 2024, 34 (01) : 946 - 976
  • [34] Medical image registration using stochastic optimization
    Mohamed, Waleed
    Ben Hamza, A.
    OPTICS AND LASERS IN ENGINEERING, 2010, 48 (12) : 1213 - 1223
  • [35] Stochastic Online Optimization using Kalman Recursion
    de Vilmarest, Joseph
    Wintenberger, Olivier
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [36] System Optimization Using a Parallel Stochastic Approach
    Zaplatilek, Karel
    Leuchter, Jan
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2013, 13 (02) : 73 - 76
  • [37] Optimal Threshold Levels in Stochastic Fluid Models via Simulation-based Optimization
    Gül Gürkan
    Fikri Karaesmen
    Özge Özdemir
    Discrete Event Dynamic Systems, 2007, 17 : 53 - 97
  • [38] Optimal threshold levels in stochastic fluid models via simulation-based optimization
    Gurkan, Gul
    Karaesmen, Fikri
    Ozdemir, Ozge
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2007, 17 (01): : 53 - 97
  • [39] Signal adaptive cooperative control of two adjacent traffic intersections using a two-stage algorithm
    Zou, Yuanyang
    Liu, Renhuai
    Li, Ya
    Ma, Yingshuang
    Wang, Guoxin
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 174
  • [40] Cooperative Method of Traffic Signal Optimization and Speed Control of Connected Vehicles at Isolated Intersections
    Xu, Biao
    Ban, Xuegang Jeff
    Bian, Yougang
    Li, Wan
    Wang, Jianqiang
    Li, Shengbo Eben
    Li, Keqiang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (04) : 1390 - 1403