Discharge control policy based on density and speed for deep Q-learning adaptive traffic signal

被引:5
作者
Ahmed, Muaid Abdulkareem Alnazir [1 ,3 ]
Khoo, Hooi Ling [1 ]
Ng, Oon-Ee [2 ]
机构
[1] Univ Tunku Abdul Rahman, Lee Kong Chian Fac Engn & Sci, Dept Civil Engn, Kajang, Malaysia
[2] Univ Tunku Abdul Rahman, Lee Kong Chian Fac Engn & Sci, Dept Mechatron & Biomed Engn, Kajang, Malaysia
[3] Univ Tunku Abdul Rahman, Lee Kong Chian Fac Engn & Sci, Dept Civil Engn, Kajang 43000, Malaysia
关键词
downstream discharge routes; traffic control policy; adaptive signal controller; deep Q-learning; traffic network management; intersection capacity; CONNECTED VEHICLE TECHNOLOGY; FLOW;
D O I
10.1080/21680566.2023.2243388
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
This study introduces a control strategy based on intersection capacity. The optimisation technique is formulated from available space at discharge routes. The downstream policy utilises density and speed (k-v) measurements to guide a deep Q-learning agent (DQLA) in managing a signalised junction using a constrained local communication protocol. Testing of the DQLA k-v strategy against other control methods is carried out in a simulated micro-model of a real urban traffic network. Though the adaptive signal system design is decentralised, statistical analyses explicitly prove the effectiveness of the discharge-based controller in mitigating operation at a global scale. The DQLA k-v controller has achieved significant cost savings in waiting time (10%-36%) and travel time (5%-25%) and asserted the highest mean travel speed (3.4 m/s). Consequently, vehicular traffic experienced the least time loss when traversing routes and witnessed fewer stops leading to close to optimum network operation at a 0.80 clearance ratio.
引用
收藏
页码:1707 / 1726
页数:20
相关论文
共 45 条
[1]   Hierarchical control of traffic signals using Q-learning with tile coding [J].
Abdoos, Monireh ;
Mozayani, Nasser ;
Bazzan, Ana L. C. .
APPLIED INTELLIGENCE, 2014, 40 (02) :201-213
[2]  
Ahmed M. A. A., 2022, KSCE J CIV ENG, P1
[3]   An Adaptive Method for Traffic Signal Control Based on Fuzzy Logic With Webster and Modified Webster Formula Using SUMO Traffic Simulator [J].
Ali, Muzamil Eltejani Mohammed ;
Durdu, Akif ;
Celtek, Seyit Alperen ;
Yilmaz, Alper .
IEEE ACCESS, 2021, 9 :102985-102997
[4]   Learning in groups of traffic signals [J].
Bazzan, Ana L. C. ;
de Oliveira, Denise ;
da Silva, Bruno C. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2010, 23 (04) :560-568
[5]   Distributed coordinated signal timing optimization in connected transportation networks [J].
Bin Al Islam, S. M. A. ;
Hajbabaie, Ali .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2017, 80 :272-285
[6]   Distributed learning and multi-objectivity in traffic light control [J].
Brys, Tim ;
Pham, Tong T. ;
Taylor, Matthew E. .
CONNECTION SCIENCE, 2014, 26 (01) :65-83
[7]   Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control [J].
Chu, Tianshu ;
Wang, Jie ;
Codeca, Lara ;
Li, Zhaojian .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) :1086-1095
[8]   Traffic Graph Convolutional Recurrent Neural Network: A Deep Learning Framework for Network-Scale Traffic Learning and Forecasting [J].
Cui, Zhiyong ;
Henrickson, Kristian ;
Ke, Ruimin ;
Wang, Yinhai .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (11) :4883-4894
[9]   Multi-agent immune networks to control interrupted flow at signalized intersections [J].
Darmoul, Saber ;
Elkosantini, Sabeur ;
Louati, Ali ;
Ben Said, Lamjed .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2017, 82 :290-313
[10]   A real-time adaptive signal control in a connected vehicle environment [J].
Feng, Yiheng ;
Head, K. Larry ;
Khoshmagham, Shayan ;
Zamanipour, Mehdi .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2015, 55 :460-473