EMVLight: A multi-agent reinforcement learning framework for an emergency vehicle decentralized routing and traffic signal control system

被引:30
作者
Su, Haoran [1 ]
Zhong, Yaofeng D. [2 ]
Chow, Joseph Y. J. [1 ]
Dey, Biswadip [2 ]
Jin, Li [3 ]
机构
[1] NYU, New York, NY USA
[2] Siemens Corp, Technol, Princeton, NJ USA
[3] Shanghai Jiao Tong Univ, UM Joint Inst, Dept Automat, Shanghai, Peoples R China
关键词
Emergency vehicle management; Traffic signal control; Deep reinforcement learning; Multi-agent system; PREEMPTION; NETWORKS; PATHS; GUIDANCE; DESIGN; LIGHTS; MODEL;
D O I
10.1016/j.trc.2022.103955
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Emergency vehicles (EMVs) play a crucial role in responding to time-critical calls such as medical emergencies and fire outbreaks in urban areas. Existing methods for EMV dispatch typically optimize routes based on historical traffic-flow data and design traffic signal pre-emption accordingly; however, we still lack a systematic methodology to address the coupling between EMV routing and traffic signal control. In this paper, we propose EMVLight, a decentralized reinforcement learning (RL) framework for joint dynamic EMV routing and traffic signal pre-emption. We adopt the multi-agent advantage actor-critic method with policy sharing and spatial discounted factor. This framework addresses the coupling between EMV navigation and traffic signal control via an innovative design of multi-class RL agents and a novel pressure -based reward function. The proposed methodology enables EMVLight to learn network-level cooperative traffic signal phasing strategies that not only reduce EMV travel time but also shortens the travel time of non-EMVs. Simulation-based experiments indicate that EMVLight enables up to a 42.6% reduction in EMV travel time as well as an 23.5% shorter average travel time compared with existing approaches.
引用
收藏
页数:25
相关论文
共 85 条
[1]   Reinforcement learning for True Adaptive traffic signal control [J].
Abdulhai, B ;
Pringle, R ;
Karakoulas, GJ .
JOURNAL OF TRANSPORTATION ENGINEERING, 2003, 129 (03) :278-285
[2]   Decentralized route guidance architectures with user preferences in urban transportation networks [J].
Adacher, L. ;
Oliva, G. ;
Pascucci, F. .
TRANSPORTATION: CAN WE DO MORE WITH LESS RESOURCES? - 16TH MEETING OF THE EURO WORKING GROUP ON TRANSPORTATION - PORTO 2013, 2014, 111 :1054-1062
[3]  
Administration D.F.H., 2020, US FREEW SHOULD TRAV
[4]  
Agarwal A., 2016, ARXIV, DOI [10.1109/IVS.2016.7535412,arXiv.org, DOI 10.1109/IVS.2016.7535412,ARXIV.ORG]
[5]  
Analytics N, 2021, END TO END RESPONSE
[6]  
[Anonymous], 2009, Socio-Economic Planning Sciences
[7]  
[Anonymous], 1993, Transp. Res. Rec
[8]  
Asaduzzaman M., 2017, 2017 IEEE 86 VEHICUL, P1
[9]   Adaptive traffic signal control with actor-critic methods in a real-world traffic network with different traffic disruption events [J].
Aslani, Mohammad ;
Mesgari, Mohammad Saadi ;
Wiering, Marco .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2017, 85 :732-752
[10]   Global incidences of out-of-hospital cardiac arrest and survival rates: Systematic review of 67 prospective studies [J].
Berdowski, Jocelyn ;
Berg, Robert A. ;
Tijssen, Jan G. P. ;
Koster, Rudolph W. .
RESUSCITATION, 2010, 81 (11) :1479-1487