Multiagent reinforcement learning for autonomous driving in traffic zones with unsignalized intersections

被引:18
作者
Spatharis, Christos [1 ]
Blekas, Konstantinos [1 ]
机构
[1] Univ Ioannina, Dept Comp Sci & Engn, Ioannina, Greece
关键词
autonomous driving; coordinating vehicles; knowledge reusing; multiagent reinforcement learning; traffic unsignalized intersections; VEHICLES;
D O I
10.1080/15472450.2022.2109416
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
In this work we present a multiagent deep reinforcement learning approach for autonomous driving vehicles that is able to operate in traffic networks with unsignalized intersections. The key aspects of the proposed study are the introduction of route-agents as the main building block of the system, as well as a collision term that allows the cooperation among vehicles and the construction of an efficient reward function. These have the advantage of establishing an enhanced collaborative multiagent deep reinforcement learning scheme that manages to control multiple vehicles and navigate them safely and efficiently-economically to their destination. In addition, it provides the beneficial flexibility to lay down a platform for transfer learning and reusing knowledge from the agents' policies in handling unknown traffic scenarios. We provide several experimental results in simulated road traffic networks of variable complexity and diverse characteristics using the SUMO environment that empirically illustrate the efficiency of the proposed multiagent framework.
引用
收藏
页码:103 / 119
页数:17
相关论文
共 39 条
[1]   Intelligent Transportation and Control Systems Using Data Mining and Machine Learning Techniques: A Comprehensive Study [J].
Alsrehin, Nawaf O. ;
Klaib, Ahmad F. ;
Magableh, Aws .
IEEE ACCESS, 2019, 7 :49830-49857
[2]   Reinforcement learning-based multi-agent system for network traffic signal control [J].
Arel, I. ;
Liu, C. ;
Urbanik, T. ;
Kohls, A. G. .
IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) :128-135
[3]  
Bouton M, 2017, IEEE INT VEH SYM, P825, DOI 10.1109/IVS.2017.7995818
[4]  
Camponogara E, 2003, LECT NOTES ARTIF INT, V2902, P324
[5]  
Cao ZG, 2017, AAAI CONF ARTIF INTE, P4481
[6]  
Cao ZG, 2016, AAAI CONF ARTIF INTE, P3814
[7]  
Da Silva FL, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P5487
[8]  
Da Silva FL, 2019, J ARTIF INTELL RES, V64, P645
[9]   Traffic Coordination at Road Intersections: Autonomous Decision-Making Algorithms Using Model-Based Heuristics [J].
de Campos, Gabriel Rodrigues ;
Falcone, Paolo ;
Hult, Robert ;
Wymeersch, Henk ;
Sjoberg, Jonas .
IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2017, 9 (01) :8-21
[10]  
De La Fortelle A., 2014, P 21 WORLD C ITS WOR, P2618