Multi-Agent Reinforcement Learning for Highway Platooning

被引：5

作者：

Kolat, Mate ^{[1
]}

Becsi, Tamas ^{[1
]}

机构：

[1] Budapest Univ Technol & Econ, Dept Control Transportat & Vehicle Syst, H-1111 Budapest, Hungary

来源：

ELECTRONICS | 2023年 / 12卷 / 24期

关键词：

deep learning; reinforcement learning; platooning; road traffic control; multi-agent systems; VEHICLE; GAME;

D O I：

10.3390/electronics12244963

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The advent of autonomous vehicles has opened new horizons for transportation efficiency and safety. Platooning, a strategy where vehicles travel closely together in a synchronized manner, holds promise for reducing traffic congestion, lowering fuel consumption, and enhancing overall road safety. This article explores the application of Multi-Agent Reinforcement Learning (MARL) combined with Proximal Policy Optimization (PPO) to optimize autonomous vehicle platooning. We delve into the world of MARL, which empowers vehicles to communicate and collaborate, enabling real-time decision making in complex traffic scenarios. PPO, a cutting-edge reinforcement learning algorithm, ensures stable and efficient training for platooning agents. The synergy between MARL and PPO enables the development of intelligent platooning strategies that adapt dynamically to changing traffic conditions, minimize inter-vehicle gaps, and maximize road capacity. In addition to these insights, this article introduces a cooperative approach to Multi-Agent Reinforcement Learning (MARL), leveraging Proximal Policy Optimization (PPO) to further optimize autonomous vehicle platooning. This cooperative framework enhances the adaptability and efficiency of platooning strategies, marking a significant advancement in the pursuit of intelligent and responsive autonomous vehicle systems.

引用

页数：13

共 41 条

[1]

Aki M., 2012, P 19 INT TRANSP SYST

[2] Security of Vehicle Platooning: A Game-Theoretic Approach [J].

Basiri, Mohammad Hossein ;

Pirani, Mohammad ;

Azad, Nasser L. ;

Fischmeister, Sebastian .

IEEE ACCESS, 2019, 7 :185565-185579

[3]

Bergenhem C., 2010, P 17 WORLD C INT TRA, P1

[4] Rate-Diverse Multiple Access Over Gaussian Channels [J].

Chen, Pingping ;

Shi, Long ;

Fang, Yi ;

Lau, Francis C. M. ;

Cheng, Jun .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (08) :5399-5413

[5]

Chu TS, 2019, IEEE DECIS CONTR P, P4079

[6]

Claus C, 1998, FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, P746

[7]

Cools Seung-Bae., 2013, Springer Advances in applied selforganizing systems, P45

[8]

Davila A., 2010, P C PERS RAP TRANS P, VVolume 3, P2

[9]

Egea AC, 2020, IEEE SYS MAN CYBERN, P965, DOI [10.1109/smc42975.2020.9283498, 10.1109/SMC42975.2020.9283498]

[10] Spatiotemporal intersection control in a connected and automated vehicle environment [J].

Feng, Yiheng ;

Yu, Chunhui ;

Liu, Henry X. .

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 89 :364-383

← 1 2 3 4 5 →