Optimizing Mixed Autonomy Traffic Flow with Decentralized Autonomous Vehicles and Multi-Agent Reinforcement Learning

被引：1

作者：

Vinitsky, Eugene ^{[1
]}

Lichtle, Nathan ^{[2
]}

Parvate, Kanaad ^{[3
]}

Bayen, Alexandre ^{[4
]}

机构：

[1] Univ Calif Berkeley, Mech Engn, Berkeley, CA 94720 USA

[2] Ecole Ponts ParisTech, F-77420 Champs Sur Marne, France

[3] Univ Calif Berkeley, Berkeley, CA 94720 USA

[4] UC Berkeley EECS, Inst Transportat Syst, Berkeley, CA 94720 USA

来源：

ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS | 2023年 / 7卷 / 02期

基金：

美国国家科学基金会;

关键词：

Reinforcement learning; mixed autonomy; autonomous vehicles; traffic optimization;

D O I：

10.1145/3582576

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

We study the ability of autonomous vehicles to improve the throughput of a bottleneck using a fully decentralized control scheme in a mixed autonomy setting. We consider the problem of improving the throughput of a scaled model of the San Francisco-Oakland Bay Bridge: a two-stage bottleneck where four lanes reduce to two and then reduce to one. Although there is extensive work examining variants of bottleneck control in a centralized setting, there is less study of the challenging multi-agent setting where the large number of interacting AVs leads to significant optimization difficulties for reinforcement learning methods. We apply multi-agent reinforcement algorithms to this problem and demonstrate that significant improvements in bottleneck throughput, from 20% at a 5% penetration rate to 33% at a 40% penetration rate, can be achieved. We compare our results to a hand-designed feedback controller and demonstrate that our results sharply outperform the feedback controller despite extensive tuning. Additionally, we demonstrate that the RL-based controllers adopt a robust strategy that works across penetration rates whereas the feedback controllers degrade immediately upon penetration rate variation. We investigate the feasibility of both action and observation decentralization and demonstrate that effective strategies are possible using purely local sensing. Finally, we open-source our code at https://github.com/eugenevinitsky/decentralized_bottlenecks.

引用

页码：1 / 22

页数：22

共 50 条

[41] A Multi-Agent Deep Reinforcement Learning Approach for Practical Decentralized UAV Collision Avoidance
Thumiger, Nicholas
Deghat, Mohammad
IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2174 - 2179
[42] Decentralized graph-based multi-agent reinforcement learning using reward machines
Hu, Jueming
Xu, Zhe
Wang, Weichang
Qu, Guannan
Pang, Yutian
Liu, Yongming
NEUROCOMPUTING, 2024, 564
[43] Decentralized Computation Offloading with Cooperative UAVs: Multi-Agent Deep Reinforcement Learning Perspective
Hwang, Sangwon
Lee, Hoon
Park, Juseong
Lee, Inkyu
IEEE WIRELESS COMMUNICATIONS, 2022, 29 (04) : 24 - 31
[44] QMNet: Importance-Aware Message Exchange for Decentralized Multi-Agent Reinforcement Learning
Huang, Xiufeng
Zhou, Sheng
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 4739 - 4751
[45] Urban Traffic Control Using Distributed Multi-agent Deep Reinforcement Learning
Kitagawa, Shunya
Moustafa, Ahmed
Ito, Takayuki
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 337 - 349
[46] Cooperative Multi-agent Reinforcement Learning Models (CMRLM) for Intelligent Traffic Control
Vidhate, Deepak A.
Kulkarni, Parag
2017 1ST INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND INFORMATION MANAGEMENT (ICISIM), 2017, : 325 - 331
[47] Satellite Network Traffic Scheduling Algorithm Based on Multi-Agent Reinforcement Learning
Zhang, Tingting
Zhang, Mingqi
Yang, Lintao
Dong, Tao
Yin, Jie
Liu, Zhihui
Wu, Jing
Jiang, Hao
19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 761 - 768
[48] A Multi-Agent Reinforcement Learning Approach for Conflict Resolution in Dense Traffic Scenarios
Lai, Jiajian
Cai, Kaiquan
Liu, Zhaoxuan
Yang, Yang
2021 IEEE/AIAA 40TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2021,
[49] Optimal Formation of Autonomous Vehicles in Mixed Traffic Flow
Li, Keqiang
Wang, Jiawei
Zheng, Yang
IFAC PAPERSONLINE, 2020, 53 (02): : 15204 - 15210
[50] Eco-cooperative adaptive cruise control for platoons in mixed traffic using single-agent and multi-agent reinforcement learning
Yang, Zhiwei
Zheng, Zuduo
Kim, Jiwon
Rakha, Hesham
TRANSPORTATION RESEARCH PART D-TRANSPORT AND ENVIRONMENT, 2025, 142

← 1 2 3 4 5 →