Transferring Multi-Agent Reinforcement Learning Policies for Autonomous Driving using Sim-to-Real

被引：21

作者：

Candela, Eduardo ^{[1
]}

Parada, Leandro ^{[1
]}

Marques, Luis ^{[1
]}

Georgescu, Tiberiu-Andrei ^{[1
]}

Demiris, Yiannis ^{[2
]}

Angeloudis, Panagiotis ^{[1
]}

机构：

[1] Imperial Coll London, Dept Civil & Environm Engn, Ctr Transport Studies, London, England

[2] Imperial Coll London, Dept Elect & Elect Engn, Personal Robot Lab, London, England

来源：

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2022年

关键词：

D O I：

10.1109/IROS47612.2022.9981319

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Autonomous Driving requires high levels of coordination and collaboration between agents. Achieving effective coordination in multi-agent systems is a difficult task that remains largely unresolved. Multi-Agent Reinforcement Learning has arisen as a powerful method to accomplish this task because it considers the interaction between agents and also allows for decentralized training-which makes it highly scalable. However, transferring policies from simulation to the real world is a big challenge, even for single-agent applications. Multi-agent systems add additional complexities to the Sim-to-Real gap due to agent collaboration and environment synchronization. In this paper, we propose a method to transfer multi-agent autonomous driving policies to the real world. For this, we create a multi-agent environment that imitates the dynamics of the Duckietown multi-robot testbed, and train multi-agent policies using the MAPPO algorithm with different levels of domain randomization. We then transfer the trained policies to the Duckietown testbed and show that when using our method, domain randomization can reduce the reality gap by 90%. Moreover, we show that different levels of parameter randomization have a substantial impact on the Sim-to-Real gap. Finally, our approach achieves significantly better results than a rule-based benchmark.

引用

页码：8814 / 8820

页数：7

共 30 条

[1]

[Anonymous], 2017, ARXIV170806374

[2]

[Anonymous], 2020, 2020 INT JOINT C NEU, DOI DOI 10.1145/3399715.3400872

[3]

Balaji Bharathan, 2019, ARXIV191101562

[4]

Bassani H. F., 2020, ARXIV200311102CSSTAT

[5] Graph neural network and reinforcement learning for multi-agent cooperative control of connected autonomous vehicles [J].

Chen, Sikai ;

Dong, Jiqian ;

Ha, Paul ;

Li, Yujie ;

Labi, Samuel .

COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2021, 36 (07) :838-857

[6]

Chevalier-Boisvert M., 2018, Duckietown Environments for OpenAI Gym

[7]

Dosovitskiy A, 2017, PR MACH LEARN RES, V78

[8]

Duckietown, 2020, DUCKIEBOT MOOC FOUND

[9]

Ercan S., 2019, THESIS

[10]

Eysenbach B., 2017, ARXIV171106782

← 1 2 3 →