Scalable Multi-Robot Cooperation for Multi-Goal Tasks Using Reinforcement Learning

被引：0

作者：

An, Tianxu ^{[1
]}

Lee, Joonho ^{[2
]}

Bjelonic, Marko ^{[3
]}

De Vincenti, Flavio ^{[4
]}

Hutter, Marco ^{[1
]}

机构：

[1] Robot Syst Lab, CH-8092 Zurich, Switzerland

[2] Neuromeka Co Ltd, Seoul 04782, South Korea

[3] Swiss Mile Robot AG, CH-8092 Zurich, Switzerland

[4] Swiss Fed Inst Technol, Computat Robot Lab, CH-8092 Zurich, Switzerland

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2025年 / 10卷 / 02期

基金：

瑞士国家科学基金会; 欧洲研究理事会;

关键词：

Robots; Navigation; Training; Neural networks; Collision avoidance; Mobile robots; Reinforcement learning; Quadrupedal robots; Vectors; Scalability; Legged locomotion; multi-robot systems; reinforcement learning;

D O I：

10.1109/LRA.2024.3521183

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Coordinated navigation of an arbitrary number of robots to an arbitrary number of goals is a big challenge in robotics, often hindered by scalability limitations of existing strategies. This letter introduces a decentralized multi-agent control system using neural network policies trained in simulation. By leveraging permutation invariant neural network architectures and model-free reinforcement learning, our policy enables robots to prioritize varying numbers of collaborating robots and goals in a zero-shot manner without being biased by ordering or limited by a fixed capacity. We validate the task performance and scalability of our policies through experiments in both simulation and real-world settings. Our approach achieves a 10.3% higher success rate in collaborative navigation tasks compared to a policy without a permutation invariant encoder. Additionally, it finds near-optimal solutions for multi-robot navigation problems while being two orders of magnitude faster than an optimization-based centralized controller. We deploy our multi-goal navigation policies on two wheeled-legged quadrupedal robots, which successfully complete a series of multi-goal navigation missions.

引用

页码：1585 / 1592

页数：8

共 50 条

[31] Hierarchical reinforcement learning for handling sparse rewards in multi-goal navigation
Yan, Jiangyue
Luo, Biao
Xu, Xiaodong
[J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (06)
[32] Distributed safe reinforcement learning for multi-robot motion planning
Lu, Yang
Guo, Yaohua
Zhao, Guoxiang
Zhu, Minghui
[J]. 2021 29TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2021, : 1209 - 1214
[33] Distributed scalable multi-robot learning using particle swarm optimization
Pugh J.
Martinoli A.
[J]. Swarm Intelligence, 2009, 3 (03) : 203 - 222
[34] Multi-Robot Flocking Control Based on Deep Reinforcement Learning
Zhu, Pengming
Dai, Wei
Yao, Weijia
Ma, Junchong
Zeng, Zhiwen
Lu, Huimin
[J]. IEEE ACCESS, 2020, 8 : 150397 - 150406
[35] Improving the robustness of reinforcement learning for a multi-robot system environment
Yasuda, T
Ohkura, K
[J]. SOFT COMPUTING AS TRANSDISCIPLINARY SCIENCE AND TECHNOLOGY, 2005, : 263 - 272
[36] COOPERATIVE MULTI-ROBOT SYSTEM FOR INFRASTRUCTURE SECURITY TASKS
Hernandez, Erik
Barrientos, Antonio
Rossi, Claudio
del Cerro, Jaime
[J]. ICAART: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL. 2, 2012, : 313 - 316
[37] A Fuzzy Control Strategy for Multi-Goal Autonomous Robot Navigation
Stavrinidis, Stavros
Zacharia, Paraskevi
Xidias, Elias
[J]. SENSORS, 2025, 25 (02)
[38] Preservation and Application of Acquired Knowledge Using Instance-Based Reinforcement Learning for Multi-Robot Systems
Sakanoue, Junki
Yasuda, Toshiyuki
Ohkura, Kazuhiro
[J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (08) : 1109 - 1115
[39] Learning scalable and efficient communication policies for multi-robot collision avoidance
Serra-Gomez, Alvaro
Zhu, Hai
Brito, Bruno
Bohmer, Wendelin
Alonso-Mora, Javier
[J]. AUTONOMOUS ROBOTS, 2023, 47 (08) : 1275 - 1297
[40] Learning scalable and efficient communication policies for multi-robot collision avoidance
Álvaro Serra-Gómez
Hai Zhu
Bruno Brito
Wendelin Böhmer
Javier Alonso-Mora
[J]. Autonomous Robots, 2023, 47 : 1275 - 1297

← 1 2 3 4 5 →