Scalable Multi-Robot Cooperation for Multi-Goal Tasks Using Reinforcement Learning

Cited by: 0

Authors:

An, Tianxu [1]

Lee, Joonho [2]

Bjelonic, Marko [3]

De Vincenti, Flavio [4]

Hutter, Marco [1]

Affiliations:

[1] Robot Syst Lab, CH-8092 Zurich, Switzerland

[2] Neuromeka Co Ltd, Seoul 04782, South Korea

[3] Swiss Mile Robot AG, CH-8092 Zurich, Switzerland

[4] Swiss Fed Inst Technol, Computat Robot Lab, CH-8092 Zurich, Switzerland

Source:

IEEE ROBOTICS AND AUTOMATION LETTERS | 2025, Vol. 10, No. 2

Funding:

European Research Council; Swiss National Science Foundation

Keywords:

Robots; Navigation; Training; Neural networks; Collision avoidance; Mobile robots; Reinforcement learning; Quadrupedal robots; Vectors; Scalability; Legged locomotion; multi-robot systems; reinforcement learning;

DOI:

10.1109/LRA.2024.3521183

Chinese Library Classification (CLC) number:

TP24 [Robotics]

Discipline classification codes:

080202 ; 1405 ;

Abstract:

Coordinated navigation of an arbitrary number of robots to an arbitrary number of goals is a big challenge in robotics, often hindered by scalability limitations of existing strategies. This letter introduces a decentralized multi-agent control system using neural network policies trained in simulation. By leveraging permutation invariant neural network architectures and model-free reinforcement learning, our policy enables robots to prioritize varying numbers of collaborating robots and goals in a zero-shot manner without being biased by ordering or limited by a fixed capacity. We validate the task performance and scalability of our policies through experiments in both simulation and real-world settings. Our approach achieves a 10.3% higher success rate in collaborative navigation tasks compared to a policy without a permutation invariant encoder. Additionally, it finds near-optimal solutions for multi-robot navigation problems while being two orders of magnitude faster than an optimization-based centralized controller. We deploy our multi-goal navigation policies on two wheeled-legged quadrupedal robots, which successfully complete a series of multi-goal navigation missions.
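The abstract does not spell out the encoder architecture, so the following is only a minimal sketch of one common way to obtain permutation invariance: embed every entity (for example, a goal position) with the same shared layer, then pool across the set. The max-pooling choice, the untrained random weights, and all names (`make_shared_layer`, `embed`, `encode_set`) are assumptions made for illustration, not the authors' implementation.

```python
import math
import random


def make_shared_layer(in_dim, out_dim, seed=0):
    """Build fixed random weights for one linear layer (hypothetical, untrained)."""
    rng = random.Random(seed)
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)]
               for _ in range(out_dim)]
    biases = [rng.uniform(-1.0, 1.0) for _ in range(out_dim)]
    return weights, biases


def embed(entity, layer):
    """Apply the same linear layer followed by tanh to a single entity."""
    weights, biases = layer
    return [math.tanh(sum(w * x for w, x in zip(row, entity)) + b)
            for row, b in zip(weights, biases)]


def encode_set(entities, layer):
    """Embed each entity with shared weights, then max-pool over the set.

    Max-pooling ignores the order of its inputs, so the result does not
    depend on how the entities are listed, and its length does not depend
    on how many entities there are (assumes at least one entity).
    """
    embedded = [embed(e, layer) for e in entities]
    return [max(column) for column in zip(*embedded)]


layer = make_shared_layer(in_dim=2, out_dim=4)

# Assumed 2D goal positions, purely illustrative values.
goals = [(1.0, 0.5), (-2.0, 3.0), (0.0, -1.5)]
shuffled = [goals[2], goals[0], goals[1]]

code_a = encode_set(goals, layer)                 # original ordering
code_b = encode_set(shuffled, layer)              # same goals, reordered
code_c = encode_set(goals + [(4.0, 4.0)], layer)  # one extra goal
```

Here `code_a` and `code_b` are identical because pooling discards ordering, and `code_c` has the same length as `code_a` even though the set grew, which is the property that lets a fixed-size policy input cover a varying number of robots or goals.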

Pages: 1585-1592

Page count: 8

