Scalable Multi-Robot Cooperation for Multi-Goal Tasks Using Reinforcement Learning

被引:0
作者
An, Tianxu [1 ]
Lee, Joonho [2 ]
Bjelonic, Marko [3 ]
De Vincenti, Flavio [4 ]
Hutter, Marco [1 ]
机构
[1] Robot Syst Lab, CH-8092 Zurich, Switzerland
[2] Neuromeka Co Ltd, Seoul 04782, South Korea
[3] Swiss Mile Robot AG, CH-8092 Zurich, Switzerland
[4] Swiss Fed Inst Technol, Computat Robot Lab, CH-8092 Zurich, Switzerland
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2025年 / 10卷 / 02期
基金
瑞士国家科学基金会; 欧洲研究理事会;
关键词
Robots; Navigation; Training; Neural networks; Collision avoidance; Mobile robots; Reinforcement learning; Quadrupedal robots; Vectors; Scalability; Legged locomotion; multi-robot systems; reinforcement learning;
D O I
10.1109/LRA.2024.3521183
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Coordinated navigation of an arbitrary number of robots to an arbitrary number of goals is a big challenge in robotics, often hindered by scalability limitations of existing strategies. This letter introduces a decentralized multi-agent control system using neural network policies trained in simulation. By leveraging permutation invariant neural network architectures and model-free reinforcement learning, our policy enables robots to prioritize varying numbers of collaborating robots and goals in a zero-shot manner without being biased by ordering or limited by a fixed capacity. We validate the task performance and scalability of our policies through experiments in both simulation and real-world settings. Our approach achieves a 10.3% higher success rate in collaborative navigation tasks compared to a policy without a permutation invariant encoder. Additionally, it finds near-optimal solutions for multi-robot navigation problems while being two orders of magnitude faster than an optimization-based centralized controller. We deploy our multi-goal navigation policies on two wheeled-legged quadrupedal robots, which successfully complete a series of multi-goal navigation missions.
引用
收藏
页码:1585 / 1592
页数:8
相关论文
共 50 条
[41]   Stochastic Learning Automata for Self-coordination in Heterogeneous Multi-Tasks Selection in Multi-Robot Systems [J].
Quinonez, Yadira ;
Maravall, Dario ;
de Lope, Javier .
ADVANCES IN ARTIFICIAL INTELLIGENCE, PT I, 2011, 7094 :443-453
[42]   Learning to Coordinate for a Worker-Station Multi-Robot System in Planar Coverage Tasks [J].
Tang, Jingtao ;
Gao, Yuan ;
Lam, Tin Lun .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) :12315-12322
[43]   Attention Enhanced Reinforcement Learning for Multi agent Cooperation [J].
Pu, Zhiqiang ;
Wang, Huimu ;
Liu, Zhen ;
Yi, Jianqiang ;
Wu, Shiguang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) :8235-8249
[44]   Overfitting-avoiding goal-guided exploration for hard-exploration multi-goal reinforcement learning [J].
Han, Changlin ;
Peng, Zhiyong ;
Liu, Yadong ;
Tang, Jingsheng ;
Yu, Yang ;
Zhou, Zongtan .
NEUROCOMPUTING, 2023, 525 :76-87
[45]   A reinforcement learning technique with an adaptive action generator for a multi-robot system [J].
Yasuda, Toshiyuki ;
Ohkura, Kazuhiro .
FROM ANIMALS TO ANIMATS 10, PROCEEDINGS, 2008, 5040 :250-259
[46]   Multi-Robot Gas-Source Localization based on Reinforcement Learning [J].
Wei, Jian-Long ;
Meng, Qing-Hao ;
Yan, Ci ;
Zeng, Ming ;
Li, Wei .
2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2012), 2012,
[47]   A Survey of Reinforcement Learning Research and Its Application for Multi-Robot Systems [J].
Yang Yuequan ;
Jin Lu ;
Cao Zhiqiang ;
Tang Hongru ;
Xia Yang ;
Ni Chunbo .
PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, :3068-3074
[48]   Effect of Virtual Work Braking on Distributed Multi-Robot Reinforcement Learning [J].
Kawano, Hiroshi .
2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, :1987-1994
[49]   Multi-robot systems with agent-based reinforcement learning: evolution, opportunities and challenges [J].
Yang, Erfu ;
Gu, Dongbing .
INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2009, 6 (04) :271-286
[50]   Multi-robot concurrent learning in museum problem [J].
Liu, Zheng ;
Ang, Marcelo H., Jr. ;
Seah, Winston Khoon Guan .
DISTRIBUTED AUTONOMOUS ROBOTIC SYSTEMS 6, 2007, :65-+