Scalable Multi-Robot Cooperation for Multi-Goal Tasks Using Reinforcement Learning

被引:0
作者
An, Tianxu [1 ]
Lee, Joonho [2 ]
Bjelonic, Marko [3 ]
De Vincenti, Flavio [4 ]
Hutter, Marco [1 ]
机构
[1] Robot Syst Lab, CH-8092 Zurich, Switzerland
[2] Neuromeka Co Ltd, Seoul 04782, South Korea
[3] Swiss Mile Robot AG, CH-8092 Zurich, Switzerland
[4] Swiss Fed Inst Technol, Computat Robot Lab, CH-8092 Zurich, Switzerland
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2025年 / 10卷 / 02期
基金
瑞士国家科学基金会; 欧洲研究理事会;
关键词
Robots; Navigation; Training; Neural networks; Collision avoidance; Mobile robots; Reinforcement learning; Quadrupedal robots; Vectors; Scalability; Legged locomotion; multi-robot systems; reinforcement learning;
D O I
10.1109/LRA.2024.3521183
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Coordinated navigation of an arbitrary number of robots to an arbitrary number of goals is a big challenge in robotics, often hindered by scalability limitations of existing strategies. This letter introduces a decentralized multi-agent control system using neural network policies trained in simulation. By leveraging permutation invariant neural network architectures and model-free reinforcement learning, our policy enables robots to prioritize varying numbers of collaborating robots and goals in a zero-shot manner without being biased by ordering or limited by a fixed capacity. We validate the task performance and scalability of our policies through experiments in both simulation and real-world settings. Our approach achieves a 10.3% higher success rate in collaborative navigation tasks compared to a policy without a permutation invariant encoder. Additionally, it finds near-optimal solutions for multi-robot navigation problems while being two orders of magnitude faster than an optimization-based centralized controller. We deploy our multi-goal navigation policies on two wheeled-legged quadrupedal robots, which successfully complete a series of multi-goal navigation missions.
引用
收藏
页码:1585 / 1592
页数:8
相关论文
共 50 条
  • [31] Hierarchical reinforcement learning for handling sparse rewards in multi-goal navigation
    Yan, Jiangyue
    Luo, Biao
    Xu, Xiaodong
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (06)
  • [32] Distributed safe reinforcement learning for multi-robot motion planning
    Lu, Yang
    Guo, Yaohua
    Zhao, Guoxiang
    Zhu, Minghui
    [J]. 2021 29TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2021, : 1209 - 1214
  • [33] Distributed scalable multi-robot learning using particle swarm optimization
    Pugh J.
    Martinoli A.
    [J]. Swarm Intelligence, 2009, 3 (03) : 203 - 222
  • [34] Multi-Robot Flocking Control Based on Deep Reinforcement Learning
    Zhu, Pengming
    Dai, Wei
    Yao, Weijia
    Ma, Junchong
    Zeng, Zhiwen
    Lu, Huimin
    [J]. IEEE ACCESS, 2020, 8 : 150397 - 150406
  • [35] Improving the robustness of reinforcement learning for a multi-robot system environment
    Yasuda, T
    Ohkura, K
    [J]. SOFT COMPUTING AS TRANSDISCIPLINARY SCIENCE AND TECHNOLOGY, 2005, : 263 - 272
  • [36] COOPERATIVE MULTI-ROBOT SYSTEM FOR INFRASTRUCTURE SECURITY TASKS
    Hernandez, Erik
    Barrientos, Antonio
    Rossi, Claudio
    del Cerro, Jaime
    [J]. ICAART: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL. 2, 2012, : 313 - 316
  • [37] A Fuzzy Control Strategy for Multi-Goal Autonomous Robot Navigation
    Stavrinidis, Stavros
    Zacharia, Paraskevi
    Xidias, Elias
    [J]. SENSORS, 2025, 25 (02)
  • [38] Preservation and Application of Acquired Knowledge Using Instance-Based Reinforcement Learning for Multi-Robot Systems
    Sakanoue, Junki
    Yasuda, Toshiyuki
    Ohkura, Kazuhiro
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (08) : 1109 - 1115
  • [39] Learning scalable and efficient communication policies for multi-robot collision avoidance
    Serra-Gomez, Alvaro
    Zhu, Hai
    Brito, Bruno
    Bohmer, Wendelin
    Alonso-Mora, Javier
    [J]. AUTONOMOUS ROBOTS, 2023, 47 (08) : 1275 - 1297
  • [40] Learning scalable and efficient communication policies for multi-robot collision avoidance
    Álvaro Serra-Gómez
    Hai Zhu
    Bruno Brito
    Wendelin Böhmer
    Javier Alonso-Mora
    [J]. Autonomous Robots, 2023, 47 : 1275 - 1297