Collision avoidance in multi-robot systems based on multi-layered reinforcement learning

Cited by: 15
Authors
Arai, Y
Fujii, T
Asama, H
Kaetsu, H
Endo, I
Affiliations
[1] Iwate Prefectural Univ, Fac Software & Informat Sci, Morioka, Iwate 0200173, Japan
[2] RIKEN, Inst Phys & Chem Res, Wako, Saitama 3510198, Japan
Keywords
collision avoidance; reinforcement learning; multi-layered learning; local communication; mobile robot
DOI
10.1016/S0921-8890(99)00035-4
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
It is important for a robot to acquire adaptive behaviors for avoiding surrounding robots and obstacles in complicated environments. Although introducing a learning scheme is expected to be one solution, handling useful information such as the motions of other robots requires a large amount of memory and a high computational cost. In this paper, we introduce a multi-layered reinforcement learning method. By dividing the learning curriculum into multiple layers, the number of expected situations can be reduced. It is shown that real robots can adaptively avoid collisions with each other and with obstacles in a complicated situation. (C) 1999 Elsevier Science B.V. All rights reserved.
Pages: 21-32
Page count: 12
Related papers
50 in total
  • [31] A Survey of Reinforcement Learning Research and Its Application for Multi-Robot Systems
    Yang Yuequan
    Jin Lu
    Cao Zhiqiang
    Tang Hongru
    Xia Yang
    Ni Chunbo
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 3068 - 3074
  • [32] Multi-Agent Reinforcement Learning based on K-Means Clustering in Multi-Robot Cooperative Systems
    Liu Chang-an
    Liu Fei
    Liu Chun-yang
    Wu Hua
    OPTICAL, ELECTRONIC MATERIALS AND APPLICATIONS, PTS 1-2, 2011, 216 : 75 - 80
  • [33] Parameter Learning Based Multi-robot Collaboration Anti-collision
    Liao, WeiQiang
    PROCEEDINGS OF FIRST INTERNATIONAL CONFERENCE OF MODELLING AND SIMULATION, VOL V: MODELLING AND SIMULATION IN MECHANICS AND MANUFACTURE, 2008, : 296 - 301
  • [34] Prioritized planning algorithm for multi-robot collision avoidance based on artificial untraversable vertex
    Haodong Li
    Tao Zhao
    Songyi Dian
    Applied Intelligence, 2022, 52 : 429 - 451
  • [35] Learning-Based Multi-Robot Formation Control With Obstacle Avoidance
    Bai, Chengchao
    Yan, Peng
    Pan, Wei
    Guo, Jifeng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 11811 - 11822
  • [36] Scalable Multi-Robot Cooperation for Multi-Goal Tasks Using Reinforcement Learning
    An, Tianxu
    Lee, Joonho
    Bjelonic, Marko
    De Vincenti, Flavio
    Hutter, Marco
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (02): : 1585 - 1592
  • [37] Cloud based Real-time Multi-robot Collision Avoidance for Swarm Robotics
    He, Hengjing
    Kamburugamuve, Supun
    Fox, Geoffrey C.
    Zhao, Wei
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (06): : 339 - 357
  • [38] Multi-Robot Gas-Source Localization based on Reinforcement Learning
    Wei, Jian-Long
    Meng, Qing-Hao
    Yan, Ci
    Zeng, Ming
    Li, Wei
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2012), 2012,
  • [39] Integrating collision avoidance strategies into multi-robot task allocation for inspection
    Chakraa, Hamza
    Leclercq, Edouard
    Guerin, Francois
    Lefebvre, Dimitri
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2025, 47 (07)
  • [40] Applying Reinforcement Learning to Multi-robot Team Coordination
    Sanz, Yolanda
    de Lope, Javier
    Antonio Martin H, Jose
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2008, 5271 : 625 - +