Reinforcement learning for multi-agent formation navigation with scalability

被引:3
作者
Gong, Yalei [1 ]
Xiong, Hongyun [1 ]
Li, Mengmeng [1 ]
Wang, Haibo [1 ]
Nian, Xiaohong [1 ]
机构
[1] Cent South Univ, Clustered Unmanned Syst Res Inst, Sch Automat, Changsha 410073, Hunan, Peoples R China
关键词
Deep reinforcement learning; Multi-agent formations; Collision avoidance; Scalability; AVOIDANCE;
D O I
10.1007/s10489-023-05007-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the multi-agent formation obstacle avoidance (MAFOA) problem using multi-agent deep reinforcement learning (MADRL). MAFOA control aims to achieve and maintain a desired formation while avoiding collisions among agents or with obstacles. It is a research hotspot in multi-agent cooperation due to its wide applications and challenges. However, current MADRL methods face two major difficulties in solving this problem: 1) the high complexity and uncertainty of the environment when there are many agents; 2) the lack of scalability when the number of agents varies. To overcome these difficulties, we propose: 1) A local multi-agent deep deterministic policy gradient algorithm that allows each agent to learn from its local neighbors' strategies during training and act independently during execution; 2) A reinforcement learning framework based on local information that uses partial observation as input and adapts to different numbers of agents; 3) A hybrid control method that switches between reinforcement learning and PID control to ensure formation stability. We evaluate our method on the multiagent particle environment environment and compare it with other algorithms to demonstrate its feasibility and superiority for solving the MAFOA problem.
引用
收藏
页码:28207 / 28225
页数:19
相关论文
共 32 条
  • [1] Multi-robot exploration in task allocation problem
    Alitappeh, Reza Javanmard
    Jeddisaravi, Kossar
    [J]. APPLIED INTELLIGENCE, 2022, 52 (02) : 2189 - 2211
  • [2] Learning-Based Multi-Robot Formation Control With Obstacle Avoidance
    Bai, Chengchao
    Yan, Peng
    Pan, Wei
    Guo, Jifeng
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 11811 - 11822
  • [3] GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation
    Chen, Haoqiang
    Liu, Yadong
    Zhou, Zongtan
    Hu, Dewen
    Zhang, Ming
    [J]. APPLIED INTELLIGENCE, 2020, 50 (12) : 4195 - 4205
  • [4] Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios
    Fan, Tingxiang
    Long, Pinxin
    Liu, Wenxi
    Pan, Jia
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (07) : 856 - 892
  • [5] Reinforcement Learned Distributed Multi-Robot Navigation With Reciprocal Velocity Obstacle Shaped Rewards
    Han, Ruihua
    Chen, Shengduo
    Wang, Shuaijun
    Zhang, Zeqing
    Gao, Rui
    Hao, Qi
    Pan, Jia
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 5896 - 5903
  • [6] A novel hybrid particle swarm optimization for multi-UAV cooperate path planning
    He, Wenjian
    Qi, Xiaogang
    Liu, Lifang
    [J]. APPLIED INTELLIGENCE, 2021, 51 (10) : 7350 - 7364
  • [7] Decentralized Multi-Agent Path Finding for UAV Traffic Management
    Ho, Florence
    Geraldes, Ruben
    Goncalves, Artur
    Rigault, Bastien
    Sportich, Benjamin
    Kubo, Daisuke
    Cavazza, Marc
    Prendinger, Helmut
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) : 997 - 1008
  • [8] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
  • [9] Voronoi-Based Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning
    Hu, Junyan
    Niu, Hanlin
    Carrasco, Joaquin
    Lennox, Barry
    Arvin, Farshad
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (12) : 14413 - 14423
  • [10] Iqbal S, 2019, PR MACH LEARN RES, V97