Deterministic Policy Gradient Based Formation Control for Multi-Agent Systems

Authors
Hong, Zhiying [1]
Wang, Qingling [1]
Affiliation
[1] Southeast Univ, Sch Automat, Nanjing, Peoples R China
Source
2019 CHINESE AUTOMATION CONGRESS (CAC2019) | 2019
Keywords
formation control; multi-agent reinforcement learning; deterministic policy gradient
DOI
10.1109/cac48633.2019.8996660
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper studies formation control of multi-agent systems using reinforcement learning. A novel multi-agent formation control algorithm is first proposed; it adopts the framework of centralized training with decentralized execution and combines the deterministic policy gradient (DPG) method with a multi-agent advantage function. Three scenarios formulated as partially observable Markov games are then presented to study the multi-agent formation control problem and verify the proposed algorithm. Simulation results show that the proposed algorithm is effective in achieving the multi-agent formation control tasks.
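For orientation, centralized training with decentralized execution combined with a deterministic policy gradient is commonly written in the form sketched below. This is the standard multi-agent DPG formulation, not an equation taken from this paper, and every symbol is an assumption: agent i has a deterministic policy \mu_i with parameters \theta_i acting on its local observation o_i, Q_i is a centralized critic that sees the joint information x and all actions, and D is a replay buffer. The abstract does not define its multi-agent advantage function, so A_i appears only as a generic critic minus a baseline b_i that does not depend on agent i's own action.

```latex
% Decentralized execution: each agent acts on its own observation only
a_i = \mu_i(o_i; \theta_i), \qquad i = 1, \dots, N

% Centralized training: the critic is conditioned on joint information
\nabla_{\theta_i} J(\theta_i)
  = \mathbb{E}_{x, a \sim D}\!\left[
      \nabla_{\theta_i} \mu_i(o_i; \theta_i)\,
      \nabla_{a_i} Q_i(x, a_1, \dots, a_N)\big|_{a_i = \mu_i(o_i; \theta_i)}
    \right]

% Generic advantage (hypothetical form): subtracting a baseline that is
% independent of a_i leaves the action gradient unchanged
A_i(x, a) = Q_i(x, a) - b_i(x, a_{-i}), \qquad
\nabla_{a_i} A_i(x, a) = \nabla_{a_i} Q_i(x, a)
```

Under these assumptions only the actors \mu_i are needed at execution time; the critic and any advantage estimate are used during training alone.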
Pages: 4349 - 4354
Page count: 6