Deterministic Policy Gradient Based Formation Control for Multi-Agent Systems

被引:0
|
作者
Hong, Zhiying [1 ]
Wang, Qingling [1 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing, Peoples R China
来源
2019 CHINESE AUTOMATION CONGRESS (CAC2019) | 2019年
关键词
formation control; multi-agent reinforcement learning; deterministic policy gradient;
D O I
10.1109/cac48633.2019.8996660
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the problem of formation control of multi-agent systems with the reinforcement learning method. A novel multi-agent formation control algorithm is first proposed, which adopts the framework of centralized training with decentralized execution, and combines the deterministic policy gradient (DPG) method with multi-agent advantage function. Then, three scenarios under partial observable Markov games are presented to study the multi-agent formation control problem and verify the proposed algorithm. Simulation results show that the proposed algorithm is effective in achieving the multi-agent formation control tasks.
引用
收藏
页码:4349 / 4354
页数:6
相关论文
共 50 条
  • [1] Hybrid Formation Control for Multi-Robot Hunters Based on Multi-Agent Deep Deterministic Policy Gradient
    Hamed O.
    Hamlich M.
    Mendel, 2021, 27 (02) : 23 - 29
  • [2] Multi-Agent Deep Deterministic Policy Gradient Method Based on Double Critics
    Ding S.
    Du W.
    Guo L.
    Zhang J.
    Xu X.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (10): : 2394 - 2404
  • [3] Twin Delayed Multi-Agent Deep Deterministic Policy Gradient
    Zhan, Mengying
    Chen, Jinchao
    Du, Chenglie
    Duan, Yuxin
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021, : 48 - 52
  • [4] A Multi-Agent Deep Deterministic Policy Gradient Method for Multi-Zone HVAC Control
    Liu, Xuebo
    Wu, Yingying
    Liu, Bo
    Wu, Hongyu
    2023 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, PESGM, 2023,
  • [5] Asynchronous Methods for Multi-agent Deep Deterministic Policy Gradient
    Jiang, Xuesong
    Li, Zhipeng
    Wei, Xiumei
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT II, 2018, 11302 : 711 - 721
  • [6] Multi-Agent Collaborative Target Search Based on the Multi-Agent Deep Deterministic Policy Gradient with Emotional Intrinsic Motivation
    Zhang, Xiaoping
    Zheng, Yuanpeng
    Wang, Li
    Abdulali, Arsen
    Iida, Fumiya
    APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [7] Multi-Agent Recurrent Deterministic Policy Gradient with Inter-Agent Communication
    Cho, Joohyun
    Liu, Mingxi
    Zhou, Yi
    Chen, Rong-Rong
    FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF, 2023, : 1394 - 1398
  • [8] Multi-Agent Deep Deterministic Policy Gradient Algorithm Based on Classification Experience Replay
    Sun, Xiaoying
    Chen, Jinchao
    Du, Chenglie
    Zhan, Mengying
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 988 - 992
  • [9] Intrinsic Motivation for Deep Deterministic Policy Gradient in Multi-Agent Environments
    Cao, Xiaoge
    Lu, Tao
    Cai, Yinghao
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 1628 - 1633
  • [10] Optimal control for the evolution of deterministic multi-agent systems
    Bivas, Mira
    Quincampoix, Marc
    JOURNAL OF DIFFERENTIAL EQUATIONS, 2020, 269 (03) : 2228 - 2263