Deterministic Policy Gradient Based Formation Control for Multi-Agent Systems

被引：0

作者：

Hong, Zhiying ^{[1
]}

Wang, Qingling ^{[1
]}

机构：

[1] Southeast Univ, Sch Automat, Nanjing, Peoples R China

来源：

2019 CHINESE AUTOMATION CONGRESS (CAC2019) | 2019年

关键词：

formation control; multi-agent reinforcement learning; deterministic policy gradient;

D O I：

10.1109/cac48633.2019.8996660

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper studies the problem of formation control of multi-agent systems with the reinforcement learning method. A novel multi-agent formation control algorithm is first proposed, which adopts the framework of centralized training with decentralized execution, and combines the deterministic policy gradient (DPG) method with multi-agent advantage function. Then, three scenarios under partial observable Markov games are presented to study the multi-agent formation control problem and verify the proposed algorithm. Simulation results show that the proposed algorithm is effective in achieving the multi-agent formation control tasks.

引用

页码：4349 / 4354

页数：6

共 50 条

[1] Hybrid Formation Control for Multi-Robot Hunters Based on Multi-Agent Deep Deterministic Policy Gradient
Hamed O.
Hamlich M.
Mendel, 2021, 27 (02) : 23 - 29
[2] Multi-Agent Deep Deterministic Policy Gradient Method Based on Double Critics
Ding S.
Du W.
Guo L.
Zhang J.
Xu X.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (10): : 2394 - 2404
[3] Twin Delayed Multi-Agent Deep Deterministic Policy Gradient
Zhan, Mengying
Chen, Jinchao
Du, Chenglie
Duan, Yuxin
PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021, : 48 - 52
[4] A Multi-Agent Deep Deterministic Policy Gradient Method for Multi-Zone HVAC Control
Liu, Xuebo
Wu, Yingying
Liu, Bo
Wu, Hongyu
2023 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, PESGM, 2023,
[5] Asynchronous Methods for Multi-agent Deep Deterministic Policy Gradient
Jiang, Xuesong
Li, Zhipeng
Wei, Xiumei
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT II, 2018, 11302 : 711 - 721
[6] Multi-Agent Collaborative Target Search Based on the Multi-Agent Deep Deterministic Policy Gradient with Emotional Intrinsic Motivation
Zhang, Xiaoping
Zheng, Yuanpeng
Wang, Li
Abdulali, Arsen
Iida, Fumiya
APPLIED SCIENCES-BASEL, 2023, 13 (21):
[7] Multi-Agent Recurrent Deterministic Policy Gradient with Inter-Agent Communication
Cho, Joohyun
Liu, Mingxi
Zhou, Yi
Chen, Rong-Rong
FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF, 2023, : 1394 - 1398
[8] Multi-Agent Deep Deterministic Policy Gradient Algorithm Based on Classification Experience Replay
Sun, Xiaoying
Chen, Jinchao
Du, Chenglie
Zhan, Mengying
2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 988 - 992
[9] Intrinsic Motivation for Deep Deterministic Policy Gradient in Multi-Agent Environments
Cao, Xiaoge
Lu, Tao
Cai, Yinghao
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 1628 - 1633
[10] Optimal control for the evolution of deterministic multi-agent systems
Bivas, Mira
Quincampoix, Marc
JOURNAL OF DIFFERENTIAL EQUATIONS, 2020, 269 (03) : 2228 - 2263

← 1 2 3 4 5 →