Optimal consensus control for multi-agent systems: Multi-step policy gradient adaptive dynamic programming method

Cited by: 4

Authors:

Ji, Lianghao [1,3]

Jian, Kai [1]

Zhang, Cuijuan [1]

Yang, Shasha [1]

Guo, Xing [1]

Li, Huaqing [2]

Affiliations:

[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing, Peoples R China

[2] Southwest Univ, Coll Elect & Informat Engn, Chongqing, Peoples R China

[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China

Source:

IET CONTROL THEORY AND APPLICATIONS | 2023 / Vol. 17 / Issue 11

Funding:

National Natural Science Foundation of China;

Keywords:

complex networks; dynamic programming; intelligent control; multi-agent systems; optimal control; OPTIMAL TRACKING CONTROL; ALGORITHM; FRAMEWORK;

DOI:

10.1049/cth2.12473

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

This paper presents a novel adaptive dynamic programming (ADP) method to solve the optimal consensus problem for a class of discrete-time multi-agent systems with completely unknown dynamics. Unlike classical reinforcement learning (RL) based optimal control algorithms, which rely on the one-step temporal difference method, a multi-step (also called n-step) policy gradient ADP (MS-PGADP) algorithm is proposed to obtain the iterative control policies; multi-step methods have been shown to be more efficient owing to their faster propagation of the reward. Moreover, a novel Q-function is defined, which estimates the performance of taking an action in the current state. Then, using the Lyapunov stability theorem and functional analysis, the optimality of the performance index function and the stability of the error system are proved. Furthermore, actor-critic neural networks (NNs) are used to implement the proposed method. Inspired by the deep Q-network, a target network is also introduced to guarantee the stability of the NNs during training. Finally, two simulations are conducted to verify the effectiveness of the proposed algorithm.
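
As a rough illustration of the multi-step (n-step) idea mentioned in the abstract, the sketch below computes a generic n-step temporal difference target: the discounted sum of n observed stage costs plus a bootstrapped Q-value at the state reached after n steps. This is a textbook-style sketch and not the paper's MS-PGADP algorithm; the function name `n_step_target`, the discount factor `gamma`, and the scalar cost representation are assumptions made for illustration only.

```python
from typing import Sequence


def n_step_target(costs: Sequence[float], bootstrap_q: float, gamma: float = 1.0) -> float:
    """Generic n-step TD target (illustrative, hypothetical helper).

    Computes sum_{i=0}^{n-1} gamma**i * costs[i] + gamma**n * bootstrap_q,
    where n = len(costs) and bootstrap_q is the estimated Q-value at the
    state reached after n steps (for example, from a target network).
    """
    target = bootstrap_q
    # Fold the costs in from the last step back to the first.
    for cost in reversed(costs):
        target = cost + gamma * target
    return target


# One-step target uses a single observed cost before bootstrapping.
one_step = n_step_target([1.0], bootstrap_q=10.0, gamma=0.5)

# Three-step target uses three observed costs before bootstrapping.
three_step = n_step_target([1.0, 2.0, 3.0], bootstrap_q=10.0, gamma=0.5)
```

With n = 1 this reduces to the usual one-step target; a larger n lets more observed costs enter the target directly before the bootstrapped estimate is used.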


Pages: 1443-1457

Page count: 15

