Optimal consensus control for multi-agent systems: Multi-step policy gradient adaptive dynamic programming method

被引:4
|
作者
Ji, Lianghao [1 ,3 ]
Jian, Kai [1 ]
Zhang, Cuijuan [1 ]
Yang, Shasha [1 ]
Guo, Xing [1 ]
Li, Huaqing [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing, Peoples R China
[2] Southwest Univ, Coll Elect & Informat Engn, Chongqing, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
complex networks; dynamic programming; intelligent control; multi-agent systems; optimal control; OPTIMAL TRACKING CONTROL; ALGORITHM; FRAMEWORK;
D O I
10.1049/cth2.12473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel adaptive dynamic programming (ADP) method to solve the optimal consensus problem for a class of discrete-time multi-agent systems with completely unknown dynamics. Different from the classical RL-based optimal control algorithms based on one-step temporal difference method, a multi-step-based (also call n-step) policy gradient ADP (MS-PGADP) algorithm, which have been proved to be more efficient owing to its faster propagation of the reward, is proposed to obtain the iterative control policies. Moreover, a novel Q-function is defined, which estimates the performance of performing an action in the current state. Then, through the Lyapunov stability theorem and functional analysis, the proof of optimality of the performance index function is given and the stability of the error system is also proved. Furthermore, the actor-critic neural networks are used to implement the proposed method. Inspired by deep Q network, the target network is also introduced to guarantee the stability of NNs in the process of training. Finally, two simulations are conducted to verify the effectiveness of the proposed algorithm.
引用
收藏
页码:1443 / 1457
页数:15
相关论文
共 50 条
  • [31] Improved Adaptive Dynamic Event-Triggered Consensus of Multi-Agent Systems
    Shi, Xiaotian
    Yan, Huaicheng
    Xu, Chengjie
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (12) : 4509 - 4513
  • [32] Adaptive odd impulsive consensus of multi-agent systems via comparison system method
    Ma, Tiedong
    Yu, Tiantian
    Huang, Jiangshuai
    Yang, Xinsong
    Gu, Zhenyu
    NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2020, 35
  • [33] Distributed constrained optimal consensus of multi-agent systems
    Qiu, Zhirong
    Liu, Shuai
    Xie, Lihua
    AUTOMATICA, 2016, 68 : 209 - 215
  • [34] Dynamic adaptive autonomy in multi-agent systems
    Barber, KS
    Goel, A
    Martin, CE
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2000, 12 (02) : 129 - 147
  • [35] Consensus of nonlinear multi-agent systems with adaptive protocols
    Wang, Lei
    Feng, Wei-jie
    Chen, Michael Z. Q.
    Wang, Qing-guo
    IET CONTROL THEORY AND APPLICATIONS, 2014, 8 (18) : 2245 - 2252
  • [36] Consensus for Multi-Agent Dynamic Systems: an LQR Perspective
    Zhang Dongmei
    Wang Xingang
    Meng Li
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 6261 - 6266
  • [37] Ultra-fast consensus of discrete-time multi-agent systems with multi-step predictive output feedback
    Zhang, Wenle
    Liu, Jianchang
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2016, 47 (06) : 1465 - 1479
  • [38] Optimal consensus control of linear multi-agent systems with communication time delay
    Sheng, Jie
    Ding, Zhengtao
    IET CONTROL THEORY AND APPLICATIONS, 2013, 7 (15) : 1899 - 1905
  • [39] Adaptive Consensus Control of Multi-Agent Systems With Dead-Zone Input
    Wang, Yue
    Yang, Yonghui
    Wu, Libing
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1103 - 1108
  • [40] Distributed adaptive dynamic event-based consensus control for nonlinear uncertain multi-agent systems
    Shahvali, Milad
    Naghibi-Sistani, Mohammad-Bagher
    Askari, Javad
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2022, 236 (09) : 1630 - 1648