Optimal consensus control for multi-agent systems: Multi-step policy gradient adaptive dynamic programming method

被引:4
|
作者
Ji, Lianghao [1 ,3 ]
Jian, Kai [1 ]
Zhang, Cuijuan [1 ]
Yang, Shasha [1 ]
Guo, Xing [1 ]
Li, Huaqing [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing, Peoples R China
[2] Southwest Univ, Coll Elect & Informat Engn, Chongqing, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
complex networks; dynamic programming; intelligent control; multi-agent systems; optimal control; OPTIMAL TRACKING CONTROL; ALGORITHM; FRAMEWORK;
D O I
10.1049/cth2.12473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel adaptive dynamic programming (ADP) method to solve the optimal consensus problem for a class of discrete-time multi-agent systems with completely unknown dynamics. Different from the classical RL-based optimal control algorithms based on one-step temporal difference method, a multi-step-based (also call n-step) policy gradient ADP (MS-PGADP) algorithm, which have been proved to be more efficient owing to its faster propagation of the reward, is proposed to obtain the iterative control policies. Moreover, a novel Q-function is defined, which estimates the performance of performing an action in the current state. Then, through the Lyapunov stability theorem and functional analysis, the proof of optimality of the performance index function is given and the stability of the error system is also proved. Furthermore, the actor-critic neural networks are used to implement the proposed method. Inspired by deep Q network, the target network is also introduced to guarantee the stability of NNs in the process of training. Finally, two simulations are conducted to verify the effectiveness of the proposed algorithm.
引用
收藏
页码:1443 / 1457
页数:15
相关论文
共 50 条
  • [41] Optimal control for multi-agent persistent monitoring
    Song, Cheng
    Liu, Lu
    Feng, Gang
    Xu, Shengyuan
    AUTOMATICA, 2014, 50 (06) : 1663 - 1668
  • [42] Event-triggered Multi-agent Optimal Regulation Using Adaptive Dynamic Programming
    Zhong, Xiangnan
    He, Haibo
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [43] Optimal iterative learning control design for multi-agent systems consensus tracking
    Yang, Shiping
    Xu, Jian-Xin
    Huang, Deqing
    Tan, Ying
    SYSTEMS & CONTROL LETTERS, 2014, 69 : 80 - 89
  • [44] OPTIMAL CONSENSUS CONTROL OF DISCRETE-TIME STOCHASTIC MULTI-AGENT SYSTEMS
    Zong, Xiaofeng
    Wang, Mingyu
    Zhang, Ji-Feng
    NUMERICAL ALGEBRA CONTROL AND OPTIMIZATION, 2025, 15 (01): : 155 - 172
  • [45] Finite-time adaptive consensus of a class of multi-agent systems
    Liu KeXin
    Wu LuLu
    Lu JinHu
    Zhu HengHui
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2016, 59 (01) : 22 - 32
  • [46] Resilient adaptive optimal control of distributed multi-agent systems using reinforcement learning
    Moghadam, Rohollah
    Modares, Hamidreza
    IET CONTROL THEORY AND APPLICATIONS, 2018, 12 (16) : 2165 - 2174
  • [47] Adaptive Consensus of Multi-Agent Systems with Unknown Control Coefficients and Input Saturation
    Fan, Ming-Can
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 1505 - 1510
  • [48] Decentralized Adaptive Optimal Tracking Control for Massive Multi-Agent Systems with Input Constraint
    Zhou, Zejian
    Xu, Hao
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 1 - 8
  • [49] Model-Free Optimal Consensus Control for Multi-agent Systems Based on DHP Algorithm
    Shi, Haoen
    Feng, Yanghe
    Mu, Chaoxu
    Wu, Yunkai
    NEURAL PROCESSING LETTERS, 2022, 54 (01) : 501 - 521
  • [50] Distributed consensus for multi-agent systems via adaptive sliding mode control
    Yu, Zhiyong
    Yu, Shuzhen
    Jiang, Haijun
    Hu, Cheng
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (15) : 7125 - 7151