Cooperative Optimal Output Regulation for Multi-agent Systems Based on Distributed Adaptive Internal Model

被引:0
|
作者
Dong, Yu-Chen [1 ]
Gao, Wei-Nan [1 ]
Jiang, Zhong-Ping [2 ]
机构
[1] State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang
[2] Department of Electrical and Computer Engineering, Tandon School of Engineering, New York University, New York, 11201, NY
来源
Zidonghua Xuebao/Acta Automatica Sinica | 2025年 / 51卷 / 03期
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming; cooperative output regulation; distributed adaptive internal model; multi-agent systems; reinforcement learning;
D O I
10.16383/j.aas.c240371
中图分类号
学科分类号
摘要
In this paper, a distributed data-driven adaptive control strategy is proposed for the problem of cooperative optimal output regulation of discrete-time multi-agent systems, in the absence of precise information of multiagent system matrices. Based on adaptive dynamic programming and distributed adaptive internal model, two reinforcement learning algorithms, value iteration and policy iteration, are introduced to learn the optimal controller by using online data, so as to achieve the cooperative output regulation of multi-agent systems. Considering that the followers can only access the estimated value of the leader for online learning, in order to prove that the learned control gain converges to the optimal control gain, this paper provides a rigorous analysis of the stability of the closed-loop system and the convergence of the learning algorithm. The simulation results verify the effectiveness of the proposed control method. © 2025 Science Press. All rights reserved.
引用
收藏
页码:678 / 691
页数:13
相关论文
共 35 条
  • [21] Wang B J, Xu L, Yi X L, Jia Y, Yang T., Semiglobal suboptimal output regulation for heterogeneous multi-agent systems with input saturation via adaptive dynamic programming, IEEE Transactions on Neural Networks and Learning Systems, 35, 3, pp. 3242-3250, (2022)
  • [22] Liu W J, Sun J, Wang G, Bullo F, Chen J., Data-driven self-triggered control via trajectory prediction, IEEE Transactions on Automatic Control, 68, 11, pp. 6951-6958, (2023)
  • [23] Liu S L, Niu B, Zong G D, Zhao X D, Xu N., Data-driven-based event-triggered optimal control of unknown nonlinear systems with input constraints, Nonlinear Dynamics, 109, 2, pp. 891-909, (2022)
  • [24] Jiang Y, Fan J L, Gao W, Chai T Y, Lewis F L., Cooperative adaptive optimal output regulation of nonlinear discrete-time multi-agent systems, Automatica, 121, (2020)
  • [25] Gao W, Jiang Z P, Lewis F L, Wang Y B., Leader-to-formation stability of multiagent systems: An adaptive optimal control approach, IEEE Transactions on Automatic Control, 63, 10, pp. 3581-3587, (2018)
  • [26] Cai H, Lewis F L, Hu G Q, Huang J., The adaptive distributed observer approach to the cooperative output regulation of linear multi-agent systems, Automatica, 75, pp. 299-305, (2017)
  • [27] Gao W, Mynuddin M, Wunsch D C, Jiang Z P., Reinforcement learning-based cooperative optimal output regulation via distributed adaptive internal model, IEEE Transactions on Neural Networks and Learning Systems, 33, 10, pp. 5229-5240, (2021)
  • [28] Gao W, Liu Y Y, Odekunle A, Yu Y J, Liu P L., Adaptive dynamic programming and cooperative output regulation of discrete-time multi-agent systems, International Journal of Control, Automation and Systems, 16, 5, pp. 2273-2281, (2018)
  • [29] Huang J., Nonlinear output regulation: Theory and applications, Society for Industrial and Applied Mathematics, (2004)
  • [30] Huang J., The cooperative output regulation problem of discrete time linear multi-agent systems by the adaptive distributed observer, IEEE Transactions on Automatic Control, 62, 4, (2016)