Data-Based Optimal Synchronization Control for Discrete-Time Nonlinear Heterogeneous Multiagent Systems

被引：16

作者：

Fu, Hao ^{[1
,2
]}

Chen, Xin ^{[1
,2
]}

Wang, Wei ^{[3
]}

Wu, Min ^{[1
,2
]}

机构：

[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China

[2] China Univ Geosci, Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China

[3] WISDRI Engn & Res Inc Ltd, Res & Dev Inst, Wuhan 430223, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2022年 / 52卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Synchronization; Nickel; Performance analysis; Mathematical model; Adaptation models; Decentralized control; Multi-agent systems; Approximate dynamic programming (ADP); discrete time; model reference adaptive control (MRAC); multiagent systems (MASs); optimal synchronization; policy iteration; APPROXIMATE OPTIMAL-CONTROL; OPTIMAL CONSENSUS CONTROL; OUTPUT SYNCHRONIZATION; ADAPTIVE-CONTROL; GRAPHICAL GAMES;

D O I：

10.1109/TCYB.2020.3004494

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article investigates the optimal synchronization problem for unknown discrete-time nonlinear heterogeneous multiagent systems (MASs). It is very intractable to derive the analytical solutions of coupled Bellman's equations, which are necessary to overcome this problem. We propose a data-based optimal synchronization control strategy based on a hierarchical and distributed optimal control framework composed of a model reference adaptive control (MRAC) layer and a distributed control layer. In the MRAC layer, the similar-offline MRAC algorithm is developed to make subsystems of MASs track their reference models, respectively. Then, the distributed optimal control problem of nonlinear heterogeneous MASs is transformed into that of homogeneous MASs composed of the reference models and the leader. In the distributed control layer, the distributed reference policy iteration algorithm is proposed to derive the solutions of coupled composite nonlinear Bellman's equations, which ensure that the homogeneous MASs reach synchronization with optimum. The suboptimal synchronization control is achieved via optimization further. Convergence analysis of both algorithms is rigorously provided. The simulation results verify the effectiveness of the proposed strategy.

引用

页码：2477 / 2490

页数：14

共 48 条

[31] Prioritizing Useful Experience Replay for Heuristic Dynamic Programming-Based Learning Systems
Ni, Zhen
Malla, Naresh
Zhong, Xiangnan
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (11) : 3911 - 3922
[32] Complete stability analysis of a heuristic approximate dynamic programming control design
Sokolov, Yury
Kozma, Robert
Werbos, Ludmilla D.
Werbos, Paul J.
[J]. AUTOMATICA, 2015, 59 : 9 - 18
[33] Cooperative Output Regulation With Application to Multi-Agent Consensus Under Switching Network
Su, Youfeng
Huang, Jie
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03): : 864 - 875
[34] Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality
Vamvoudakis, Kyriakos G.
Lewis, Frank L.
Hudas, Greg R.
[J]. AUTOMATICA, 2012, 48 (08) : 1598 - 1611
[35] Intelligent Critic Control With Disturbance Attenuation for Affine Dynamics Including an Application to a Microgrid System
Wang, Ding
He, Haibo
Mu, Chaoxu
Liu, Derong
[J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 64 (06) : 4935 - 4944
[36] Optimal Consensus Control for Heterogeneous Nonlinear Multiagent Systems with Partially Unknown Dynamics
Wang, Tao
Fu, Hao
Li, Jinbin
Zhang, Yaodong
Zhou, Xinfeng
Chen, Xin
[J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2019, 17 (09) : 2400 - 2413
[37] Model-Free Distributed Consensus Control Based on Actor-Critic Framework for Discrete-Time Nonlinear Multiagent Systems
Wang, Wei
Chen, Xin
Fu, Hao
Wu, Min
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4123 - 4134
[38] Model-free optimal containment control of multi-agent systems based on actor-critic framework
Wang, Wei
Chen, Xin
[J]. NEUROCOMPUTING, 2018, 314 : 242 - 250
[39] Leader-Follower Output Synchronization of Linear Heterogeneous Systems With Active Leader Using Reinforcement Learning
Yang, Yongliang
Modares, Hamidreza
Wunsch, Donald C., II
Yin, Yixin
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2139 - 2153
[40] Indirect adaptive control of nonlinear dynamic systems using self recurrent wavelet neural networks via adaptive learning rates
Yoo, Sung Jin
Park, Jin Bae
Choi, Yoon Ho
[J]. INFORMATION SCIENCES, 2007, 177 (15) : 3074 - 3098

← 1 2 3 4 5 →