Data-Based Optimal Synchronization Control for Discrete-Time Nonlinear Heterogeneous Multiagent Systems

被引：16

作者：

Fu, Hao ^{[1
,2
]}

Chen, Xin ^{[1
,2
]}

Wang, Wei ^{[3
]}

Wu, Min ^{[1
,2
]}

机构：

[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China

[2] China Univ Geosci, Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China

[3] WISDRI Engn & Res Inc Ltd, Res & Dev Inst, Wuhan 430223, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2022年 / 52卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Synchronization; Nickel; Performance analysis; Mathematical model; Adaptation models; Decentralized control; Multi-agent systems; Approximate dynamic programming (ADP); discrete time; model reference adaptive control (MRAC); multiagent systems (MASs); optimal synchronization; policy iteration; APPROXIMATE OPTIMAL-CONTROL; OPTIMAL CONSENSUS CONTROL; OUTPUT SYNCHRONIZATION; ADAPTIVE-CONTROL; GRAPHICAL GAMES;

D O I：

10.1109/TCYB.2020.3004494

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article investigates the optimal synchronization problem for unknown discrete-time nonlinear heterogeneous multiagent systems (MASs). It is very intractable to derive the analytical solutions of coupled Bellman's equations, which are necessary to overcome this problem. We propose a data-based optimal synchronization control strategy based on a hierarchical and distributed optimal control framework composed of a model reference adaptive control (MRAC) layer and a distributed control layer. In the MRAC layer, the similar-offline MRAC algorithm is developed to make subsystems of MASs track their reference models, respectively. Then, the distributed optimal control problem of nonlinear heterogeneous MASs is transformed into that of homogeneous MASs composed of the reference models and the leader. In the distributed control layer, the distributed reference policy iteration algorithm is proposed to derive the solutions of coupled composite nonlinear Bellman's equations, which ensure that the homogeneous MASs reach synchronization with optimum. The suboptimal synchronization control is achieved via optimization further. Convergence analysis of both algorithms is rigorously provided. The simulation results verify the effectiveness of the proposed strategy.

引用

页码：2477 / 2490

页数：14

共 48 条

[1] Abouheaf M, 2013, P AMER CONTR CONF, P4189
[2] Multi-agent discrete-time graphical games and reinforcement learning solutions
Abouheaf, Mohammed I.
Lewis, Frank L.
Vamvoudakis, Kyriakos G.
Haesaert, Sofie
Babuska, Robert
[J]. AUTOMATICA, 2014, 50 (12) : 3038 - 3053
[3] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
Al-Tamimi, Asma
Lewis, Frank L.
Abu-Khalaf, Murad
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 943 - 949
[4] [Anonymous], 1986, ROBUST ADAPTIVE CONT
[5] Gaussian-kernel-based adaptive critic design using two-phase value iteration
Chen, Xin
Wang, Wei
Cao, Weihua
Wu, Min
[J]. INFORMATION SCIENCES, 2019, 482 : 139 - 155
[6] A Minimal Control Multiagent for Collision Avoidance and Velocity Alignment
Chen, Zhiyong
Zhang, Hai-Tao
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (08) : 2185 - 2192
[7] Adaptive Neural Network-Based Finite-Time Online Optimal Tracking Control of the Nonlinear System With Dead Zone
Ding, Liang
Li, Shu
Gao, Haibo
Liu, Yan-Jun
Huang, Lan
Deng, Zongquan
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (01) : 382 - 392
[8] CONTROLLABILITY THEORY FOR NONLINEAR SYSTEMS
GERSHWIN, SB
JACOBSON, DH
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1971, AC16 (01) : 37 - &
[9] Distributed Model Predictive Control for Smart Energy Systems
Halvgaard, Rasmus
Vandenberghe, Lieven
Poulsen, Niels Kjolstad
Madsen, Henrik
Jorgensen, John Bagterp
[J]. IEEE TRANSACTIONS ON SMART GRID, 2016, 7 (03) : 1675 - 1682
[10] Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints
He, Pingan
Jagannathan, S.
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02): : 425 - 436

← 1 2 3 4 5 →