Data-Based Optimal Synchronization Control for Discrete-Time Nonlinear Heterogeneous Multiagent Systems

被引:16
作者
Fu, Hao [1 ,2 ]
Chen, Xin [1 ,2 ]
Wang, Wei [3 ]
Wu, Min [1 ,2 ]
机构
[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China
[3] WISDRI Engn & Res Inc Ltd, Res & Dev Inst, Wuhan 430223, Peoples R China
基金
中国国家自然科学基金;
关键词
Synchronization; Nickel; Performance analysis; Mathematical model; Adaptation models; Decentralized control; Multi-agent systems; Approximate dynamic programming (ADP); discrete time; model reference adaptive control (MRAC); multiagent systems (MASs); optimal synchronization; policy iteration; APPROXIMATE OPTIMAL-CONTROL; OPTIMAL CONSENSUS CONTROL; OUTPUT SYNCHRONIZATION; ADAPTIVE-CONTROL; GRAPHICAL GAMES;
D O I
10.1109/TCYB.2020.3004494
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article investigates the optimal synchronization problem for unknown discrete-time nonlinear heterogeneous multiagent systems (MASs). It is very intractable to derive the analytical solutions of coupled Bellman's equations, which are necessary to overcome this problem. We propose a data-based optimal synchronization control strategy based on a hierarchical and distributed optimal control framework composed of a model reference adaptive control (MRAC) layer and a distributed control layer. In the MRAC layer, the similar-offline MRAC algorithm is developed to make subsystems of MASs track their reference models, respectively. Then, the distributed optimal control problem of nonlinear heterogeneous MASs is transformed into that of homogeneous MASs composed of the reference models and the leader. In the distributed control layer, the distributed reference policy iteration algorithm is proposed to derive the solutions of coupled composite nonlinear Bellman's equations, which ensure that the homogeneous MASs reach synchronization with optimum. The suboptimal synchronization control is achieved via optimization further. Convergence analysis of both algorithms is rigorously provided. The simulation results verify the effectiveness of the proposed strategy.
引用
收藏
页码:2477 / 2490
页数:14
相关论文
共 48 条
  • [1] Abouheaf M, 2013, P AMER CONTR CONF, P4189
  • [2] Multi-agent discrete-time graphical games and reinforcement learning solutions
    Abouheaf, Mohammed I.
    Lewis, Frank L.
    Vamvoudakis, Kyriakos G.
    Haesaert, Sofie
    Babuska, Robert
    [J]. AUTOMATICA, 2014, 50 (12) : 3038 - 3053
  • [3] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    Al-Tamimi, Asma
    Lewis, Frank L.
    Abu-Khalaf, Murad
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 943 - 949
  • [4] [Anonymous], 1986, ROBUST ADAPTIVE CONT
  • [5] Gaussian-kernel-based adaptive critic design using two-phase value iteration
    Chen, Xin
    Wang, Wei
    Cao, Weihua
    Wu, Min
    [J]. INFORMATION SCIENCES, 2019, 482 : 139 - 155
  • [6] A Minimal Control Multiagent for Collision Avoidance and Velocity Alignment
    Chen, Zhiyong
    Zhang, Hai-Tao
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (08) : 2185 - 2192
  • [7] Adaptive Neural Network-Based Finite-Time Online Optimal Tracking Control of the Nonlinear System With Dead Zone
    Ding, Liang
    Li, Shu
    Gao, Haibo
    Liu, Yan-Jun
    Huang, Lan
    Deng, Zongquan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (01) : 382 - 392
  • [8] CONTROLLABILITY THEORY FOR NONLINEAR SYSTEMS
    GERSHWIN, SB
    JACOBSON, DH
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1971, AC16 (01) : 37 - &
  • [9] Distributed Model Predictive Control for Smart Energy Systems
    Halvgaard, Rasmus
    Vandenberghe, Lieven
    Poulsen, Niels Kjolstad
    Madsen, Henrik
    Jorgensen, John Bagterp
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2016, 7 (03) : 1675 - 1682
  • [10] Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints
    He, Pingan
    Jagannathan, S.
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02): : 425 - 436