Data-Based Optimal Synchronization Control for Discrete-Time Nonlinear Heterogeneous Multiagent Systems

被引:16
作者
Fu, Hao [1 ,2 ]
Chen, Xin [1 ,2 ]
Wang, Wei [3 ]
Wu, Min [1 ,2 ]
机构
[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China
[3] WISDRI Engn & Res Inc Ltd, Res & Dev Inst, Wuhan 430223, Peoples R China
基金
中国国家自然科学基金;
关键词
Synchronization; Nickel; Performance analysis; Mathematical model; Adaptation models; Decentralized control; Multi-agent systems; Approximate dynamic programming (ADP); discrete time; model reference adaptive control (MRAC); multiagent systems (MASs); optimal synchronization; policy iteration; APPROXIMATE OPTIMAL-CONTROL; OPTIMAL CONSENSUS CONTROL; OUTPUT SYNCHRONIZATION; ADAPTIVE-CONTROL; GRAPHICAL GAMES;
D O I
10.1109/TCYB.2020.3004494
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article investigates the optimal synchronization problem for unknown discrete-time nonlinear heterogeneous multiagent systems (MASs). It is very intractable to derive the analytical solutions of coupled Bellman's equations, which are necessary to overcome this problem. We propose a data-based optimal synchronization control strategy based on a hierarchical and distributed optimal control framework composed of a model reference adaptive control (MRAC) layer and a distributed control layer. In the MRAC layer, the similar-offline MRAC algorithm is developed to make subsystems of MASs track their reference models, respectively. Then, the distributed optimal control problem of nonlinear heterogeneous MASs is transformed into that of homogeneous MASs composed of the reference models and the leader. In the distributed control layer, the distributed reference policy iteration algorithm is proposed to derive the solutions of coupled composite nonlinear Bellman's equations, which ensure that the homogeneous MASs reach synchronization with optimum. The suboptimal synchronization control is achieved via optimization further. Convergence analysis of both algorithms is rigorously provided. The simulation results verify the effectiveness of the proposed strategy.
引用
收藏
页码:2477 / 2490
页数:14
相关论文
共 48 条
  • [31] Prioritizing Useful Experience Replay for Heuristic Dynamic Programming-Based Learning Systems
    Ni, Zhen
    Malla, Naresh
    Zhong, Xiangnan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (11) : 3911 - 3922
  • [32] Complete stability analysis of a heuristic approximate dynamic programming control design
    Sokolov, Yury
    Kozma, Robert
    Werbos, Ludmilla D.
    Werbos, Paul J.
    [J]. AUTOMATICA, 2015, 59 : 9 - 18
  • [33] Cooperative Output Regulation With Application to Multi-Agent Consensus Under Switching Network
    Su, Youfeng
    Huang, Jie
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03): : 864 - 875
  • [34] Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality
    Vamvoudakis, Kyriakos G.
    Lewis, Frank L.
    Hudas, Greg R.
    [J]. AUTOMATICA, 2012, 48 (08) : 1598 - 1611
  • [35] Intelligent Critic Control With Disturbance Attenuation for Affine Dynamics Including an Application to a Microgrid System
    Wang, Ding
    He, Haibo
    Mu, Chaoxu
    Liu, Derong
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 64 (06) : 4935 - 4944
  • [36] Optimal Consensus Control for Heterogeneous Nonlinear Multiagent Systems with Partially Unknown Dynamics
    Wang, Tao
    Fu, Hao
    Li, Jinbin
    Zhang, Yaodong
    Zhou, Xinfeng
    Chen, Xin
    [J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2019, 17 (09) : 2400 - 2413
  • [37] Model-Free Distributed Consensus Control Based on Actor-Critic Framework for Discrete-Time Nonlinear Multiagent Systems
    Wang, Wei
    Chen, Xin
    Fu, Hao
    Wu, Min
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4123 - 4134
  • [38] Model-free optimal containment control of multi-agent systems based on actor-critic framework
    Wang, Wei
    Chen, Xin
    [J]. NEUROCOMPUTING, 2018, 314 : 242 - 250
  • [39] Leader-Follower Output Synchronization of Linear Heterogeneous Systems With Active Leader Using Reinforcement Learning
    Yang, Yongliang
    Modares, Hamidreza
    Wunsch, Donald C., II
    Yin, Yixin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2139 - 2153
  • [40] Indirect adaptive control of nonlinear dynamic systems using self recurrent wavelet neural networks via adaptive learning rates
    Yoo, Sung Jin
    Park, Jin Bae
    Choi, Yoon Ho
    [J]. INFORMATION SCIENCES, 2007, 177 (15) : 3074 - 3098