Optimal Consensus Control Design for Multiagent Systems With Multiple Time Delay Using Adaptive Dynamic Programming

被引:72
作者
Zhang, Huaguang [1 ,2 ]
Ren, He [2 ]
Mu, Yunfei [2 ]
Han, Ji [2 ]
机构
[1] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Games; Delay effects; Delays; Consensus control; Synchronization; Optimal control; System dynamics; Adaptive dynamic programming (ADP); data-based optimal control; multiagent systems (MASs); reinforcement learning (RL); time delay; DIFFERENTIAL GRAPHICAL GAMES; OPTIMAL-CONTROL SCHEME; SYNCHRONIZATION; ALGORITHMS; OBSERVER; FEEDBACK;
D O I
10.1109/TCYB.2021.3090067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, a novel data-based adaptive dynamic programming (ADP) method is presented to solve the optimal consensus tracking control problem for discrete-time (DT) multiagent systems (MASs) with multiple time delays. Necessary and sufficient conditions of the corresponding equivalent time-delay system are provided on the basis of the causal transformations. Benefitting from the construction of tracking error dynamics, the optimal tracking problem can be transformed into settling the Nash-equilibrium in the graphical game, which can be completed by solving the coupled Hamilton-Jacobi (HJ) equations. An error estimator is introduced to construct the tracking error of the MASs only using the input and output (I/O) data. Therefore, the designed data-based ADP algorithm can minimize the cost functions and ensure the consensus of MASs without the knowledge of system dynamics. Finally, a numerical example is given to demonstrate the effectiveness of the proposed method.
引用
收藏
页码:12832 / 12842
页数:11
相关论文
共 54 条
  • [1] Al-Tamimi A., 2011, AUTOMATICA, V47, P207
  • [2] An Overview of Recent Progress in the Study of Distributed Multi-Agent Coordination
    Cao, Yongcan
    Yu, Wenwu
    Ren, Wei
    Chen, Guanrong
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2013, 9 (01) : 427 - 438
  • [3] Output-feedback adaptive optimal control of interconnected systems based on robust adaptive dynamic programming
    Gao, Weinan
    Jiang, Yu
    Jiang, Zhong-Ping
    Chai, Tianyou
    [J]. AUTOMATICA, 2016, 72 : 37 - 45
  • [4] Equivalence of Linear Time-Delay Systems
    Garate-Garcia, Araceli
    Alejandro Marquez-Martinez, Luis
    Moog, Claude H.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2011, 56 (03) : 666 - 670
  • [5] Gu G, 2012, DISCRETE TIME LINEAR, DOI DOI 10.1007/978-1-4614-2281-5
  • [6] Intermediate Observer-Based Robust Distributed Fault Estimation for Nonlinear Multiagent Systems With Directed Graphs
    Han, Jian
    Liu, Xiuhua
    Gao, Xianwen
    Wei, Xinjiang
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (12) : 7426 - 7436
  • [7] Synchronization of discrete-time multi-agent systems on graphs using Riccati design
    Hengster-Movric, Kristian
    You, Keyou
    Lewis, Frank L.
    Xie, Lihua
    [J]. AUTOMATICA, 2013, 49 (02) : 414 - 423
  • [8] Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control
    Jiao, Qiang
    Modares, Hamidreza
    Xu, Shengyuan
    Lewis, Frank L.
    Vamvoudakis, Kyriakos G.
    [J]. AUTOMATICA, 2016, 69 : 24 - 34
  • [9] Overview: Collective Control of Multiagent Systems
    Knorn, Steffi
    Chen, Zhiyong
    Middleton, Richard H.
    [J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2016, 3 (04): : 334 - 347
  • [10] Lewis FL, 2014, COMMUN CONTROL ENG, P1, DOI 10.1007/978-1-4471-5574-4