Optimal Consensus Control Design for Multiagent Systems With Multiple Time Delay Using Adaptive Dynamic Programming

被引:90
作者
Zhang, Huaguang [1 ,2 ]
Ren, He [2 ]
Mu, Yunfei [2 ]
Han, Ji [2 ]
机构
[1] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Games; Delay effects; Delays; Consensus control; Synchronization; Optimal control; System dynamics; Adaptive dynamic programming (ADP); data-based optimal control; multiagent systems (MASs); reinforcement learning (RL); time delay; DIFFERENTIAL GRAPHICAL GAMES; OPTIMAL-CONTROL SCHEME; SYNCHRONIZATION; ALGORITHMS; OBSERVER; FEEDBACK;
D O I
10.1109/TCYB.2021.3090067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, a novel data-based adaptive dynamic programming (ADP) method is presented to solve the optimal consensus tracking control problem for discrete-time (DT) multiagent systems (MASs) with multiple time delays. Necessary and sufficient conditions of the corresponding equivalent time-delay system are provided on the basis of the causal transformations. Benefitting from the construction of tracking error dynamics, the optimal tracking problem can be transformed into settling the Nash-equilibrium in the graphical game, which can be completed by solving the coupled Hamilton-Jacobi (HJ) equations. An error estimator is introduced to construct the tracking error of the MASs only using the input and output (I/O) data. Therefore, the designed data-based ADP algorithm can minimize the cost functions and ensure the consensus of MASs without the knowledge of system dynamics. Finally, a numerical example is given to demonstrate the effectiveness of the proposed method.
引用
收藏
页码:12832 / 12842
页数:11
相关论文
共 54 条
[1]  
Al-Tamimi A., 2011, AUTOMATICA, V47, P207
[2]   An Overview of Recent Progress in the Study of Distributed Multi-Agent Coordination [J].
Cao, Yongcan ;
Yu, Wenwu ;
Ren, Wei ;
Chen, Guanrong .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2013, 9 (01) :427-438
[3]   Output-feedback adaptive optimal control of interconnected systems based on robust adaptive dynamic programming [J].
Gao, Weinan ;
Jiang, Yu ;
Jiang, Zhong-Ping ;
Chai, Tianyou .
AUTOMATICA, 2016, 72 :37-45
[4]   Equivalence of Linear Time-Delay Systems [J].
Garate-Garcia, Araceli ;
Alejandro Marquez-Martinez, Luis ;
Moog, Claude H. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2011, 56 (03) :666-670
[5]  
Gu G, 2012, Discrete-time linear systems: theory and design with applications
[6]   Intermediate Observer-Based Robust Distributed Fault Estimation for Nonlinear Multiagent Systems With Directed Graphs [J].
Han, Jian ;
Liu, Xiuhua ;
Gao, Xianwen ;
Wei, Xinjiang .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (12) :7426-7436
[7]   Synchronization of discrete-time multi-agent systems on graphs using Riccati design [J].
Hengster-Movric, Kristian ;
You, Keyou ;
Lewis, Frank L. ;
Xie, Lihua .
AUTOMATICA, 2013, 49 (02) :414-423
[8]   Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control [J].
Jiao, Qiang ;
Modares, Hamidreza ;
Xu, Shengyuan ;
Lewis, Frank L. ;
Vamvoudakis, Kyriakos G. .
AUTOMATICA, 2016, 69 :24-34
[9]   Overview: Collective Control of Multiagent Systems [J].
Knorn, Steffi ;
Chen, Zhiyong ;
Middleton, Richard H. .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2016, 3 (04) :334-347
[10]  
Lewis FL, 2014, COMMUN CONTROL ENG, P1, DOI 10.1007/978-1-4471-5574-4