Optimal Consensus Control Design for Multiagent Systems With Multiple Time Delay Using Adaptive Dynamic Programming

被引：90

作者：

Zhang, Huaguang ^{[1
,2
]}

Ren, He ^{[2
]}

Mu, Yunfei ^{[2
]}

Han, Ji ^{[2
]}

机构：

[1] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China

[2] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2022年 / 52卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Games; Delay effects; Delays; Consensus control; Synchronization; Optimal control; System dynamics; Adaptive dynamic programming (ADP); data-based optimal control; multiagent systems (MASs); reinforcement learning (RL); time delay; DIFFERENTIAL GRAPHICAL GAMES; OPTIMAL-CONTROL SCHEME; SYNCHRONIZATION; ALGORITHMS; OBSERVER; FEEDBACK;

D O I：

10.1109/TCYB.2021.3090067

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this article, a novel data-based adaptive dynamic programming (ADP) method is presented to solve the optimal consensus tracking control problem for discrete-time (DT) multiagent systems (MASs) with multiple time delays. Necessary and sufficient conditions of the corresponding equivalent time-delay system are provided on the basis of the causal transformations. Benefitting from the construction of tracking error dynamics, the optimal tracking problem can be transformed into settling the Nash-equilibrium in the graphical game, which can be completed by solving the coupled Hamilton-Jacobi (HJ) equations. An error estimator is introduced to construct the tracking error of the MASs only using the input and output (I/O) data. Therefore, the designed data-based ADP algorithm can minimize the cost functions and ensure the consensus of MASs without the knowledge of system dynamics. Finally, a numerical example is given to demonstrate the effectiveness of the proposed method.

引用

页码：12832 / 12842

页数：11

共 54 条

[1]

Al-Tamimi A., 2011, AUTOMATICA, V47, P207

[2] An Overview of Recent Progress in the Study of Distributed Multi-Agent Coordination [J].

Cao, Yongcan ;

Yu, Wenwu ;

Ren, Wei ;

Chen, Guanrong .

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2013, 9 (01) :427-438

[3] Output-feedback adaptive optimal control of interconnected systems based on robust adaptive dynamic programming [J].

Gao, Weinan ;

Jiang, Yu ;

Jiang, Zhong-Ping ;

Chai, Tianyou .

AUTOMATICA, 2016, 72 :37-45

[4] Equivalence of Linear Time-Delay Systems [J].

Garate-Garcia, Araceli ;

Alejandro Marquez-Martinez, Luis ;

Moog, Claude H. .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2011, 56 (03) :666-670

[5]

Gu G, 2012, Discrete-time linear systems: theory and design with applications

[6] Intermediate Observer-Based Robust Distributed Fault Estimation for Nonlinear Multiagent Systems With Directed Graphs [J].

Han, Jian ;

Liu, Xiuhua ;

Gao, Xianwen ;

Wei, Xinjiang .

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (12) :7426-7436

[7] Synchronization of discrete-time multi-agent systems on graphs using Riccati design [J].

Hengster-Movric, Kristian ;

You, Keyou ;

Lewis, Frank L. ;

Xie, Lihua .

AUTOMATICA, 2013, 49 (02) :414-423

[8] Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control [J].

Jiao, Qiang ;

Modares, Hamidreza ;

Xu, Shengyuan ;

Lewis, Frank L. ;

Vamvoudakis, Kyriakos G. .

AUTOMATICA, 2016, 69 :24-34

[9] Overview: Collective Control of Multiagent Systems [J].

Knorn, Steffi ;

Chen, Zhiyong ;

Middleton, Richard H. .

IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2016, 3 (04) :334-347

[10]

Lewis FL, 2014, COMMUN CONTROL ENG, P1, DOI 10.1007/978-1-4471-5574-4

← 1 2 3 4 5 6 →