Data-based optimal coordination control of continuous-time nonlinear multi-agent systems via adaptive dynamic programming method

被引：4

作者：

Shi, Jing ^{[1
,2
]}

Yue, Dong ^{[1
,2
,3
]}

Xie, Xiangpeng ^{[3
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Coll Artificial Intelligence, Nanjing 210023, Peoples R China

[3] Nanjing Univ Posts & Telecommun, Inst Adv Technol, Nanjing 210023, Peoples R China

来源：

JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS | 2020年 / 357卷 / 15期

基金：

中国国家自然科学基金;

关键词：

APPROXIMATE OPTIMAL-CONTROL; OPTIMAL CONSENSUS CONTROL; SWITCHING TOPOLOGY; LEARNING SOLUTION; FEEDBACK-CONTROL; GAMES; NETWORKS;

D O I：

10.1016/j.jfranklin.2020.08.007

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper focuses on the optimal coordination control problem for continuous-time nonlinear multi-agent systems with completely unknown dynamics via a data-based distributed adaptive dynamic programming method. As for most real-world applications, accurate system models are complicated to obtain, which restricts the application of the conventional methods. Moreover, it is challenging to design optimal coordination control of multi-agent systems especially for the time-varying communication topology. To deal with the difficulties, we investigate a distributed adaptive dynamic programming method with identifier-critic architecture under the switching communication topology. First, using the available system data, an online adaptive identifier is developed to approximate the unknown model dynamics, and simultaneously a critic neural network is employed for approximation of the optimal cost function, which yields approximated optimal coordination control in real time. Then, we analyze the stability of our proposed scheme. Eventually, the simulation illustrates the effectiveness of the developed method. (C) 2020 The Franklin Institute. Published by Elsevier Ltd. All rights reserved.

引用

页码：10312 / 10328

页数：17

共 45 条

[1] Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
Abu-Khalaf, M
Lewis, FL
[J]. AUTOMATICA, 2005, 41 (05) : 779 - 791
[2] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
Al-Tamimi, Asma
Lewis, Frank L.
Abu-Khalaf, Murad
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 943 - 949
[3] [Anonymous], 2011, ADAPTIVE CONTROL STA
[4] Baird L.C., 1993, Technical report wl-tr-93-1146
[5] Adaptive-critic-based neural networks for aircraft optimal control
Balakrishnan, SN
Biega, V
[J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1996, 19 (04) : 893 - 898
[6] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
Bhasin, S.
Kamalapurkar, R.
Johnson, M.
Vamvoudakis, K. G.
Lewis, F. L.
Dixon, W. E.
[J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
[7] ASYMPTOTIC AGREEMENT IN DISTRIBUTED ESTIMATION
BORKAR, V
VARAIYA, PP
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1982, 27 (03) : 650 - 655
[8] Reinforcement learning in continuous time and space
Doya, K
[J]. NEURAL COMPUTATION, 2000, 12 (01) : 219 - 245
[9] Robust ADP Design for Continuous-Time Nonlinear Systems With Output Constraints
Fan, Bo
Yang, Qinmin
Tang, Xiaoyu
Sun, Youxian
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2127 - 2138
[10] Adaptive feedback control by constrained approximate dynamic programming
Ferrari, Silvia
Steck, James E.
Chandramohan, Rajeev
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 982 - 987

← 1 2 3 4 5 →