Data-based optimal coordination control of continuous-time nonlinear multi-agent systems via adaptive dynamic programming method

被引:4
作者
Shi, Jing [1 ,2 ]
Yue, Dong [1 ,2 ,3 ]
Xie, Xiangpeng [3 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Artificial Intelligence, Nanjing 210023, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Inst Adv Technol, Nanjing 210023, Peoples R China
来源
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS | 2020年 / 357卷 / 15期
基金
中国国家自然科学基金;
关键词
APPROXIMATE OPTIMAL-CONTROL; OPTIMAL CONSENSUS CONTROL; SWITCHING TOPOLOGY; LEARNING SOLUTION; FEEDBACK-CONTROL; GAMES; NETWORKS;
D O I
10.1016/j.jfranklin.2020.08.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper focuses on the optimal coordination control problem for continuous-time nonlinear multi-agent systems with completely unknown dynamics via a data-based distributed adaptive dynamic programming method. As for most real-world applications, accurate system models are complicated to obtain, which restricts the application of the conventional methods. Moreover, it is challenging to design optimal coordination control of multi-agent systems especially for the time-varying communication topology. To deal with the difficulties, we investigate a distributed adaptive dynamic programming method with identifier-critic architecture under the switching communication topology. First, using the available system data, an online adaptive identifier is developed to approximate the unknown model dynamics, and simultaneously a critic neural network is employed for approximation of the optimal cost function, which yields approximated optimal coordination control in real time. Then, we analyze the stability of our proposed scheme. Eventually, the simulation illustrates the effectiveness of the developed method. (C) 2020 The Franklin Institute. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:10312 / 10328
页数:17
相关论文
共 45 条
  • [1] Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    Abu-Khalaf, M
    Lewis, FL
    [J]. AUTOMATICA, 2005, 41 (05) : 779 - 791
  • [2] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    Al-Tamimi, Asma
    Lewis, Frank L.
    Abu-Khalaf, Murad
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 943 - 949
  • [3] [Anonymous], 2011, ADAPTIVE CONTROL STA
  • [4] Baird L.C., 1993, Technical report wl-tr-93-1146
  • [5] Adaptive-critic-based neural networks for aircraft optimal control
    Balakrishnan, SN
    Biega, V
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1996, 19 (04) : 893 - 898
  • [6] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
    Bhasin, S.
    Kamalapurkar, R.
    Johnson, M.
    Vamvoudakis, K. G.
    Lewis, F. L.
    Dixon, W. E.
    [J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
  • [7] ASYMPTOTIC AGREEMENT IN DISTRIBUTED ESTIMATION
    BORKAR, V
    VARAIYA, PP
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1982, 27 (03) : 650 - 655
  • [8] Reinforcement learning in continuous time and space
    Doya, K
    [J]. NEURAL COMPUTATION, 2000, 12 (01) : 219 - 245
  • [9] Robust ADP Design for Continuous-Time Nonlinear Systems With Output Constraints
    Fan, Bo
    Yang, Qinmin
    Tang, Xiaoyu
    Sun, Youxian
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2127 - 2138
  • [10] Adaptive feedback control by constrained approximate dynamic programming
    Ferrari, Silvia
    Steck, James E.
    Chandramohan, Rajeev
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 982 - 987