Asynchronous iterative Q-learning based tracking control for nonlinear discrete-time multi-agent systems

被引:0
作者
Shen, Ziwen [1 ]
Dong, Tao [1 ]
Huang, Tingwen [2 ]
机构
[1] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China
[2] Shenzhen Univ Adv Technol, Fac Comp Sci & Control Engn, Shenzhen 518055, Peoples R China
关键词
Multi-agent; Discrete-time; Asynchronous iterative Q-learning; Tracking control; OPTIMAL CONSENSUS CONTROL;
D O I
10.1016/j.neunet.2024.106667
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the tracking control problem of nonlinear discrete-time multi-agent systems (MASs). First, a local neighborhood error system (LNES) is constructed. Then, a novel tracking algorithm based on asynchronous iterative Q-learning (AIQL) is developed, which can transform the tracking problem into the optimal regulation of LNES. The AIQL-based algorithm has two Q values Q(i)(A) and Q(i)(B) for each agent i , where Q(i)(A) is used for improving the control policy and Q(i)(B) is used for evaluating the value of the control policy. Moreover, the convergence of LNES is given. It is shown that the LNES converges to 0 and the tracking problem is solved. A neural network-based actor-critic framework is used to implement AIQL. The critic network of AIQL is composed of two neural networks, which are used for approximating Q(i)(A) and Q(i)(B) respectively. Finally, simulation results are given to verify the performance of the developed algorithm. It is shown that the AIQLbased tracking algorithm has a lower cost value and faster convergence speed than the IQL-based tracking algorithm.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Cooperative adaptive optimal output regulation of nonlinear discrete-time multi-agent systems
    Jiang, Yi
    Fan, Jialu
    Gao, Weinan
    Chai, Tianyou
    Lewis, Frank L.
    AUTOMATICA, 2020, 121
  • [22] Consensus of a class of discrete-time nonlinear multi-agent systems in the presence of communication delays
    Hu, Haiyun
    Lin, Zongli
    ISA TRANSACTIONS, 2017, 71 : 10 - 20
  • [23] Distributed adaptive containment control for a class of discrete-time nonlinear multi-agent systems with unknown parameters and control gains
    Li, Nannan
    Fei, Qing
    Ma, Hongbin
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2020, 357 (13): : 8566 - 8590
  • [24] Consensus of discrete-time multi-agent systems with adversaries and time delays
    Wu, Yiming
    He, Xiongxiong
    Liu, Shuai
    Xie, Lihua
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2014, 43 (3-4) : 402 - 411
  • [25] Cooperative containment of discrete-time linear multi-agent systems
    Ma, Qian
    Lewis, Frank L.
    Xu, Shengyuan
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2015, 25 (07) : 1007 - 1018
  • [26] Weight Conditions for Consensus of Discrete-Time Multi-Agent Systems
    Zhang Ya
    Tian Yu-Ping
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 4841 - 4845
  • [27] Equilibrium of consensus problems for discrete-time multi-agent systems
    Li, Jun-Bing
    Yan, Wei-Sheng
    Fang, Xin-Peng
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2013, 30 (04): : 513 - 518
  • [28] Leader-Following Consensus of Discrete-Time Nonlinear Multi-Agent Systems with Asymmetric Saturation Impulsive Control
    Yuan, Qiao
    Chen, Guorong
    Tian, Yuan
    Yuan, Yu
    Zhang, Qian
    Wang, Xiaonan
    Liu, Jingcheng
    MATHEMATICS, 2024, 12 (03)
  • [29] Q-learning solution for optimal consensus control of discrete-time multiagent systems using reinforcement learning
    Mu, Chaoxu
    Zhao, Qian
    Gao, Zhongke
    Sun, Changyin
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2019, 356 (13): : 6946 - 6967
  • [30] Analysis of nonlinear discrete-time systems with higher-order iterative learning control
    Sun, MX
    Wang, DW
    DYNAMICS AND CONTROL, 2001, 11 (01) : 81 - 96