Asynchronous iterative Q-learning based tracking control for nonlinear discrete-time multi-agent systems

被引:0
作者
Shen, Ziwen [1 ]
Dong, Tao [1 ]
Huang, Tingwen [2 ]
机构
[1] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China
[2] Shenzhen Univ Adv Technol, Fac Comp Sci & Control Engn, Shenzhen 518055, Peoples R China
关键词
Multi-agent; Discrete-time; Asynchronous iterative Q-learning; Tracking control; OPTIMAL CONSENSUS CONTROL;
D O I
10.1016/j.neunet.2024.106667
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the tracking control problem of nonlinear discrete-time multi-agent systems (MASs). First, a local neighborhood error system (LNES) is constructed. Then, a novel tracking algorithm based on asynchronous iterative Q-learning (AIQL) is developed, which can transform the tracking problem into the optimal regulation of LNES. The AIQL-based algorithm has two Q values Q(i)(A) and Q(i)(B) for each agent i , where Q(i)(A) is used for improving the control policy and Q(i)(B) is used for evaluating the value of the control policy. Moreover, the convergence of LNES is given. It is shown that the LNES converges to 0 and the tracking problem is solved. A neural network-based actor-critic framework is used to implement AIQL. The critic network of AIQL is composed of two neural networks, which are used for approximating Q(i)(A) and Q(i)(B) respectively. Finally, simulation results are given to verify the performance of the developed algorithm. It is shown that the AIQLbased tracking algorithm has a lower cost value and faster convergence speed than the IQL-based tracking algorithm.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Coordinated Fuzzy Adaptive Iterative Learning Control of Consensus for Unknown Nonlinear Multi-agent Systems
    Liang, Mengdan
    Li, Junmin
    Li, Jinsha
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2022, 24 (06) : 3000 - 3014
  • [42] Cooperative robust containment control for general discrete-time multi-agent systems with external disturbance
    Liang, Hongjing
    Li, Hongyi
    Yu, Zhandong
    Li, Ping
    Wang, Wei
    IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (12) : 1928 - 1937
  • [43] Ultra-fast Tracking Control of High-order Discrete-time Multi-agent Systems with H∞ Performance Specification
    Zhang, Wenle
    Tang, Yang
    Du, Wenli
    He, Wangli
    PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 999 - 1004
  • [44] Reduced-order interval observer-based coordination control for discrete-time multi-agent systems
    Su, Housheng
    Luo, Miaohong
    Zeng, Zhigang
    AUTOMATICA, 2025, 174
  • [45] Distributed Bipartite Output Formation Control for Heterogeneous Discrete-Time Linear Multi-Agent Systems
    Zhang, Jie
    Yao, Yao
    Wang, Jian-An
    Li, Zhiqiang
    Feng, Penghui
    Bai, Wulin
    IEEE ACCESS, 2024, 12 : 18901 - 18912
  • [46] Tracking control of discrete-time Markovian jump systems
    Tian, Guangtai
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2020, 51 (15) : 3070 - 3080
  • [47] Multi-Agent Q-Learning for Power Allocation in Interference Channel
    Wongphatcharatham, Tanutsorn
    Phakphisut, Watid
    Wijitpornchai, Thongchai
    Areeprayoonkij, Poonlarp
    Jaruvitayakovit, Tanun
    Hannanta-Anan, Pimkhuan
    2022 37TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2022), 2022, : 876 - 879
  • [48] Cooperative Output Regulation of Discrete-Time Linear Multi-Agent Systems Based on State Observer
    Zheng, Mengna
    Gao, Jinfeng
    Wang, Huijiao
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 7214 - 7218
  • [49] Convergence of Fractional-order Discrete-time Multi-agent Systems with A Leader
    Liu Bo
    Han Xiao
    Zhang Junjun
    Sun Dehui
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 7322 - 7326
  • [50] Accelerated Consensus of Discrete-Time Multi-agent Systems under Switching Topologies
    Li Min
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 7202 - 7206