Asynchronous iterative Q-learning based tracking control for nonlinear discrete-time multi-agent systems

Cited by: 0
Authors
Shen, Ziwen [1]
Dong, Tao [1]
Huang, Tingwen [2]
Affiliations
[1] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China
[2] Shenzhen Univ Adv Technol, Fac Comp Sci & Control Engn, Shenzhen 518055, Peoples R China
Keywords
Multi-agent; Discrete-time; Asynchronous iterative Q-learning; Tracking control; Optimal consensus control
DOI
10.1016/j.neunet.2024.106667
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the tracking control problem for nonlinear discrete-time multi-agent systems (MASs). First, a local neighborhood error system (LNES) is constructed. Then, a novel tracking algorithm based on asynchronous iterative Q-learning (AIQL) is developed, which transforms the tracking problem into the optimal regulation of the LNES. The AIQL-based algorithm maintains two Q values, $Q_i^A$ and $Q_i^B$, for each agent $i$, where $Q_i^A$ is used to improve the control policy and $Q_i^B$ is used to evaluate the value of the control policy. Moreover, a convergence analysis is given, showing that the LNES converges to 0 and that the tracking problem is therefore solved. A neural network-based actor-critic framework is used to implement AIQL; its critic consists of two neural networks that approximate $Q_i^A$ and $Q_i^B$, respectively. Finally, simulation results verify the performance of the developed algorithm and show that the AIQL-based tracking algorithm achieves a lower cost value and faster convergence than the IQL-based tracking algorithm.
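The abstract does not state how the local neighborhood error is defined. As a rough illustration only, the sketch below assumes the form commonly used in leader-follower consensus work, e_i = sum_j a_ij (x_i - x_j) + b_i (x_i - x_0), with scalar states. The function name, the adjacency weights a_ij, the pinning gains b_i, and the three-agent topology are all hypothetical and are not taken from the paper.

```python
def local_neighborhood_errors(states, leader, adjacency, pinning):
    """Assumed form: e_i = sum_j a_ij (x_i - x_j) + b_i (x_i - x_0)."""
    n = len(states)
    errors = []
    for i in range(n):
        # Disagreement with neighbors, weighted by the (assumed) adjacency matrix.
        e = sum(adjacency[i][j] * (states[i] - states[j]) for j in range(n))
        # Disagreement with the leader, weighted by the (assumed) pinning gain.
        e += pinning[i] * (states[i] - leader)
        errors.append(e)
    return errors


# Hypothetical 3-agent chain: agent 0 observes the leader,
# agent 1 listens to agent 0, agent 2 listens to agent 1.
adjacency = [[0, 0, 0],
             [1, 0, 0],
             [0, 1, 0]]
pinning = [1, 0, 0]
leader = 1.0

# All agents on the leader trajectory: every local error vanishes.
tracking = local_neighborhood_errors([1.0, 1.0, 1.0], leader, adjacency, pinning)

# Agent 0 drifts away: its own error and that of its listener become nonzero.
offset = local_neighborhood_errors([2.0, 1.0, 1.0], leader, adjacency, pinning)
```

Under this assumed definition, driving every e_i to 0 on a graph where each agent is reachable from the leader forces all agents onto the leader state, which is the sense in which tracking reduces to regulating the error system.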
Pages: 13
Related papers
50 in total
  • [1] Consensus of discrete-time multi-agent system based on Q-learning
    Zhu Z.-B.
    Wang F.-Y.
    Yin Y.-H.
    Liu Z.-X.
    Chen Z.-Q.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2021, 38 (07) : 997 - 1005
  • [2] Group consensus for discrete-time multi-agent systems based on iterative learning control
    Gao, Qianhui
    Li, Jinsha
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023 : 271 - 276
  • [3] Reinforcement Q-learning and Optimal Tracking Control of Unknown Discrete-time Multi-player Systems Based on Game Theory
    Zhao, Jin-Gang
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 22 (05) : 1751 - 1759
  • [4] Robust H∞ Output Consensus in Heterogeneous Multi-agent Discrete-Time Systems Using Q-Learning Algorithm
    Valadbeigi, Amir Parviz
    Soltanian, Farzad
    Shasadeghi, Mokhtar
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2025 : 251 - 263
  • [5] Event-triggered tracking control for discrete-time multi-agent systems
    Yin, Xiuxia
    Yue, Dong
    Su, Housheng
    IMA JOURNAL OF MATHEMATICAL CONTROL AND INFORMATION, 2014, 31 (02) : 165 - 182
  • [6] Data-Driven Tracking Control for Multi-Agent Systems With Unknown Dynamics via Multithreading Iterative Q-Learning
    Dong, Tao
    Gong, Xiaomei
    Wang, Aijuan
    Li, Huaqing
    Huang, Tingwen
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (04) : 2533 - 2542
  • [7] Asynchronous impulsive consensus of discrete-time nonlinear multi-agent systems with time-varying delays
    Zhang, Qunjiao
    Luo, Juan
    Tong, Ping
    Wan, Li
    Wu, Xiaoqun
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2024, 645
  • [8] Decentralized Adaptive Tracking Control of a Class of Nonlinear Discrete-Time Coupled Multi-Agent Systems With Unknown Dynamics
    Zhang, Xinghong
    IEEE ACCESS, 2020, 8 : 55927 - 55936
  • [9] Disturbance rejection of a class of discrete-time multi-agent systems with pinning control
    Zhu, Qiuguo
    Wu, Jun
    Xiong, Rong
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2019, 16 (06)
  • [10] Adaptive preview consensus control for discrete-time nonlinear multi-agent systems with unknown control directions
    Ren, Chang-E
    Chen, C. L. Philip
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2020, 42 (15) : 2941 - 2950