Asynchronous iterative Q-learning based tracking control for nonlinear discrete-time multi-agent systems

被引：0

作者：

Shen, Ziwen ^{[1
]}

Dong, Tao ^{[1
]}

Huang, Tingwen ^{[2
]}

机构：

[1] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China

[2] Shenzhen Univ Adv Technol, Fac Comp Sci & Control Engn, Shenzhen 518055, Peoples R China

来源：

NEURAL NETWORKS | 2024年 / 180卷

关键词：

Multi-agent; Discrete-time; Asynchronous iterative Q-learning; Tracking control; OPTIMAL CONSENSUS CONTROL;

D O I：

10.1016/j.neunet.2024.106667

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the tracking control problem of nonlinear discrete-time multi-agent systems (MASs). First, a local neighborhood error system (LNES) is constructed. Then, a novel tracking algorithm based on asynchronous iterative Q-learning (AIQL) is developed, which can transform the tracking problem into the optimal regulation of LNES. The AIQL-based algorithm has two Q values Q(i)(A) and Q(i)(B) for each agent i , where Q(i)(A) is used for improving the control policy and Q(i)(B) is used for evaluating the value of the control policy. Moreover, the convergence of LNES is given. It is shown that the LNES converges to 0 and the tracking problem is solved. A neural network-based actor-critic framework is used to implement AIQL. The critic network of AIQL is composed of two neural networks, which are used for approximating Q(i)(A) and Q(i)(B) respectively. Finally, simulation results are given to verify the performance of the developed algorithm. It is shown that the AIQLbased tracking algorithm has a lower cost value and faster convergence speed than the IQL-based tracking algorithm.

引用

页数：13

共 50 条

[1] Consensus of discrete-time multi-agent system based on Q-learning
Zhu Z.-B.
Wang F.-Y.
Yin Y.-H.
Liu Z.-X.
Chen Z.-Q.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2021, 38 (07): : 997 - 1005
[2] Group consensus for discrete-time multi-agent systems based on iterative learning control
Gao, Qianhui
Li, Jinsha
2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 271 - 276
[3] Reinforcement Q-learning and Optimal Tracking Control of Unknown Discrete-time Multi-player Systems Based on Game Theory
Zhao, Jin-Gang
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 22 (05) : 1751 - 1759
[4] Robust H∞ Output Consensus in Heterogeneous Multi-agent Discrete-Time Systems Using Q-Learning Algorithm
Valadbeigi, Amir Parviz
Soltanian, Farzad
Shasadeghi, Mokhtar
IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2025, : 251 - 263
[5] Event-triggered tracking control for discrete-time multi-agent systems
Yin, Xiuxia
Yue, Dong
Su, Housheng
IMA JOURNAL OF MATHEMATICAL CONTROL AND INFORMATION, 2014, 31 (02) : 165 - 182
[6] Data-Driven Tracking Control for Multi-Agent Systems With Unknown Dynamics via Multithreading Iterative Q-Learning
Dong, Tao
Gong, Xiaomei
Wang, Aijuan
Li, Huaqing
Huang, Tingwen
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (04): : 2533 - 2542
[7] Asynchronous impulsive consensus of discrete-time nonlinear multi-agent systems with time-varying delays
Zhang, Qunjiao
Luo, Juan
Tong, Ping
Wan, Li
Wu, Xiaoqun
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2024, 645
[8] Decentralized Adaptive Tracking Control of a Class of Nonlinear Discrete-Time Coupled Multi-Agent Systems With Unknown Dynamics
Zhang, Xinghong
IEEE ACCESS, 2020, 8 : 55927 - 55936
[9] Disturbance rejection of a class of discrete-time multi-agent systems with pinning control
Zhu, Qiuguo
Wu, Jun
Xiong, Rong
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2019, 16 (06)
[10] Adaptive preview consensus control for discrete-time nonlinear multi-agent systems with unknown control directions
Ren, Chang-E
Chen, C. L. Philip
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2020, 42 (15) : 2941 - 2950

← 1 2 3 4 5 →