Asynchronous iterative Q-learning based tracking control for nonlinear discrete-time multi-agent systems

Cited by: 0

Authors:

Shen, Ziwen [1]

Dong, Tao [1]

Huang, Tingwen [2]

Affiliations:

[1] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China

[2] Shenzhen Univ Adv Technol, Fac Comp Sci & Control Engn, Shenzhen 518055, Peoples R China

Source:

NEURAL NETWORKS | 2024 / Vol. 180

Keywords:

Multi-agent; Discrete-time; Asynchronous iterative Q-learning; Tracking control; OPTIMAL CONSENSUS CONTROL;

DOI:

10.1016/j.neunet.2024.106667

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper addresses the tracking control problem of nonlinear discrete-time multi-agent systems (MASs). First, a local neighborhood error system (LNES) is constructed. Then, a novel tracking algorithm based on asynchronous iterative Q-learning (AIQL) is developed, which transforms the tracking problem into the optimal regulation of the LNES. The AIQL-based algorithm maintains two Q values, Q(i)(A) and Q(i)(B), for each agent i, where Q(i)(A) is used to improve the control policy and Q(i)(B) is used to evaluate the value of the control policy. Moreover, a convergence analysis of the LNES is given: it is shown that the LNES converges to 0, so the tracking problem is solved. A neural network-based actor-critic framework is used to implement AIQL. The critic network of AIQL is composed of two neural networks, which approximate Q(i)(A) and Q(i)(B), respectively. Finally, simulation results are given to verify the performance of the developed algorithm. They show that the AIQL-based tracking algorithm achieves a lower cost value and faster convergence than the IQL-based tracking algorithm.
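The abstract does not give the AIQL update rules, so the following is only a toy sketch of the general idea of keeping two Q tables with separate roles. It assumes a double-Q-style alternation (one table selects the greedy control, the other evaluates it), a scalar error on a three-point grid, and a quadratic utility. The names `next_error`, `utility`, `greedy`, and `two_table_iteration` are hypothetical and are not taken from the paper, which uses neural-network critics rather than tables.

```python
# Toy illustration only (assumed setup, not the paper's algorithm):
# scalar error e in {0, 1, 2}, control u in {0, 1}.
ERRORS = [0, 1, 2]
CONTROLS = [0, 1]


def next_error(e, u):
    """Assumed error dynamics: the control reduces the error, floored at 0."""
    return max(e - u, 0)


def utility(e, u):
    """Assumed quadratic stage cost on error and control effort."""
    return e * e + u * u


def greedy(q, e):
    """Control that minimizes q at error e (ties go to the first control)."""
    return min(CONTROLS, key=lambda u: q[e][u])


def two_table_iteration(num_iters=20):
    """Alternate two Q tables: one picks the control, the other scores it."""
    q_a = [[0 for _ in CONTROLS] for _ in ERRORS]
    q_b = [[0 for _ in CONTROLS] for _ in ERRORS]
    for _ in range(num_iters):
        # Update Q_A: greedy control chosen by the old Q_A, valued by Q_B.
        q_a = [[utility(e, u)
                + q_b[next_error(e, u)][greedy(q_a, next_error(e, u))]
                for u in CONTROLS] for e in ERRORS]
        # Update Q_B: greedy control chosen by the old Q_B, valued by new Q_A.
        q_b = [[utility(e, u)
                + q_a[next_error(e, u)][greedy(q_b, next_error(e, u))]
                for u in CONTROLS] for e in ERRORS]
    return q_a, q_b


q_a, q_b = two_table_iteration()
policy = [greedy(q_a, e) for e in ERRORS]
print(q_a)     # [[0, 1], [3, 2], [11, 7]]
print(policy)  # [0, 1, 1]
```

In this toy problem both tables settle on the same values within a few sweeps, and the resulting policy drives any nonzero error to 0 and then applies no control. This says nothing about the convergence rate or cost of the paper's method, which is analyzed there for nonlinear multi-agent dynamics.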

Pages: 13
