Asynchronous iterative Q-learning based tracking control for nonlinear discrete-time multi-agent systems

被引：0

作者：

Shen, Ziwen ^{[1
]}

Dong, Tao ^{[1
]}

Huang, Tingwen ^{[2
]}

机构：

[1] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China

[2] Shenzhen Univ Adv Technol, Fac Comp Sci & Control Engn, Shenzhen 518055, Peoples R China

来源：

NEURAL NETWORKS | 2024年 / 180卷

关键词：

Multi-agent; Discrete-time; Asynchronous iterative Q-learning; Tracking control; OPTIMAL CONSENSUS CONTROL;

D O I：

10.1016/j.neunet.2024.106667

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the tracking control problem of nonlinear discrete-time multi-agent systems (MASs). First, a local neighborhood error system (LNES) is constructed. Then, a novel tracking algorithm based on asynchronous iterative Q-learning (AIQL) is developed, which can transform the tracking problem into the optimal regulation of LNES. The AIQL-based algorithm has two Q values Q(i)(A) and Q(i)(B) for each agent i , where Q(i)(A) is used for improving the control policy and Q(i)(B) is used for evaluating the value of the control policy. Moreover, the convergence of LNES is given. It is shown that the LNES converges to 0 and the tracking problem is solved. A neural network-based actor-critic framework is used to implement AIQL. The critic network of AIQL is composed of two neural networks, which are used for approximating Q(i)(A) and Q(i)(B) respectively. Finally, simulation results are given to verify the performance of the developed algorithm. It is shown that the AIQLbased tracking algorithm has a lower cost value and faster convergence speed than the IQL-based tracking algorithm.

引用

页数：13

共 50 条

[21] Cooperative adaptive optimal output regulation of nonlinear discrete-time multi-agent systems
Jiang, Yi
Fan, Jialu
Gao, Weinan
Chai, Tianyou
Lewis, Frank L.
AUTOMATICA, 2020, 121
[22] Consensus of a class of discrete-time nonlinear multi-agent systems in the presence of communication delays
Hu, Haiyun
Lin, Zongli
ISA TRANSACTIONS, 2017, 71 : 10 - 20
[23] Distributed adaptive containment control for a class of discrete-time nonlinear multi-agent systems with unknown parameters and control gains
Li, Nannan
Fei, Qing
Ma, Hongbin
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2020, 357 (13): : 8566 - 8590
[24] Consensus of discrete-time multi-agent systems with adversaries and time delays
Wu, Yiming
He, Xiongxiong
Liu, Shuai
Xie, Lihua
INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2014, 43 (3-4) : 402 - 411
[25] Cooperative containment of discrete-time linear multi-agent systems
Ma, Qian
Lewis, Frank L.
Xu, Shengyuan
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2015, 25 (07) : 1007 - 1018
[26] Weight Conditions for Consensus of Discrete-Time Multi-Agent Systems
Zhang Ya
Tian Yu-Ping
2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 4841 - 4845
[27] Equilibrium of consensus problems for discrete-time multi-agent systems
Li, Jun-Bing
Yan, Wei-Sheng
Fang, Xin-Peng
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2013, 30 (04): : 513 - 518
[28] Leader-Following Consensus of Discrete-Time Nonlinear Multi-Agent Systems with Asymmetric Saturation Impulsive Control
Yuan, Qiao
Chen, Guorong
Tian, Yuan
Yuan, Yu
Zhang, Qian
Wang, Xiaonan
Liu, Jingcheng
MATHEMATICS, 2024, 12 (03)
[29] Q-learning solution for optimal consensus control of discrete-time multiagent systems using reinforcement learning
Mu, Chaoxu
Zhao, Qian
Gao, Zhongke
Sun, Changyin
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2019, 356 (13): : 6946 - 6967
[30] Analysis of nonlinear discrete-time systems with higher-order iterative learning control
Sun, MX
Wang, DW
DYNAMICS AND CONTROL, 2001, 11 (01) : 81 - 96

← 1 2 3 4 5 →