Optimized Leader-Follower Consensus Control Using Reinforcement Learning for a Class of Second-Order Nonlinear Multiagent Systems

被引:46
作者
Wen, Guoxing [1 ]
Li, Bin [2 ]
机构
[1] Binzhou Univ, Coll Sci, Binzhou 256600, Shandong, Peoples R China
[2] Qilu Univ Technol, Sch Math & Stat, Shandong Acad Sci, Jinan 250353, Shandong, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2022年 / 52卷 / 09期
基金
中国国家自然科学基金;
关键词
Optimal control; Multi-agent systems; Artificial neural networks; Heuristic algorithms; Reinforcement learning; Consensus control; Topology; Double integrator dynamic; multiagent system; neural network (NN); optimal control; reinforcement learning (RL); unknown nonlinear dynamic; HJB EQUATION; NETWORKS;
D O I
10.1109/TSMC.2021.3130070
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, an optimized leader-follower consensus control is proposed for a class of second-order unknown nonlinear dynamical multiagent system. Different with the first-order multiagent consensus, the second-order case needs to achieve the agreement not only on position but also on velocity, therefore this optimized control is more challenging and interesting. To derive the control, reinforcement learning (RL) can be a natural consideration because it can overcome the difficulty of solving the Hamilton-Jacobi-Bellman (HJB) equation. To implement RL, it needs to iterate both adaptive critic and actor networks each other. However, if this optimized control learns RL from most existing optimal methods that derives the critic and actor adaptive laws from the negative gradient of square of the approximating function of the HJB equation, this control algorithm will be very intricate, because the HJB equation correlated to a second-order nonlinear multiagent system will become very complex due to strong state coupling and nonlinearity. In this work, since the two RL adaptive laws are derived via implementing the gradient descent method to a simple positive function, which is obtained on the basis of a partial derivative of the HJB equation, this optimized control is significantly simple. Meanwhile, it not merely can avoid the requirement of known dynamic acknowledge, but also can release the condition of persistent excitation, which is demanded in most RL optimization methods for training the adaptive parameter more sufficiently. Finally, the proposed control is demonstrated by both theory and computer simulation.
引用
收藏
页码:5546 / 5555
页数:10
相关论文
共 50 条
  • [31] Fuzzy Adaptive Optimized Leader-Following Formation Control for Second-Order Stochastic Multiagent Systems
    Li, Yongming
    Zhang, Jiaxin
    Tong, Shaocheng
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (09) : 6026 - 6037
  • [32] Leader-following consensus for a class of second-order nonlinear multi-agent systems
    Wang, Chuanrui
    Wang, Xinghu
    Ji, Haibo
    SYSTEMS & CONTROL LETTERS, 2016, 89 : 61 - 65
  • [33] Cluster Lag Consensus for Second-Order Multiagent Systems with Nonlinear Dynamics and Switching Topologies
    Wang, Yi
    Li, Yixiao
    Ma, Zhongjun
    Cai, Guoyong
    Chen, Guanrong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (06): : 2093 - 2100
  • [34] LQR-Based Optimal Leader-Follower Consensus of Second-Order Multi-agent Systems
    Li, Zonggang
    Zhang, Tongzhou
    Xie, Guangming
    PROCEEDINGS OF THE 2015 CHINESE INTELLIGENT SYSTEMS CONFERENCE, VOL 2, 2016, 360 : 353 - 361
  • [35] Optimized Backstepping Consensus Control Using Reinforcement Learning for a Class of Nonlinear Strict-Feedback-Dynamic Multi-Agent Systems
    Wen, Guoxing
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) : 1524 - 1536
  • [36] Adaptive fuzzy leader-follower consensus control using sliding mode mechanism for a class of high-order unknown nonlinear dynamic multi-agent systems
    Wen, Guoxing
    Dou, Hui
    Li, Bin
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (01) : 545 - 558
  • [37] Learning Consensus of Second-Order Unknown Nonlinear Parameterized Multiagent Systems With Periodic Disturbances
    Chen, Jiaxi
    Li, Junmin
    Chen, Weisheng
    Zhang, Shuai
    IEEE SYSTEMS JOURNAL, 2023, 17 (04): : 6357 - 6367
  • [38] Leader-follower output consensus of multiagent systems over finite fields q
    Yu, Miao
    Xia, Jianwei
    Feng, Jun-e
    Fu, Shihua
    Shen, Hao
    NEUROCOMPUTING, 2023, 550
  • [39] Adaptive Leader-Following Consensus for Second-Order Time-Varying Nonlinear Multiagent Systems
    Hua, Changchun
    You, Xiu
    Guan, Xinping
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (06) : 1532 - 1539
  • [40] Second-Order Consensus for Multiagent Systems via Intermittent Sampled Position Data Control
    Su, Housheng
    Liu, Yifan
    Zeng, Zhigang
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (05) : 2063 - 2072