Adaptive optimal consensus of nonlinear multi-agent systems with unknown dynamics using off-policy integral reinforcement learning

被引:0
作者
Yan, Lei [1 ]
Liu, Zhi [2 ,4 ]
Chen, C. L. Philip [3 ]
Zhang, Yun [2 ]
Wu, Zongze [2 ]
机构
[1] Nanyang Inst Technol, Sch Intelligent Mfg, Nanyang 473004, Henan, Peoples R China
[2] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R China
[3] South China Univ Technol, Fac Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
[4] Pazhou Lab, Guangzhou 510006, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive optimal consensus; Integral reinforcement learning; Off-policy; Completely unknown dynamics; TRACKING CONTROL; GAMES;
D O I
10.1016/j.neucom.2024.129185
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) has been identified as a promising approach for developing adaptive optimal consensus schemes for high-order strict-feedback nonlinear multi-agent systems (MASs). However, existing methods have limitations, as they can only be applied to systems with partially unknown dynamics and require an identifier-actor-critic framework. This paper proposes a novel approach that combines classical backstepping techniques and off-policy integral reinforcement learning (IRL) to circumvent these limitations and develop an adaptive optimal consensus scheme for nonlinear MASs with completely unknown dynamics. Specifically, we introduce an off-policy IRL-based adaptive optimal consensus scheme that can obtain optimal control inputs without knowledge of the system dynamics. The algorithm utilizes the actor-critic structure and updates the weight vectors using only one learning rule in each step based on the collected system trajectory data. We have proven that the optimal consensus is achieved, and the estimation errors of the optimal weight vectors are uniformly ultimately bounded (UUB). Finally, we present a simulation example to validate the effectiveness of the proposed approach.
引用
收藏
页数:10
相关论文
共 45 条
  • [11] Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics
    Li, Hongliang
    Liu, Derong
    Wang, Ding
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 11 (03) : 706 - 714
  • [12] Fuzzy Adaptive Optimal Consensus Fault-Tolerant Control for Stochastic Nonlinear Multiagent Systems
    Li, Kewen
    Li, Yongming
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (08) : 2870 - 2885
  • [13] An Observer-Based Fuzzy Adaptive Consensus Control Method for Nonlinear Multiagent Systems
    Li, Yongming
    Li, Kewen
    Tong, Shaocheng
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (11) : 4667 - 4678
  • [14] Fuzzy Adaptive Optimized Leader-Following Formation Control for Second-Order Stochastic Multiagent Systems
    Li, Yongming
    Zhang, Jiaxin
    Tong, Shaocheng
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (09) : 6026 - 6037
  • [15] Event-Triggered Guarantee Cost Control for Partially Unknown Stochastic Systems via Explorized Integral Reinforcement Learning Strategy
    Liang, Yuling
    Zhang, Huaguang
    Zhang, Juan
    Ming, Zhongyang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7830 - 7844
  • [16] Adaptive Neural Design of Consensus Controllers for Nonlinear Multiagent Systems Under Switching Topologies
    Lu, Kaixin
    Liu, Zhi
    Wang, Yaonan
    Chen, C. L. Philip
    Zhang, Yun
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (01): : 309 - 320
  • [17] Adaptive Consensus Tracking Control of Uncertain Nonlinear Multiagent Systems With Predefined Accuracy
    Lu, Kaixin
    Liu, Zhi
    Lai, Guanyu
    Chen, C. L. Philip
    Zhang, Yun
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (01) : 405 - 415
  • [18] Off-Policy Reinforcement Learning for H∞ Control Design
    Luo, Biao
    Wu, Huai-Ning
    Huang, Tingwen
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (01) : 65 - 76
  • [19] Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning
    Modares, Hamidreza
    Lewis, Frank L.
    [J]. AUTOMATICA, 2014, 50 (07) : 1780 - 1792
  • [20] Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
    Modares, Hamidreza
    Lewis, Frank L.
    Naghibi-Sistani, Mohammad-Bagher
    [J]. AUTOMATICA, 2014, 50 (01) : 193 - 202