Optimal Synchronization Control of Multiagent Systems With Input Saturation via Off-Policy Reinforcement Learning

被引:104
|
作者
Qin, Jiahu [1 ]
Li, Man [1 ]
Shi, Yang [2 ]
Ma, Qichao [1 ]
Zheng, Wei Xing [3 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China
[2] Univ Victoria, Dept Mech Engn, Victoria, BC V8W 2Y2, Canada
[3] Western Sydney Univ, Sch Comp Engn & Math, Sydney, NSW 2751, Australia
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
Input saturation; multiagent systems; neural networks (NNs); off-policy reinforcement learning (RL); optimal synchronization control; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; NETWORKS; GAMES;
D O I
10.1109/TNNLS.2018.2832025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we aim to investigate the optimal synchronization problem for a group of generic linear systems with input saturation. To seek the optimal controller, Hamilton Jacobi-Bellman (HJB) equations involving nonquadratic input energy terms in coupled forms are established. The solutions to these coupled HJB equations are further proven to be optimal and the induced controllers constitute interactive Nash equilibrium. Due to the difficulty to analytically solve HJB equations, especially in coupled forms, and the possible lack of model information of the systems, we apply the data-based off-policy reinforcement learning algorithm to learn the optimal control policies. A byproduct of this off-policy algorithm is shown that it is insensitive to probing noise that is exerted to the system to maintain persistence of excitation condition. In order to implement this off-policy algorithm, we employ actor and critic neural networks to approximate the controllers and the cost functions. Furthermore, the estimated control policies obtained by this presented implementation are proven to converge to the optimal ones under certain conditions. Finally, an illustrative example is provided to verify the effectiveness of the proposed algorithm.
引用
收藏
页码:85 / 96
页数:12
相关论文
共 50 条
  • [31] Observer-Based Robust Coordinated Control of Multiagent Systems With Input Saturation
    Wang, Xiaoling
    Su, Housheng
    Chen, Michael Z. Q.
    Wang, Xiaofan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (05) : 1933 - 1946
  • [32] Passivity-based state synchronization of homogeneous multiagent systems via static protocol in the presence of input saturation
    Liu, Zhenwei
    Saberi, Ali
    Stoorvogel, Anton A.
    Zhang, Meirong
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2018, 28 (07) : 2720 - 2741
  • [33] H∞ Tracking learning control for discrete-time Markov jump systems: A parallel off-policy reinforcement learning
    Zhang, Xuewen
    Xia, Jianwei
    Wang, Jing
    Chen, Xiangyong
    Shen, Hao
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (18): : 14878 - 14890
  • [34] Observer-Based Event-Triggered Optimal Control for Nonlinear Multiagent Systems With Input Delay via Reinforcement Learning Strategy
    Wang, Xin
    Liao, Yujie
    Tan, Lihua
    Zhang, Wei
    Li, Huaqing
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [35] Safe Off-Policy Deep Reinforcement Learning Algorithm for Volt-VAR Control in Power Distribution Systems
    Wang, Wei
    Yu, Nanpeng
    Gao, Yuanqi
    Shi, Jie
    IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (04) : 3008 - 3018
  • [36] Off-policy integral reinforcement learning-based optimal tracking control for a class of nonzero-sum game systems with unknown dynamics
    Zhao, Jin-Gang
    Chen, Fang-Fang
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2022, 43 (06) : 1623 - 1644
  • [37] Reinforcement Learning-Based Event-Triggered Optimal Control of Power Systems With Control Input Saturation
    Gu, Zhou
    Cao, Ruiyan
    Tian, Engang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2025, 21 (02) : 1528 - 1536
  • [38] Flexible prescribed performance control for multiagent systems under DoS attacks and input saturation
    Peng, Wenjun
    Liu, Zhi
    Chen, C. L. Philip
    Wu, Zongze
    JOURNAL OF THE FRANKLIN INSTITUTE, 2025, 362 (05)
  • [39] Fuzzy reinforcement learning based control of linear systems with input saturation
    Liu, Kainan
    Ban, Xiaojun
    Xie, Shengkun
    ISA TRANSACTIONS, 2025, 158 : 405 - 414
  • [40] Self-triggered control of heterogeneous multiagent systems with input saturation
    Du, Shengli
    Wu, Di
    Gao, Yongfeng
    Li, Xu
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 6812 - 6817