Optimal Synchronization Control of Multiagent Systems With Input Saturation via Off-Policy Reinforcement Learning

被引:104
|
作者
Qin, Jiahu [1 ]
Li, Man [1 ]
Shi, Yang [2 ]
Ma, Qichao [1 ]
Zheng, Wei Xing [3 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China
[2] Univ Victoria, Dept Mech Engn, Victoria, BC V8W 2Y2, Canada
[3] Western Sydney Univ, Sch Comp Engn & Math, Sydney, NSW 2751, Australia
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
Input saturation; multiagent systems; neural networks (NNs); off-policy reinforcement learning (RL); optimal synchronization control; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; NETWORKS; GAMES;
D O I
10.1109/TNNLS.2018.2832025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we aim to investigate the optimal synchronization problem for a group of generic linear systems with input saturation. To seek the optimal controller, Hamilton Jacobi-Bellman (HJB) equations involving nonquadratic input energy terms in coupled forms are established. The solutions to these coupled HJB equations are further proven to be optimal and the induced controllers constitute interactive Nash equilibrium. Due to the difficulty to analytically solve HJB equations, especially in coupled forms, and the possible lack of model information of the systems, we apply the data-based off-policy reinforcement learning algorithm to learn the optimal control policies. A byproduct of this off-policy algorithm is shown that it is insensitive to probing noise that is exerted to the system to maintain persistence of excitation condition. In order to implement this off-policy algorithm, we employ actor and critic neural networks to approximate the controllers and the cost functions. Furthermore, the estimated control policies obtained by this presented implementation are proven to converge to the optimal ones under certain conditions. Finally, an illustrative example is provided to verify the effectiveness of the proposed algorithm.
引用
收藏
页码:85 / 96
页数:12
相关论文
共 50 条
  • [41] Distributed Optimal Tracking Control of Discrete-Time Multiagent Systems via Event-Triggered Reinforcement Learning
    Peng, Zhinan
    Luo, Rui
    Hu, Jiangping
    Shi, Kaibo
    Ghosh, Bijoy Kumar
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (09) : 3689 - 3700
  • [42] Traffic Signal Control Using End-to-End Off-Policy Deep Reinforcement Learning
    Chu, Kai-Fung
    Lam, Albert Y. S.
    Li, Victor O. K.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 7184 - 7195
  • [43] Adaptive optimal consensus of nonlinear multi-agent systems with unknown dynamics using off-policy integral reinforcement learning
    Yan, Lei
    Liu, Zhi
    Chen, C. L. Philip
    Zhang, Yun
    Wu, Zongze
    NEUROCOMPUTING, 2025, 621
  • [44] A Continuous Off-Policy Reinforcement Learning Scheme for Optimal Motion Planning in Simply-Connected Workspaces
    Rousseas, Panagiotis
    Bechlioulis, Charalampos P.
    Kyriakopoulos, Kostas J.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10247 - 10253
  • [45] Optimal Synchronization Control of Heterogeneous Asymmetric Input-Constrained Unknown Nonlinear MASs via Reinforcement Learning
    Xia, Lina
    Li, Qing
    Song, Ruizhuo
    Modares, Hamidreza
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (03) : 520 - 532
  • [46] Distributed Robust Global Containment Control of Second-Order Multiagent Systems With Input Saturation
    Fu, Junjie
    Wan, Ying
    Wen, Guanghui
    Huang, Tingwen
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2019, 6 (04): : 1426 - 1437
  • [47] Adaptive tracking control for multiagent systems with event-triggered communication and asymmetric input saturation
    Yang, Xiaoyu
    Cao, Liang
    Pan, Yingnan
    Zhou, Xiaoshuai
    Lu, Qing
    ASIAN JOURNAL OF CONTROL, 2023, 25 (06) : 4813 - 4824
  • [48] Prescribed performance adaptive event-triggered consensus control for multiagent systems with input saturation
    Yue, Xia
    Liu, Jiarui
    Chen, Kairui
    Zhang, Yuanqing
    Hu, Zikai
    FRONTIERS IN NEUROROBOTICS, 2023, 16
  • [49] Reinforcement Learning for Synchronization of Heterogeneous Multiagent Systems by Improved Q-Functions
    Li, Jinna
    Yuan, Lin
    Cheng, Weiran
    Chai, Tianyou
    Lewis, Frank L.
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (11) : 6545 - 6558
  • [50] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Ana L. C. Bazzan
    Autonomous Agents and Multi-Agent Systems, 2009, 18 : 342 - 375