Optimal Synchronization Control of Multiagent Systems With Input Saturation via Off-Policy Reinforcement Learning

被引：104

作者：

Qin, Jiahu ^{[1
]}

Li, Man ^{[1
]}

Shi, Yang ^{[2
]}

Ma, Qichao ^{[1
]}

Zheng, Wei Xing ^{[3
]}

机构：

[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China

[2] Univ Victoria, Dept Mech Engn, Victoria, BC V8W 2Y2, Canada

[3] Western Sydney Univ, Sch Comp Engn & Math, Sydney, NSW 2751, Australia

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2019年 / 30卷 / 01期

基金：

澳大利亚研究理事会; 中国国家自然科学基金;

关键词：

Input saturation; multiagent systems; neural networks (NNs); off-policy reinforcement learning (RL); optimal synchronization control; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; NETWORKS; GAMES;

D O I：

10.1109/TNNLS.2018.2832025

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we aim to investigate the optimal synchronization problem for a group of generic linear systems with input saturation. To seek the optimal controller, Hamilton Jacobi-Bellman (HJB) equations involving nonquadratic input energy terms in coupled forms are established. The solutions to these coupled HJB equations are further proven to be optimal and the induced controllers constitute interactive Nash equilibrium. Due to the difficulty to analytically solve HJB equations, especially in coupled forms, and the possible lack of model information of the systems, we apply the data-based off-policy reinforcement learning algorithm to learn the optimal control policies. A byproduct of this off-policy algorithm is shown that it is insensitive to probing noise that is exerted to the system to maintain persistence of excitation condition. In order to implement this off-policy algorithm, we employ actor and critic neural networks to approximate the controllers and the cost functions. Furthermore, the estimated control policies obtained by this presented implementation are proven to converge to the optimal ones under certain conditions. Finally, an illustrative example is provided to verify the effectiveness of the proposed algorithm.

引用

页码：85 / 96

页数：12

共 50 条

[41] Distributed Optimal Tracking Control of Discrete-Time Multiagent Systems via Event-Triggered Reinforcement Learning
Peng, Zhinan
Luo, Rui
Hu, Jiangping
Shi, Kaibo
Ghosh, Bijoy Kumar
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (09) : 3689 - 3700
[42] Traffic Signal Control Using End-to-End Off-Policy Deep Reinforcement Learning
Chu, Kai-Fung
Lam, Albert Y. S.
Li, Victor O. K.
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 7184 - 7195
[43] Adaptive optimal consensus of nonlinear multi-agent systems with unknown dynamics using off-policy integral reinforcement learning
Yan, Lei
Liu, Zhi
Chen, C. L. Philip
Zhang, Yun
Wu, Zongze
NEUROCOMPUTING, 2025, 621
[44] A Continuous Off-Policy Reinforcement Learning Scheme for Optimal Motion Planning in Simply-Connected Workspaces
Rousseas, Panagiotis
Bechlioulis, Charalampos P.
Kyriakopoulos, Kostas J.
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10247 - 10253
[45] Optimal Synchronization Control of Heterogeneous Asymmetric Input-Constrained Unknown Nonlinear MASs via Reinforcement Learning
Xia, Lina
Li, Qing
Song, Ruizhuo
Modares, Hamidreza
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (03) : 520 - 532
[46] Distributed Robust Global Containment Control of Second-Order Multiagent Systems With Input Saturation
Fu, Junjie
Wan, Ying
Wen, Guanghui
Huang, Tingwen
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2019, 6 (04): : 1426 - 1437
[47] Adaptive tracking control for multiagent systems with event-triggered communication and asymmetric input saturation
Yang, Xiaoyu
Cao, Liang
Pan, Yingnan
Zhou, Xiaoshuai
Lu, Qing
ASIAN JOURNAL OF CONTROL, 2023, 25 (06) : 4813 - 4824
[48] Prescribed performance adaptive event-triggered consensus control for multiagent systems with input saturation
Yue, Xia
Liu, Jiarui
Chen, Kairui
Zhang, Yuanqing
Hu, Zikai
FRONTIERS IN NEUROROBOTICS, 2023, 16
[49] Reinforcement Learning for Synchronization of Heterogeneous Multiagent Systems by Improved Q-Functions
Li, Jinna
Yuan, Lin
Cheng, Weiran
Chai, Tianyou
Lewis, Frank L.
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (11) : 6545 - 6558
[50] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
Ana L. C. Bazzan
Autonomous Agents and Multi-Agent Systems, 2009, 18 : 342 - 375

← 1 2 3 4 5 →