Reinforcement Learning for Synchronization of Heterogeneous Multiagent Systems by Improved Q-Functions

被引：0

作者：

Li, Jinna ^{[1
]}

Yuan, Lin ^{[1
]}

Cheng, Weiran ^{[1
]}

Chai, Tianyou ^{[2
]}

Lewis, Frank L. ^{[3
]}

机构：

[1] Liaoning Petrochem Univ, Sch Informat & Control Engn, Fushun 113001, Liaoning, Peoples R China

[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China

[3] Univ Texas Arlington, UTA Res Inst, Arlington, TX 76118 USA

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2024年 / 54卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Synchronization; Protocols; Heuristic algorithms; Decision making; Nash equilibrium; Multi-agent systems; Games; Data-driven control; distributed control; multiagent systems (MASs); reinforcement learning (RL); synchronization;

D O I：

10.1109/TCYB.2024.3440333

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article dedicates to investigating a methodology for enhancing adaptability to environmental changes of reinforcement learning (RL) techniques with data efficiency, by which a joint control protocol is learned using only data for multiagent systems (MASs). Thus, all followers are able to synchronize themselves with the leader and minimize their individual performance. To this end, an optimal synchronization problem of heterogeneous MASs is first formulated, and then an arbitration RL mechanism is developed for well addressing key challenges faced by the current RL techniques, that is, insufficient data and environmental changes. In the developed mechanism, an improved Q-function with an arbitration factor is designed for accommodating the fact that control protocols tend to be made by historic experiences and instinctive decision-making, such that the degree of control over agents' behaviors can be adaptively allocated by on-policy and off-policy RL techniques for the optimal multiagent synchronization problem. Finally, an arbitration RL algorithm with critic-only neural networks is proposed, and theoretical analysis and proofs of synchronization and performance optimality are provided. Simulation results verify the effectiveness of the proposed method.

引用

页码：6545 / 6558

页数：14

共 50 条

[41] Model-Free Reinforcement Learning for Fully Cooperative Consensus Problem of Nonlinear Multiagent Systems
Wang, Hong
Li, Man
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (04) : 1482 - 1491
[42] Beyond Reinforcement Learning and Local View in Multiagent Systems
Bazzan, Ana L. C.
KUNSTLICHE INTELLIGENZ, 2014, 28 (03): : 179 - 189
[43] An Evolutionary Transfer Reinforcement Learning Framework for Multiagent Systems
Hou, Yaqing
Ong, Yew-Soon
Feng, Liang
Zurada, Jacek M.
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2017, 21 (04) : 601 - 615
[44] Prior Knowledge-Augmented Broad Reinforcement Learning Framework for Fault Diagnosis of Heterogeneous Multiagent Systems
Guo, Li
Ren, Yiran
Li, Runze
Jiang, Bin
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (01) : 115 - 123
[45] Distributed Multiagent Reinforcement Learning Based on Graph-Induced Local Value Functions
Jing, Gangshan
Bai, He
George, Jemin
Chakrabortty, Aranya
Sharma, Piyush K.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (10) : 6636 - 6651
[46] Synchronization of Heterogeneous Multi-Agent Systems by Adaptive Iterative Learning Control
Yang, Shiping
Xu, Jian-Xin
Huang, Deqing
Tan, Ying
ASIAN JOURNAL OF CONTROL, 2015, 17 (06) : 2091 - 2104
[47] Prescribed Performance Fault-Tolerant Control for Synchronization of Heterogeneous Nonlinear MASs Using Reinforcement Learning
Liu, Donghao
Mao, Zehui
Jiang, Bin
Yan, Xing-Gang
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (09) : 5451 - 5462
[48] Output Synchronization of Heterogeneous Multiagent Systems With Resilience to Link and Actuator Attacks: A Fully Distributed Event-Triggered Mechanism
Yang, Yang
Qi, Chang
Qian, Yue
Li, Yanfei
Deng, Chao
Zhang, Tengfei
Yue, Dong
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (04): : 1695 - 1706
[49] A Proactive Eavesdropping Game in MIMO Systems Based on Multiagent Deep Reinforcement Learning
Guo, Delin
Ding, Hui
Tang, Lan
Zhang, Xinggan
Yang, Lvxi
Liang, Ying-Chang
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (11) : 8889 - 8904
[50] Adaptive Individual Q-Learning-A Multiagent Reinforcement Learning Method for Coordination Optimization
Zhang, Zhen
Wang, Dongqing
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12

← 1 2 3 4 5 →