Reinforcement Learning for Synchronization of Heterogeneous Multiagent Systems by Improved Q-Functions

被引:0
|
作者
Li, Jinna [1 ]
Yuan, Lin [1 ]
Cheng, Weiran [1 ]
Chai, Tianyou [2 ]
Lewis, Frank L. [3 ]
机构
[1] Liaoning Petrochem Univ, Sch Informat & Control Engn, Fushun 113001, Liaoning, Peoples R China
[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[3] Univ Texas Arlington, UTA Res Inst, Arlington, TX 76118 USA
基金
中国国家自然科学基金;
关键词
Synchronization; Protocols; Heuristic algorithms; Decision making; Nash equilibrium; Multi-agent systems; Games; Data-driven control; distributed control; multiagent systems (MASs); reinforcement learning (RL); synchronization;
D O I
10.1109/TCYB.2024.3440333
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article dedicates to investigating a methodology for enhancing adaptability to environmental changes of reinforcement learning (RL) techniques with data efficiency, by which a joint control protocol is learned using only data for multiagent systems (MASs). Thus, all followers are able to synchronize themselves with the leader and minimize their individual performance. To this end, an optimal synchronization problem of heterogeneous MASs is first formulated, and then an arbitration RL mechanism is developed for well addressing key challenges faced by the current RL techniques, that is, insufficient data and environmental changes. In the developed mechanism, an improved Q-function with an arbitration factor is designed for accommodating the fact that control protocols tend to be made by historic experiences and instinctive decision-making, such that the degree of control over agents' behaviors can be adaptively allocated by on-policy and off-policy RL techniques for the optimal multiagent synchronization problem. Finally, an arbitration RL algorithm with critic-only neural networks is proposed, and theoretical analysis and proofs of synchronization and performance optimality are provided. Simulation results verify the effectiveness of the proposed method.
引用
收藏
页码:6545 / 6558
页数:14
相关论文
共 50 条
  • [21] Safe reinforcement learning for cooperative tracking consensus problem of discrete-time multiagent systems with control barrier functions
    Liu, Shihan
    Yu, Zhen
    Gao, Dongxu
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2025,
  • [22] Reinforcement Learning With Task Decomposition for Cooperative Multiagent Systems
    Sun, Changyin
    Liu, Wenzhang
    Dong, Lu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 2054 - 2065
  • [23] General Second-Order Consensus of Discrete-Time Multiagent Systems via Q-Learning Method
    Liu, Yifan
    Su, Housheng
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (03): : 1417 - 1425
  • [24] Multiagent Reinforcement Learning With Heterogeneous Graph Attention Network
    Du, Wei
    Ding, Shifei
    Zhang, Chenglong
    Shi, Zhongzhi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 6851 - 6860
  • [25] Learning-Based Event-Triggered Control for Synchronization of Passive Multiagent Systems Under Attack
    Rahnama, Arash
    Antsaklis, Panos J.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (10) : 4170 - 4185
  • [26] Dynamic Event-Triggered Model-Free Reinforcement Learning for Cooperative Control of Multiagent Systems
    Wang, Ke
    Tang, Zhuo
    Mu, Chaoxu
    IEEE TRANSACTIONS ON RELIABILITY, 2024,
  • [27] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Ana L. C. Bazzan
    Autonomous Agents and Multi-Agent Systems, 2009, 18 : 342 - 375
  • [28] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Bazzan, Ana L. C.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) : 342 - 375
  • [29] SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multiagent Reinforcement Learning
    Yao, Xinghu
    Wen, Chao
    Wang, Yuhui
    Tan, Xiaoyang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) : 52 - 63
  • [30] Scheduling in Multiagent Systems Using Reinforcement Learning
    Minashina, I. K.
    Gorbachev, R. A.
    Zakharova, E. M.
    DOKLADY MATHEMATICS, 2022, 106 (SUPPL 1) : S70 - S78