Two-Player Multiagent Graphical Games with Reinforcement Learning

被引:0
作者
Lian, Bosen [1 ]
Wu, Jiacheng [2 ]
机构
[1] Auburn Univ, Elect & Comp Engn Dept, Auburn, AL 36849 USA
[2] Zhejiang Univ, State Key Lab Ind Control Technol, Hangzhou 310007, Peoples R China
来源
2024 IEEE 7TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS 2024 | 2024年
关键词
multiagent systems; reinforcement learning; Nash; Minmax; OPTIMAL CONSENSUS; SYSTEMS;
D O I
10.1109/ICPS59941.2024.10640049
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper studies the synchronization problem of two-player multiagent systems through reinforcement learning methods. A Nash-minmax strategy is formulated, where the interactions of two players in the same agent are non-zerosum, while interactions of players between agents are zero-sum games. We propose an offline model-based reinforcement learning algorithm to identify Nash solutions for players within each agent, as well as the worst control solutions for players in neighboring antagonistic agents. On this basis, a data-driven off-policy algorithm is provided to alleviate the requirement for accurate system dynamics in the offline algorithm. Besides, the convergence of the proposed algorithms is analyzed. Finally, simulation results verify the effectiveness of the designed algorithms.
引用
收藏
页数:6
相关论文
共 21 条
[1]   Output Feedback Practical Coordinated Tracking of Uncertain Heterogeneous Multi-Agent Systems Under Switching Network Topology [J].
Back, Juhoon ;
Kim, Jung-Su .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (12) :6399-6406
[2]   Invariance of the n dimensional field [J].
Brouwer, LEJ .
MATHEMATISCHE ANNALEN, 1912, 72 :55-56
[3]   Off-policy learning for adaptive optimal output synchronization of heterogeneous multi-agent systems [J].
Chen, Ci ;
Lewis, Frank L. ;
Xie, Kan ;
Xie, Shengli ;
Liu, Yilu .
AUTOMATICA, 2020, 119
[4]  
Cheney Ward, 2013, ANAL APPL MATH, V208
[5]   Multiagent Graphical Games With Inverse Reinforcement Learning [J].
Donge, Vrushabh S. ;
Lian, Bosen ;
Lewis, Frank L. ;
Davoudi, Ali .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (02) :841-852
[6]   Leader-to-Formation Stability of Multiagent Systems: An Adaptive Optimal Control Approach [J].
Gao, Weinan ;
Jiang, Zhong-Ping ;
Lewis, Frank L. ;
Wang, Yebin .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2018, 63 (10) :3581-3587
[7]   Coordination of groups of mobile autonomous agents using nearest neighbor rules [J].
Jadbabaie, A ;
Lin, J ;
Morse, AS .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (06) :988-1001
[8]   Reinforcement learning and cooperative H? output regulation of linear continuous-time multi-agent systems [J].
Jiang, Yi ;
Gao, Weinan ;
Wu, Jin ;
Chai, Tianyou ;
Lewis, Frank L. .
AUTOMATICA, 2023, 148
[9]   Event-Based Finite-Time Control for Nonlinear Multiagent Systems With Asymptotic Tracking [J].
Li, Yongming ;
Li, Yuan-Xin ;
Tong, Shaocheng .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (06) :3790-3797
[10]   Online Learning of Minmax Solutions for Distributed Estimation and Tracking Control of Sensor Networks in Graphical Games [J].
Lian, Bosen ;
Lewis, Frank L. ;
Hewer, Gary A. ;
Estabridis, Katia ;
Chai, Tianyou .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2022, 9 (04) :1923-1936