Multi-Agent Differential Graphical Games: Nash Online Adaptive Learning Solutions

被引:0
作者
Abouheaf, Mohammed I. [1 ]
Lewis, Frank L. [1 ]
机构
[1] Univ Texas Arlington, Res Inst, Arlington, TX 76019 USA
来源
2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC) | 2013年
关键词
Critic network structures; graphical games; integral reinforcement learning; optimal control; COOPERATIVE CONTROL; CONSENSUS; SYNCHRONIZATION; NETWORKS; SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies a class of multi-agent graphical games denoted by differential graphical games, where interactions between agents are prescribed by a communication graph structure. Ideas from cooperative control are given to achieve synchronization among the agents to a leader dynamics. New coupled Bellman and Hamilton-Jacobi-Bellman equations are developed for this class of games using Integral Reinforcement Learning. Nash solutions are given in terms of solutions to a set of coupled continuous-time Hamilton-Jacobi-Bellman equations. A multi-agent policy iteration algorithm is given to learn the Nash solution in real time without knowing the complete dynamic models of the agents. A proof of convergence for this algorithm is given. An online multi-agent method based on policy iterations is developed using a critic network to solve all the Hamilton-Jacobi-Bellman equations simultaneously for the graphical game.
引用
收藏
页码:5803 / 5809
页数:7
相关论文
共 50 条
[31]   Neuro-adaptive control for searching generalized Nash equilibrium of multi-agent games: A two-stage design approach [J].
Meng, Qing ;
Nian, Xiaohong ;
Chen, Yong ;
Chen, Zhao .
NEUROCOMPUTING, 2023, 530 :69-80
[32]   Graphical Minimax Game and On-Policy Reinforcement Learning for Consensus of Leaderless Multi-Agent Systems [J].
Dong, Wei ;
Wang, Chunyan ;
Li, Jinna ;
Wang, Jianan .
2020 IEEE 16TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2020, :606-611
[33]   Nash Equilibrium Controllability of Linear-Quadratic Differential Graphical Games [J].
Wang, Zili ;
Yang, Hao ;
Jiang, Bin .
2024 43RD CHINESE CONTROL CONFERENCE, CCC 2024, 2024, :105-112
[34]   Nash Equilibria in Multi-Agent Motor Interactions [J].
Braun, Daniel A. ;
Ortega, Pedro A. ;
Wolpert, Daniel M. .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (08)
[35]   Distributed Nash equilibrium searching for multi-agent games under false data injection attacks [J].
Lv, Yixuan ;
Liu, Yan-Jun ;
Liu, Lei ;
Yu, Dengxiu ;
Chen, Yang .
NEUROCOMPUTING, 2024, 570
[36]   Online Learning Cooperative Control for Heterogeneous Multi-Agent Systems [J].
Zhu, Xiaoxia ;
Dong, Lu .
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, :3500-3505
[37]   Asynchronous Online Learning in Multi-Agent Systems With Proximity Constraints [J].
Bedi, Amrit Singh ;
Koppel, Alec ;
Rajawat, Ketan .
IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2019, 5 (03) :479-494
[38]   Conservative Multi-agent Online Kernel Learning in Heterogeneous Networks [J].
Pradhan, Hrusikesha ;
Bedi, Amrit Singh ;
Koppel, Alec ;
Rajawat, Ketan .
2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, :53-57
[39]   Multi-Agent Flag Coordination Games [J].
Marzagao, David Kohan ;
Rivera, Nicolas ;
Cooper, Colin ;
McBurney, Peter ;
Steinhofel, Kathleen .
AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, :1442-1450
[40]   Distributed adaptive Nash equilibrium seeking over multi-agent networks with communication uncertainties [J].
Fang, Xiao ;
Wen, Guanghui ;
Zhou, Jialing ;
Zheng, Wei Xing .
2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, :3387-3392