Learning Nash in Constrained Markov Games With an α-Potential

被引:1
作者
Das, Soham [1 ]
Eksin, Ceyhun [1 ]
机构
[1] Texas A&M Univ, Ind & Syst Engn Dept, College Stn, TX 77843 USA
来源
IEEE CONTROL SYSTEMS LETTERS | 2024年 / 8卷
关键词
Games; Stochastic processes; Picture archiving and communication systems; Kernel; Finite element analysis; Complexity theory; Vehicle dynamics; Game theory; constrained control; optimization; machine learning;
D O I
10.1109/LCSYS.2024.3402132
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We develop a best-response algorithm forsolving constrained Markov games assuming limited viola-tions for the potential game property. The limited violationsof the potential game property mean that changes invalue function due to unilateral policy alterations can bemeasured by the potential function up to an error alpha.We show the existence of stationary is an element of-approximate con-strained Nash policy whenever the set of feasible stationarypolicies is non-empty. Our setting has agents accessingan efficient probably approximately correct solver for aconstrained Markov decision process which they use forgenerating best-response policies against the other agents'former policies. For an accuracy threshold is an element of>4 alpha, thebest-response dynamics generate provable convergence to is an element of-Nash policy in finite time with probability at least 1-delta atthe expense of polynomial bounds on sample complexitythat scales with the reciprocal of is an element of and delta
引用
收藏
页码:808 / 813
页数:6
相关论文
共 20 条
[1]  
Alatur P, 2023, Arxiv, DOI arXiv:2306.07749
[2]  
Altman E, 2000, ANN INT SOC DYN GAME, V5, P213
[3]  
Bai QB, 2022, AAAI CONF ARTIF INTE, P3682
[4]  
Cayci S, 2024, Arxiv, DOI arXiv:2106.04096
[5]  
Daskalakis Constantinos, 2020, Advances in Neural Information Processing Systems, V33
[6]  
Ding D., 2022, PR MACH LEARN RES
[7]   STATIONARY MARKOV NASH EQUILIBRIA FOR NONZERO-SUM CONSTRAINED ARAT MARKOV GAMES [J].
Dufour, Francois ;
Prieto-Rumeau, Tomas .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2022, 60 (02) :945-967
[8]  
Guo X, 2025, Arxiv, DOI arXiv:2403.16962
[9]  
Guo X, 2025, Arxiv, DOI arXiv:2305.12553
[10]   On Approximate and Weak Correlated Equilibria in Constrained Discounted Stochastic Games [J].
Jaskiewicz, Anna ;
Nowak, Andrzej S. S. .
APPLIED MATHEMATICS AND OPTIMIZATION, 2023, 87 (02)