Learning Nash in Constrained Markov Games With an α-Potential

Cited by: 1

Authors:

Das, Soham [1]

Eksin, Ceyhun [1]

Affiliations:

[1] Texas A&M Univ, Ind & Syst Engn Dept, College Stn, TX 77843 USA

Source:

IEEE CONTROL SYSTEMS LETTERS | 2024 / Vol. 8

Keywords:

Games; Stochastic processes; Picture archiving and communication systems; Kernel; Finite element analysis; Complexity theory; Vehicle dynamics; Game theory; constrained control; optimization; machine learning;

DOI:

10.1109/LCSYS.2024.3402132

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

We develop a best-response algorithm for solving constrained Markov games assuming limited violations for the potential game property. The limited violations of the potential game property mean that changes in value function due to unilateral policy alterations can be measured by the potential function up to an error α. We show the existence of a stationary ε-approximate constrained Nash policy whenever the set of feasible stationary policies is non-empty. Our setting has agents accessing an efficient probably approximately correct solver for a constrained Markov decision process, which they use for generating best-response policies against the other agents' former policies. For an accuracy threshold ε > 4α, the best-response dynamics generate provable convergence to an ε-Nash policy in finite time with probability at least 1 - δ, at the expense of polynomial bounds on sample complexity that scale with the reciprocal of ε and δ.
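The two central notions in the abstract can be written out as a hedged sketch. The notation below is assumed for illustration and is not taken from the paper: V_i denotes agent i's value function under a joint stationary policy, Φ the potential function, π_{-i} the policies of all agents other than i, and the feasible set the stationary policies satisfying agent i's constraints. Agents are assumed to maximize value; the paper's exact definitions (including the sign convention and how constraints enter) may differ.

```latex
% Assumed notation, not the paper's: V_i = value of agent i, \Phi = potential,
% \pi_{-i} = other agents' policies, \Pi_i^{\mathrm{feas}}(\pi_{-i}) = agent i's feasible policies.

% alpha-potential property: a unilateral change by agent i shifts V_i and \Phi
% by the same amount, up to an error alpha.
\[
\Bigl| \bigl( V_i(\pi_i', \pi_{-i}) - V_i(\pi_i, \pi_{-i}) \bigr)
     - \bigl( \Phi(\pi_i', \pi_{-i}) - \Phi(\pi_i, \pi_{-i}) \bigr) \Bigr|
\le \alpha
\qquad \text{for all } i,\ \pi_i,\ \pi_i',\ \pi_{-i}.
\]

% epsilon-approximate constrained Nash policy \pi^*: feasible, and no agent
% gains more than epsilon from a feasible unilateral deviation.
\[
V_i(\pi_i^*, \pi_{-i}^*) \;\ge\; V_i(\pi_i, \pi_{-i}^*) - \epsilon
\qquad \text{for all } i \text{ and all } \pi_i \in \Pi_i^{\mathrm{feas}}(\pi_{-i}^*).
\]
```

Under this reading, an exact potential game corresponds to alpha equal to zero, and the abstract's requirement that the accuracy threshold exceed four times alpha ties the achievable Nash accuracy to how far the game departs from an exact potential game.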

Pages: 808-813

Page count: 6
