Equilibria of Fully Decentralized Learning in Networked Systems

被引：0

作者：

Jiang, Yan ^{[1
]}

Cui, Wenqi ^{[1
]}

Zhang, Baosen ^{[1
]}

Cortes, Jorge ^{[2
]}

机构：

[1] Univ Washington, Dept Elect & Comp Engn, Seattle, WA 98195 USA

[2] Univ Calif San Diego, Dept Mech & Aerosp Engn, San Diego, CA 92093 USA

来源：

LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211 | 2023年 / 211卷

关键词：

Decentralized control; multi-agent learning; Nash equilibrium; noncooperative game;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Existing settings of decentralized learning either require players to have full information or the system to have certain special structure that may be hard to check and hinder their applicability to practical systems. To overcome this, we identify a structure that is simple to check for linear dynamical system, where each player learns in a fully decentralized fashion to minimize its cost. We first establish the existence of pure strategy Nash equilibria in the resulting noncooperative game. We then conjecture that the Nash equilibrium is unique provided that the system satisfies an additional requirement on its structure. We also introduce a decentralized mechanism based on projected gradient descent to have agents learn the Nash equilibrium. Simulations on a 5-player game validate our results.

引用

页数：13

共 35 条

[1] Distributed control of spatially invariant systems [J].

Bamieh, B ;

Paganini, F ;

Dahleh, MA .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2002, 47 (07) :1091-1107

[2]

Basar T., 1999, Dynamic noncooperative game theory

[3] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control [J].

Bazzan, Ana L. C. .

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) :342-375

[4] A survey of computational complexity results in systems and control [J].

Blondel, VD ;

Tsitsiklis, JN .

AUTOMATICA, 2000, 36 (09) :1249-1274

[5]

Boyd SP., 2004, Convex Optimization, DOI [10.1017/CBO9780511804441, DOI 10.1017/CBO9780511804441]

[6]

Bu JJ, 2019, Arxiv, DOI arXiv:1907.08921

[7] Decentralized safe reinforcement learning for inverter-based voltage control [J].

Cui, Wenqi ;

Li, Jiayi ;

Zhang, Baosen .

ELECTRIC POWER SYSTEMS RESEARCH, 2022, 211

[8] Equilibrium-Independent Stability Analysis for Distribution Systems With Lossy Transmission Lines [J].

Cui, Wenqi ;

Zhang, Baosen .

IEEE CONTROL SYSTEMS LETTERS, 2022, 6 :3349-3354

[9] On the Sample Complexity of the Linear Quadratic Regulator [J].

Dean, Sarah ;

Mania, Horia ;

Matni, Nikolai ;

Recht, Benjamin ;

Tu, Stephen .

FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2020, 20 (04) :633-679

[10]

Engwerda JC., 2005, LQ dynamic optimization and differential games

← 1 2 3 4 →