Cooperative Control and Potential Games

被引：375

作者：

Marden, Jason R. ^{[1
]}

Arslan, Guerdal ^{[2
]}

Shamma, Jeff S. ^{[3
]}

机构：

[1] CALTECH, Social & Informat Sci Lab, Pasadena, CA 91125 USA

[2] Univ Hawaii, Dept Elect Engn, Honolulu, HI 96822 USA

[3] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2009年 / 39卷 / 06期

基金：

美国国家科学基金会;

关键词：

Cooperative control; game theory; learning in games; multi-agent systems; FICTITIOUS PLAY; CONSENSUS; AGENTS;

D O I：

10.1109/TSMCB.2009.2017273

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present a view of cooperative control using the language of learning in games. We review the game-theoretic concepts of potential and weakly acyclic games, and demonstrate how several cooperative control problems, such as consensus and dynamic sensor coverage, can be formulated in these settings. Motivated by this connection, we build upon game-theoretic concepts to better accommodate a broader class of cooperative control problems. In particular, we extend existing learning algorithms to accommodate restricted action sets caused by the limitations of agent capabilities and group-based decision making. Furthermore, we also introduce a new class of games called sometimes weakly acyclic games for time-varying objective functions and action sets, and provide distributed algorithms for convergence to an equilibrium.

引用

页码：1393 / 1407

页数：15

共 29 条

[21]

Shamma J., 2008, Cooperative Control of Distributed Multi-Agent Systems

[22] STOCHASTIC GAMES [J].

SHAPLEY, LS .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1953, 39 (10) :1095-1100

[23]

Shoham Y, 2009, MULTIAGENT SYSTEMS: ALGORITHMIC, GAME-THEORETIC, AND LOGICAL FOUNDATIONS, P1

[24] DISTRIBUTED ASYNCHRONOUS DETERMINISTIC AND STOCHASTIC GRADIENT OPTIMIZATION ALGORITHMS [J].

TSITSIKLIS, JN ;

BERTSEKAS, DP ;

ATHANS, M .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1986, 31 (09) :803-812

[25]

Wolpert D.H., 1999, Handbook of Agent technology

[26]

Xiao L, 2005, 2005 FOURTH INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING IN SENSOR NETWORKS, P63

[27] Fast linear iterations for distributed averaging [J].

Xiao, L ;

Boyd, S .

SYSTEMS & CONTROL LETTERS, 2004, 53 (01) :65-78

[28]

Young H.P., 2005, Strategic Learning and Its Limits

[29]

Young H.P., 1998, Individual Strategy and Social Structure: An Evolutionary Theory of Institutions

← 1 2 3 →