Coordinating actions in congestion games: impact of top-down and bottom-up utilities

被引：4

作者：

Tumer, Kagan ^{[1
]}

Proper, Scott ^{[1
]}

机构：

[1] Oregon State Univ, Corvallis, OR 97331 USA

来源：

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS | 2013年 / 27卷 / 03期

基金：

美国国家科学基金会;

关键词：

Multiagent; Reinforcement learning; Coordination; Congestion games; MINORITY GAME; MULTIAGENT; MODEL;

D O I：

10.1007/s10458-012-9211-z

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Congestion games offer a perfect environment in which to study the impact of local decisions on global utilities in multiagent systems. What is particularly interesting in such problems is that no individual action is intrinsically "good" or "bad" but that combinations of actions lead to desirable or undesirable outcomes. As a consequence, agents need to learn how to coordinate their actions with those of other agents, rather than learn a particular set of "good" actions. A congestion game can be studied from two different perspectives: (i) from the top down, where a global utility (e.g., a system-centric view of congestion) specifies the task to be achieved; or (ii) from the bottom up, where each agent has its own intrinsic utility it wants to maximize. In many cases, these two approaches are at odds with one another, where agents aiming to maximize their intrinsic utilities lead to poor values of a system level utility. In this paper we extend results on difference utilities, a form of shaped utility that enables multiagent learning in congested, noisy conditions, to study the global behavior that arises from the agents' choices in two types of congestion games. Our key result is that agents that aim to maximize a modified version of their own intrinsic utilities not only perform well in terms of the global utility, but also, on average perform better with respect to their own original utilities. In addition, we show that difference utilities are robust to agents "defecting" and using their own intrinsic utilities, and that performance degrades gracefully with the number of defectors.

引用

页码：419 / 443

页数：25

共 53 条

[21]

Dresner K., 2004, P 3 INT JOINT C AUT, V2, P530

[22] INCENTIVES IN TEAMS [J].

GROVES, T .

ECONOMETRICA, 1973, 41 (04) :617-631

[23]

Hall S., 2004, 3 WORKSH AG TRAFF TR

[24] TRAGEDY OF COMMONS [J].

HARDIN, G .

SCIENCE, 1968, 162 (3859) :1243-+

[25] Traffic and related self-driven many-particle systems [J].

Helbing, D .

REVIEWS OF MODERN PHYSICS, 2001, 73 (04) :1067-1141

[26] Structure and instability of high-density equations for traffic flow [J].

Helbing, D .

PHYSICAL REVIEW E, 1998, 57 (05) :6176-6179

[27] Generalized force model of traffic dynamics [J].

Helbing, D ;

Tilch, B .

PHYSICAL REVIEW E, 1998, 58 (01) :133-138

[28]

HUBERMAN BA, 1988, ECOLOGY COMPUTATION, P77

[29]

Ieong S., 2005, AAAI, V5, P489

[30] Deterministic dynamics in the minority game [J].

Jefferies, P ;

Hart, ML ;

Johnson, NF .

PHYSICAL REVIEW E, 2002, 65 (01) :1-016105

← 1 2 3 4 5 6 →