On partially controlled multi-agent systems

Cited by: 25
Authors
Brafman, RI [1]
Tennenholtz, M [1]
Affiliations
[1] TECHNION ISRAEL INST TECHNOL, IL-32000 HAIFA, ISRAEL
DOI
10.1613/jair.318
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Motivated by the control theoretic distinction between controllable and uncontrollable events, we distinguish between two types of agents within a multi-agent system: controllable agents, which are directly controlled by the system's designer, and uncontrollable agents, which are not under the designer's direct control. We refer to such systems as partially controlled multi-agent systems, and we investigate how one might influence the behavior of the uncontrolled agents through appropriate design of the controlled agents. In particular, we wish to understand which problems are naturally described in these terms, what methods can be applied to influence the uncontrollable agents, the effectiveness of such methods, and whether similar methods work across different domains. Using a game-theoretic framework, this paper studies the design of partially controlled multi-agent systems in two contexts: in one context, the uncontrollable agents are expected utility maximizers, while in the other they are reinforcement learners. We suggest different techniques for controlling agents' behavior in each domain, assess their success, and examine their relationship.
Pages: 477-507
Page count: 31