Coordinating Multiple Agents via Reinforcement Learning

被引:0
作者
Gang Chen
Zhonghua Yang
Hao He
Kiah Mok Goh
机构
[1] Nanyang Technological University,Information Communication Institute of Singapore, School of Electrical and Electronic Engineering
[2] Singapore Institute of Manufacturing Technology,undefined
来源
Autonomous Agents and Multi-Agent Systems | 2005年 / 10卷
关键词
multiagent system; coordination; reinforcement learning; task-oriented Environment;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we attempt to use reinforcement learning techniques to solve agent coordination problems in task-oriented environments. The Fuzzy Subjective Task Structure model (FSTS) is presented to model the general agent coordination. We show that an agent coordination problem modeled in FSTS is a Decision-Theoretic Planning (DTP) problem, to which reinforcement learning can be applied. Two learning algorithms, ‘‘coarse-grained’’ and ‘‘fine-grained’’, are proposed to address agents coordination behavior at two different levels. The ‘‘coarse-grained’’ algorithm operates at one level and tackle hard system constraints, and the ‘‘fine-grained’’ at another level and for soft constraints. We argue that it is important to explicitly model and explore coordination-specific (particularly system constraints) information, which underpins the two algorithms and attributes to the effectiveness of the algorithms. The algorithms are formally proved to converge and experimentally shown to be effective.
引用
收藏
页码:273 / 328
页数:55
相关论文
共 32 条
[1]  
Abul O.(2000)‘‘Multiagent reinforcement learning using function approximation’‘ IEEE Trans. Syst., Man, Cyber 30 485-497
[2]  
Polat F.(1999)‘‘Decision-Theoretic Planning: Structural Assumptions and Computational Leverage’‘ J. Artif. Intelligence Res. Vol. 11 1-94
[3]  
Alhajj R.(2000)‘‘Stochastic dynamic programming with factored representations’‘ Artif. Intell. Vol. 121 49-107
[4]  
Boutilier C.(1998)‘‘Elevator group control using multiple reinforcement learning agents’‘ Machine Learning 33 235-262
[5]  
Dean T.(1997)‘‘Abstraction and approximate decision-theoretic planning’‘ Artif. Intell. 89 219-283
[6]  
Hanks S.(1989)‘‘Negotiation task decomposition and allocation using partial global planning’‘ Distributed Artificial Intelligence vol. 2 229-243
[7]  
Boutilier C.(2003)‘‘Equivalence notions and model minimization in Markov decision processes’‘ Artif. Intell. vol. 147 163-223
[8]  
Dearden R.(1996)‘‘Reinforcement Learning A survey’‘ J. Artif. Intell. Res. vol. 4 237-285
[9]  
Goldszmidt M.(1951)‘‘A stochastic approximation method’‘ Ann. Math. Stat. vol. 22 400-407
[10]  
Crites R. H.(1989)‘‘Constraint-directed negotiation of resource reallocations’‘ Distributed Artificial Intelligence vol. 2 163-193