Learning Task Allocation for Multiple Flows in Multi-agent Systems

被引:6
作者
Xiao, Zheng [1 ]
Ma, Shengxiang [1 ]
Zhang, Shiyong [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS | 2009年
关键词
Agent cooperation; Task allocation; Multi-agent system; Multiple task flows; Q-learning;
D O I
10.1109/ICCSN.2009.28
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Task allocation is a key problem for agent to reach cooperation in multi-agent systems. Lately task flows are replacing traditional static tasks, thus real-time dynamic task allocation mechanisms draw more attention. Though scheduling single task flow is well investigated, little work on allocation of multiple task flows has been done. In this paper a distributed and self-adaptable scheduling algorithm based on Q-learning for multiple task flows is proposed. This algorithm can not only adapt to task arrival process on itself, but also fully consider the influence from task flows on other agents. Besides, its distributed property guaranteed that it can be applied to open multi-agent systems with local view. Reinforcement learning makes allocation adapt to task load and node distribution. It is verified that this algorithm improves task throughput, and decreases average execution time per task.
引用
收藏
页码:153 / 157
页数:5
相关论文
共 9 条
[1]  
Ali H. H., 1993, Journal of Combinatorial Mathematics and Combinatorial Computing, V14, P15
[2]   SCHEDULING PARALLEL PROGRAM TASKS ONTO ARBITRARY TARGET MACHINES [J].
ELREWINI, H ;
LEWIS, TG .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1990, 9 (02) :138-153
[3]  
Hanna H., 2002, Proceedings of the First International Joint Conference on Autonomous Agents and Multiagent Systems, P1303
[4]  
Schneider J, 1999, MACHINE LEARNING, PROCEEDINGS, P371
[5]  
SHERIEF A, 2005, P WORKSH PLANN LEARN, P76
[6]  
SHERIEF A, 2005, P 4 INT JOINT C AUT, P719
[7]  
SHERIEF A, 2006, P 5 INT JOINT C AUT, P850
[8]  
Shoham Y., 2003, Multi-agent reinforcement learning: a critical survey
[9]  
Zhong Yu, 2003, Control Theory & Applications, V20, P317