Sharing in Teams of Heterogeneous, Collaborative Learning Agents

被引:12
作者
Gifford, Christopher M. [1 ]
Agah, Arvin [1 ]
机构
[1] Univ Kansas, CReSIS, Lawrence, KS 66045 USA
基金
美国国家科学基金会;
关键词
D O I
10.1002/int.20331
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is focused on the effects of sharing knowledge and collaboration of multiple heterogenous, intelligent agents (hardware or software) which work together to learn a task. As each agent employ's a different machine learning technique, the system consists of multiple knowledge sources and their respective heterogeneous knowledge representations. Collaboration between agents involves sharing knowledge to both speed up team learning, as well as refine the team's overall performance and group behavior. Experiments have been performed that vary the learn composition in terms of machine learning algorithms. learning strategies employed by the agents, and sharing frequency for a predator-prey cooperative pursuit task. For lifelong learning heterogenous learning teams were more successful than homogenous learning counterparts. Interestingly, sharing increased the learning rate, but sharing with higher frequency showed diminishing results. Lastly. knowledge conflicts are reduced over time the more sharing takes place. These results support further investigation of the merits of heterogenous learning. (C) 2008 Wiley Periodicals. Inc.
引用
收藏
页码:173 / 200
页数:28
相关论文
共 39 条
  • [1] Cooperative Q-learning: the knowledge sharing issue
    Ahmadabadi, MN
    Asadpour, M
    Nakano, E
    [J]. ADVANCED ROBOTICS, 2001, 15 (08) : 815 - 832
  • [2] AHMADABADI MN, 2000, P IEEE RSJ INT C INT, V3, P2261
  • [3] Alpaydin E., 1998, Proceedings of Engineering of Intelligent Systems, V2, P6
  • [4] [Anonymous], 11 INT WORKSH DISTR
  • [5] [Anonymous], 1997, Multi-agent reinforcement learning: independent vs. cooperative agents
  • [6] [Anonymous], 1989, THESIS CAMBRIDGE U
  • [7] Arkin RC, 2003, MULTI-ROBOT SYSTEMS: FROM SWARMS TO INTELLIGENT AUTOMATA, VOL II, P137
  • [8] ARKIN RC, 1998, MULTISTRATEGY LEARNI, P331
  • [9] BALCH T, 1998, THESIS COLL COMPUTIN
  • [10] BALCH T, 1998, P AAAI SPRING S STAN