Agents teaching agents: a survey on inter-agent transfer learning

被引:48
作者
Da Silva, Felipe Leno [1 ,3 ]
Warnell, Garrett [2 ]
Costa, Anna Helena Reali [1 ]
Stone, Peter [3 ]
机构
[1] Univ Sao Paulo, Sao Paulo, Brazil
[2] US Army, Res Lab, Austin, TX USA
[3] Univ Texas Austin, Austin, TX 78712 USA
基金
巴西圣保罗研究基金会;
关键词
Multiagent learning; Transfer learning; Reinforcement learning; REINFORCEMENT; DOMAINS;
D O I
10.1007/s10458-019-09430-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While recent work in reinforcement learning (RL) has led to agents capable of solving increasingly complex tasks, the issue of high sample complexity is still a major concern. This issue has motivated the development of additional techniques that augment RL methods in an attempt to increase task learning speed. In particular, inter-agent teaching-endowing agents with the ability to respond to instructions from others-has been responsible for many of these developments. RL agents that can leverage instruction from a more competent teacher have been shown to be able to learn tasks significantly faster than agents that cannot take advantage of such instruction. That said, the inter-agent teaching paradigm presents many new challenges due to, among other factors, differences between the agents involved in the teaching interaction. As a result, many inter-agent teaching methods work only in restricted settings and have proven difficult to generalize to new domains or scenarios. In this article, we propose two frameworks that provide a comprehensive view of the challenges associated with inter-agent teaching. We highlight state-of-the-art solutions, open problems, prospective applications, and argue that new research in this area should be developed in the context of the proposed frameworks.
引用
收藏
页数:17
相关论文
共 70 条
[1]  
Amir O., 2016, P 25 INT JOINT C ART, P804
[2]  
[Anonymous], 2018, IEEE INT S SAF SEC R
[3]  
[Anonymous], 2000, ANAL STOCHASTIC GAME
[4]  
[Anonymous], THESIS
[5]  
[Anonymous], 1996, ADAPTATION LEARNING
[6]  
[Anonymous], 2018, IEEE INT C ROB AUT I
[7]  
[Anonymous], P 5 INT C LEARN REP
[8]  
[Anonymous], NIPS WORKSH DEEP REI
[9]  
[Anonymous], IEEE INT S ROB HUM I
[10]  
[Anonymous], 2019, P 33 AAAI C ART INT