Agents teaching agents: a survey on inter-agent transfer learning

被引：48

作者：

Da Silva, Felipe Leno ^{[1
,3
]}

Warnell, Garrett ^{[2
]}

Costa, Anna Helena Reali ^{[1
]}

Stone, Peter ^{[3
]}

机构：

[1] Univ Sao Paulo, Sao Paulo, Brazil

[2] US Army, Res Lab, Austin, TX USA

[3] Univ Texas Austin, Austin, TX 78712 USA

来源：

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS | 2020年 / 34卷 / 01期

基金：

巴西圣保罗研究基金会;

关键词：

Multiagent learning; Transfer learning; Reinforcement learning; REINFORCEMENT; DOMAINS;

D O I：

10.1007/s10458-019-09430-0

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

While recent work in reinforcement learning (RL) has led to agents capable of solving increasingly complex tasks, the issue of high sample complexity is still a major concern. This issue has motivated the development of additional techniques that augment RL methods in an attempt to increase task learning speed. In particular, inter-agent teaching-endowing agents with the ability to respond to instructions from others-has been responsible for many of these developments. RL agents that can leverage instruction from a more competent teacher have been shown to be able to learn tasks significantly faster than agents that cannot take advantage of such instruction. That said, the inter-agent teaching paradigm presents many new challenges due to, among other factors, differences between the agents involved in the teaching interaction. As a result, many inter-agent teaching methods work only in restricted settings and have proven difficult to generalize to new domains or scenarios. In this article, we propose two frameworks that provide a comprehensive view of the challenges associated with inter-agent teaching. We highlight state-of-the-art solutions, open problems, prospective applications, and argue that new research in this area should be developed in the context of the proposed frameworks.

引用

页数：17

共 70 条

[1]

Amir O., 2016, P 25 INT JOINT C ART, P804

[2]

[Anonymous], 2018, IEEE INT S SAF SEC R

[3]

[Anonymous], 2000, ANAL STOCHASTIC GAME

[4]

[Anonymous], THESIS

[5]

[Anonymous], 1996, ADAPTATION LEARNING

[6]

[Anonymous], 2018, IEEE INT C ROB AUT I

[7]

[Anonymous], P 5 INT C LEARN REP

[8]

[Anonymous], NIPS WORKSH DEEP REI

[9]

[Anonymous], IEEE INT S ROB HUM I

[10]

[Anonymous], 2019, P 33 AAAI C ART INT

← 1 2 3 4 5 6 7 →