Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types

被引：8

作者：

Dhakan, Paresh ^{[1
]}

Merrick, Kathryn ^{[2
]}

Rano, Inaki ^{[3
]}

Siddique, Nazmul ^{[1
]}

机构：

[1] Ulster Univ, Intelligent Syst Res Ctr, Derry, North Ireland

[2] Univ New South Wales, Sch Engn & Informat Technol, Canberra, ACT, Australia

[3] Univ Southern Denmark, Maersk Mc Kinney Moller Inst, Embodied Syst Robot & Learning, Odense, Denmark

来源：

FRONTIERS IN NEUROROBOTICS | 2018年 / 12卷

关键词：

intrinsic reward function; goal types; open-ended learning; autonomous goal generation; reinforcement learning; MOTIVATION; ROBOTS;

D O I：

10.3389/fnbot.2018.00063

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In reinforcement learning, reward is used to guide the learning process. The reward is often designed to be task-dependent, and it may require significant domain knowledge to design a good reward function. This paper proposes general reward functions for maintenance, approach, avoidance, and achievement goal types. These reward functions exploit the inherent property of each type of goal and are thus task-independent. We also propose metrics to measure an agent's performance for learning each type of goal. We evaluate the intrinsic reward functions in a framework that can autonomously generate goals and learn solutions to those goals using a standard reinforcement learning algorithm. We show empirically how the proposed reward functions lead to learning in a mobile robot application. Finally, using the proposed reward functions as building blocks, we demonstrate how compound reward functions, reward functions to generate sequences of tasks, can be created that allow the mobile robot to learn more complex behaviors.

引用

页数：16

共 39 条

[1] Amodei Dario, 2016, CORR
[2] Andrychowicz Marcin, 2017, NEURAL INFORM PROCES, P5048
[3] [Anonymous], 2004, Advances in Neural Information Processing Systems, DOI DOI 10.21236/ADA440280
[4] [Anonymous], 2012, P IEEE INT C DEV LEA
[5] [Anonymous], 2015, Reinforcement Learning: An Introduction
[6] Baldassarre G., 2013, INTRINSICALLY MOTIVA
[7] Baraldi A., 1998, SIMPLIFIED ART NEW C
[8] Baranes A., 2010, 2010 IEEE 9th International Conference on Development and Learning (ICDL 2010), P197, DOI 10.1109/DEVLRN.2010.5578842
[9] Intrinsically Motivated Goal Exploration for Active Motor Learning in Robots: A Case Study
Baranes, Adrien
Oudeyer, Pierre-Yves
[J]. IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 1766 - 1773
[10] Novelty or Surprise?
Barto, Andrew
Mirolli, Marco
Baldassarre, Gianluca
[J]. FRONTIERS IN PSYCHOLOGY, 2013, 4

← 1 2 3 4 →