Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types

被引：8

作者：

Dhakan, Paresh ^{[1
]}

Merrick, Kathryn ^{[2
]}

Rano, Inaki ^{[3
]}

Siddique, Nazmul ^{[1
]}

机构：

[1] Ulster Univ, Intelligent Syst Res Ctr, Derry, North Ireland

[2] Univ New South Wales, Sch Engn & Informat Technol, Canberra, ACT, Australia

[3] Univ Southern Denmark, Maersk Mc Kinney Moller Inst, Embodied Syst Robot & Learning, Odense, Denmark

来源：

FRONTIERS IN NEUROROBOTICS | 2018年 / 12卷

关键词：

intrinsic reward function; goal types; open-ended learning; autonomous goal generation; reinforcement learning; MOTIVATION; ROBOTS;

D O I：

10.3389/fnbot.2018.00063

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In reinforcement learning, reward is used to guide the learning process. The reward is often designed to be task-dependent, and it may require significant domain knowledge to design a good reward function. This paper proposes general reward functions for maintenance, approach, avoidance, and achievement goal types. These reward functions exploit the inherent property of each type of goal and are thus task-independent. We also propose metrics to measure an agent's performance for learning each type of goal. We evaluate the intrinsic reward functions in a framework that can autonomously generate goals and learn solutions to those goals using a standard reinforcement learning algorithm. We show empirically how the proposed reward functions lead to learning in a mobile robot application. Finally, using the proposed reward functions as building blocks, we demonstrate how compound reward functions, reward functions to generate sequences of tasks, can be created that allow the mobile robot to learn more complex behaviors.

引用

页数：16

共 39 条

[21]

Merrick K. E, 2007, REINFORCEMENT LEARNI

[22]

Merrick K. E, 2009, MOTIVATED REINFORCEM, P1

[23] Intrinsic Motivation and Introspection in Reinforcement Learning [J].

Merrick, Kathryn E. .

IEEE TRANSACTIONS ON AUTONOMOUS MENTAL DEVELOPMENT, 2012, 4 (04) :315-329

[24]

Mirolli M., 2013, Functions and Mechanisms of Intrinsic Motivations, P49

[25]

Nehmzow U., 2013, Intrinsically Motivated Learning in Natural and Artificial Systems, P185, DOI DOI 10.1007/978-3-642-32375-1_8

[26]

Neto H. V, 2004, AUTONOMOUS ROBOTIC S

[27]

Ng AY, 1999, MACHINE LEARNING, PROCEEDINGS, P278

[28] Intrinsic motivation systems for autonomous mental development [J].

Oudeyer, Pierre-Yves ;

Kaplan, Frederic ;

Hafner, Verena V. .

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2007, 11 (02) :265-286

[29]

Oudeyer Pierre-Yves, 2007, Front Neurorobot, V1, P6, DOI 10.3389/neuro.12.006.2007

[30]

Rao A. S., 1995, ICMAS-95 Proceedings. First International Conference on Multi-Agent Systems, P312

← 1 2 3 4 →