Reinforcement learning in robotics: A survey

被引：1896

作者：

Kober, Jens ^{[1
,2
]}

Bagnell, J. Andrew ^{[3
]}

Peters, Jan ^{[4
,5
]}

机构：

[1] Univ Bielefeld, CoR Lab Res Inst Cognit & Robot, D-33615 Bielefeld, Germany

[2] Honda Res Inst Europe, Offenbach, Germany

[3] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA

[4] Max Planck Inst Intelligent Syst, Dept Empir Inference, Tubingen, Germany

[5] Tech Univ Darmstadt, FB Informat, FG Intelligent Autonomous Syst, Darmstadt, Germany

来源：

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH | 2013年 / 32卷 / 11期

关键词：

Reinforcement learning; learning control; robot; survey; REAL ROBOT; POLICY GRADIENT; BEHAVIOR; ACQUISITION; PERCEPTION; LOCOMOTION; SKILLS;

D O I：

10.1177/0278364913495721

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Reinforcement learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in reinforcement learning. The relationship between disciplines has sufficient promise to be likened to that between physics and mathematics. In this article, we attempt to strengthen the links between the two research communities by providing a survey of work in reinforcement learning for behavior generation in robots. We highlight both key challenges in robot reinforcement learning as well as notable successes. We discuss how contributions tamed the complexity of the domain and study the role of algorithms, representations, and prior knowledge in achieving these successes. As a result, a particular focus of our paper lies on the choice between model-based and model-free as well as between value-function-based and policy-search methods. By analyzing a simple problem in some detail we demonstrate how reinforcement learning approaches may be profitably applied, and we note throughout open questions and the tremendous potential for future research.

引用

页码：1238 / 1274

页数：37

共 222 条

[1]

Abbeel P., 2004, P 21 INT C MACH LEAR

[2]

Abbeel Pieter, 2006, INT C MACH LEARN ICM, P1

[3]

An Chae H., 1988, Model-based Control of a Robot Manipulator

[4]

[Anonymous], 2008, P AAAI C ART INT

[5]

[Anonymous], 2010, Robotics: Science and Systems RSS

[6]

[Anonymous], IEEE INT C INT ROB S

[7]

[Anonymous], IEEE INT C ROB AUT I

[8]

[Anonymous], IEEE INT C ROB AUT I

[9]

[Anonymous], 2012, INT C MACH LEARN ICM

[10]

[Anonymous], 2011, AISTATS

← 1 2 3 4 5 6 7 8 9 10 →