Reinforcement learning for robot control

Cited by: 7

Authors:

Smart, WD [1]

Kaelbling, LP [1]

Affiliations:

[1] Washington Univ, Dept Comp Sci, St Louis, MO 63130 USA

Source:

MOBILE ROBOTS XVI | 2002 / Vol. 4573

Keywords:

mobile robots; machine learning; reinforcement learning; learning control; learning by demonstration

DOI:

10.1117/12.457434

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Writing control code for mobile robots can be a very time-consuming process. Even for apparently simple tasks, it is often difficult to specify in detail how the robot should accomplish them. Robot control code is typically full of "magic numbers" that must be painstakingly set for each environment that the robot must operate in. The idea of having a robot learn how to accomplish a task, rather than being told explicitly, is an appealing one. It seems easier and much more intuitive for the programmer to specify what the robot should be doing, and to let it learn the fine details of how to do it. In this paper, we describe JAQL, a framework for efficient learning on mobile robots, and present the results of using it to learn control policies for some simple tasks.


Pages: 92-103

Page count: 12

References (22 in total)

[1] [Anonymous], 1993, NEURAL NETWORK PERCE

[2] Purposive behavior acquisition for a real robot by vision-based reinforcement learning [J]. Asada, M; Noda, S; Tawaratsumida, S; Hosoda, K. MACHINE LEARNING, 1996, 23 (2-3): 279-303

[3] Atkeson CG, 1997, ARTIF INTELL REV, V11, P11, DOI 10.1023/A:1006559212014

[4] BAKKER P, 1996, P AISB WORKSH LEARN, P3

[5] Boyan J. A., 1995, Advances in Neural Information Processing Systems 7, P369

[6] INFLUENTIAL OBSERVATIONS IN LINEAR-REGRESSION [J]. COOK, RD. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (365): 169-174

[7] DEMIRIS J, 1996, P 5 EUR WORKSH LEARN

[8] ROBOT SHAPING - DEVELOPING AUTONOMOUS AGENTS THROUGH LEARNING [J]. DORIGO, M; COLOMBETTI, M. ARTIFICIAL INTELLIGENCE, 1994, 71 (02): 321-370

[9] GORDON GJ, 1999, THESIS CARNEGIEMELLO

[10] Reinforcement learning: A survey [J]. Kaelbling, LP; Littman, ML; Moore, AW. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4: 237-285
