Reinforcement learning for robot control

被引:7
作者
Smart, WD [1 ]
Kaelbling, LP [1 ]
机构
[1] Washington Univ, Dept Comp Sci, St Louis, MO 63130 USA
来源
MOBILE ROBOTS XVI | 2002年 / 4573卷
关键词
mobile robots; machine learning; reinforcement learning; learning control; learning by demonstration;
D O I
10.1117/12.457434
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Writing control code for mobile robots can be a very time-consuming process, Even for apparently simple tasks. it is often difficult to specify in detail how the robot should accomplish them. Robot control code is typically full of "magic numbers" that must be painstakingly set for each environment that the robot must operate in. The idea of having a robot learn how to accomplish a task, rather than being told explicitly is an appealing one. It seems easier and much more intuitive for the programmer to specify what the robot should be doing, and to let it learn the fine details of how to do it. Ill this paper, we describe JAQL, a framework for efficient learning on mobile robots, and present the results of using it to learn control policies for some simple tasks.
引用
收藏
页码:92 / 103
页数:12
相关论文
共 22 条
[1]  
[Anonymous], 1993, NEURAL NETWORK PERCE
[2]   Purposive behavior acquisition for a real robot by vision-based reinforcement learning [J].
Asada, M ;
Noda, S ;
Tawaratsumida, S ;
Hosoda, K .
MACHINE LEARNING, 1996, 23 (2-3) :279-303
[3]  
Atkeson CG, 1997, ARTIF INTELL REV, V11, P11, DOI 10.1023/A:1006559212014
[4]  
BAKKER P, 1996, P AISB WORKSH LEARN, P3
[5]  
Boyan J. A., 1995, Advances in Neural Information Processing Systems 7, P369
[6]   INFLUENTIAL OBSERVATIONS IN LINEAR-REGRESSION [J].
COOK, RD .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (365) :169-174
[7]  
DEMIRIS J, 1996, P 5 EUR WORKSH LEARN
[8]   ROBOT SHAPING - DEVELOPING AUTONOMOUS AGENTS THROUGH LEARNING [J].
DORIGO, M ;
COLOMBETTI, M .
ARTIFICIAL INTELLIGENCE, 1994, 71 (02) :321-370
[9]  
GORDON GJ, 1999, THESIS CARNEGIEMELLO
[10]   Reinforcement learning: A survey [J].
Kaelbling, LP ;
Littman, ML ;
Moore, AW .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :237-285