Skills' learning in an autonomous mobile robot using continuous reinforcement

被引:0
作者
Boada, MJL [1 ]
Salichs, MA [1 ]
机构
[1] Univ Carlos III Madrid, Syst Engn & Automat Div, Madrid, Spain
来源
ADVANCED FUZZY-NEURAL CONTROL 2001 | 2002年
关键词
neural network; reinforcement learning; continuous reinforcement; continuous state and output spaces; visual tracking; autonomous mobile robot;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes a reinforcement learning algorithm which allows a mobile robot, equipped with a vision system formed by a single CCD camera mounted on a pan-tilt platform, to learn simple skills such as watching skill and orientation skill. Once these skills are learned, they are combined to obtain a more complex skill called approaching which permits the robot to go towards a visual target. The vision system is the only mean by which the robot is able to obtain information from environment. The used learning algorithm allows to work with continuous state and output spaces. When the robot performs an action, it receives a reinforcement signal, so that, at each moment the robot knows wether it is performing the action correctly. Copyright (C) 2 001 IFA.
引用
收藏
页码:117 / 122
页数:6
相关论文
共 14 条
[1]  
ASADA M, 2000, P IEEE INT C ROB AUT, V3, P2076
[2]   GripSee:: A gesture-controlled robot for object perception and manipulation [J].
Becker, M ;
Kefalea, E ;
Maël, E ;
Von Der Malsburg, C ;
Pagel, M ;
Triesch, J ;
Vorbrüggen, JC ;
Würtz, RP ;
Zadel, S .
AUTONOMOUS ROBOTS, 1999, 6 (02) :203-221
[3]   Experiences with an architecture for intelligent, reactive agents [J].
Bonasso, RP ;
Firby, RJ ;
Gat, E ;
Kortenkamp, D ;
Miller, DP ;
Slack, MG .
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 1997, 9 (2-3) :237-256
[4]  
GACHET D, 1994, P INT C INT ROB SYST, P290
[5]  
Hasegawa Y, 1999, ICRA '99: IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-4, PROCEEDINGS, P2799, DOI 10.1109/ROBOT.1999.774021
[6]  
Hashimoto K, 1998, IEEE INT CONF ROBOT, P2321, DOI 10.1109/ROBOT.1998.680669
[7]   A tutorial on visual servo control [J].
Hutchinson, S ;
Hager, GD ;
Corke, PI .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1996, 12 (05) :651-670
[8]   Reinforcement learning: A survey [J].
Kaelbling, LP ;
Littman, ML ;
Moore, AW .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :237-285
[9]   On-line training of recurrent neural networks with continuous topology adaptation [J].
Obradovic, D .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1996, 7 (01) :222-228
[10]  
Prokopowicz P. N., 1994, Proceedings of the Workshop on Visual Behaviors, P73, DOI 10.1109/VL.1994.365608