Reinforcement learning using Voronoi space division

被引:2
作者
Aung, Kathy [1 ]
Fuchida, Takayasu [1 ]
机构
[1] Kagoshima Univ, Fac Engn, Grad Sch Sci & Engn, Dept Informat & Comp Sci, 1-21-40 Korimoto, Kagoshima 8900065, Japan
关键词
Reinforcement learning; Q-learning; Voronoi diagram; VQE;
D O I
10.1007/s10015-010-0818-3
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Reinforcement learning is considered an important tool for robotic learning in unknown/uncertain environments. In this article, we suggest that Voronoi space division creates a new Voronoi region which permits an arbitrary point in the plane, say a Voronoi Q-value element (VQE), and constructs a new method for space division using a Voronoi diagram in order to realize multidimensional reinforcement learning. This article shows some results for four-dimensional spaces, and the essential characteristics of VQEs in a continuous state and action are also described. The advantages of learning with a variety of VQEs are enhanced learning speed and reliability for this task.
引用
收藏
页码:330 / 334
页数:5
相关论文
共 6 条
[1]  
Baird LC, 1993, WLTR931147 DEF TECHN
[2]  
Gaskett C, 1999, LECT NOTES ARTIF INT, V1747, P417
[3]  
Rummery G. A., 1995, THESIS
[4]   Experiments with reinforcement learning in problems with continuous state and action spaces [J].
Santamaria, JC ;
Sutton, RS ;
Ram, A .
ADAPTIVE BEHAVIOR, 1997, 6 (02) :163-217
[5]  
Sutton R. S., 1998, INTRO REINFORCEMENT, V2
[6]  
Watkins C. J. C. H., 1989, LEARNING DELAYED REW