Reinforcement learning using Voronoi space division

Cited by: 2

Authors:

Aung, Kathy [1]

Fuchida, Takayasu [1]

Affiliations:

[1] Kagoshima Univ, Fac Engn, Grad Sch Sci & Engn, Dept Informat & Comp Sci, 1-21-40 Korimoto, Kagoshima 8900065, Japan

Source:

ARTIFICIAL LIFE AND ROBOTICS | 2010 / Vol. 15 / No. 3

Keywords:

Reinforcement learning; Q-learning; Voronoi diagram; VQE;

DOI:

10.1007/s10015-010-0818-3

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Subject classification codes:

080202 ; 1405 ;

Abstract:

Reinforcement learning is considered an important tool for robotic learning in unknown/uncertain environments. In this article, we suggest that Voronoi space division creates a new Voronoi region which permits an arbitrary point in the plane, say a Voronoi Q-value element (VQE), and constructs a new method for space division using a Voronoi diagram in order to realize multidimensional reinforcement learning. This article shows some results for four-dimensional spaces, and the essential characteristics of VQEs in a continuous state and action are also described. The advantages of learning with a variety of VQEs are enhanced learning speed and reliability for this task.
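The idea of attaching Q-values to points whose Voronoi regions partition a continuous space can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `VoronoiQTable`, its methods, the parameters `alpha` and `gamma`, and the choice of a fixed candidate-action list are all assumptions made for illustration. The sketch assumes each VQE is a point in the joined state-action space and that a query point takes the Q-value of its nearest VQE, which is equivalent to finding the Voronoi region that contains it.

```python
import math


class VoronoiQTable:
    """Hypothetical sketch: Q-values stored at points (VQEs).

    A query uses the nearest VQE, i.e. the VQE whose Voronoi
    region contains the query point. Not the paper's code.
    """

    def __init__(self, sites, alpha=0.5, gamma=0.9):
        self.sites = [tuple(p) for p in sites]  # VQE positions in state-action space
        self.q = [0.0] * len(self.sites)        # one Q-value per VQE
        self.alpha = alpha                      # learning rate (assumed)
        self.gamma = gamma                      # discount factor (assumed)

    def nearest(self, point):
        # Index of the VQE closest to `point` under Euclidean distance.
        return min(range(len(self.sites)),
                   key=lambda i: math.dist(self.sites[i], point))

    def value(self, state, action):
        # Q-value of the Voronoi region containing (state, action).
        return self.q[self.nearest(tuple(state) + tuple(action))]

    def best_action(self, state, candidate_actions):
        # Greedy choice over a finite list of candidate actions.
        return max(candidate_actions, key=lambda a: self.value(state, a))

    def update(self, state, action, reward, next_state, candidate_actions):
        # Standard one-step Q-learning update applied to the nearest VQE.
        i = self.nearest(tuple(state) + tuple(action))
        best_next = max(self.value(next_state, a) for a in candidate_actions)
        target = reward + self.gamma * best_next
        self.q[i] += self.alpha * (target - self.q[i])
        return i
```

For brevity, a usage example would place VQEs in a two-dimensional space (one state dimension, one action dimension); the same code accepts four-dimensional points, since `math.dist` works for any matching dimension.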

Pages: 330-334

Page count: 5