Prediction based segmentation of state space and application to a subgoal finding problem in reinforcement learning

被引:0
作者
Nagata, Y [1 ]
Ohigashi, Y [1 ]
Takahashi, H [1 ]
Ishikawa, S [1 ]
Omori, T [1 ]
Morikawa, K [1 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci, Sapporo, Hokkaido, Japan
来源
SICE 2004 ANNUAL CONFERENCE, VOLS 1-3 | 2004年
关键词
segmentation; reinforcement learning; subgoal; acceleration of learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Humans solve problems by segmenting perceived continuous phenomenon and searching for action in compressed problem space. In this paper, we propose a method for segmenting continuous state space based on local prediction and a long-term prediction of continuous phenomenon. Furthermore, we investigate a subgoal finding problem in reinforcement learning as an instance of application of the segmentation result. A state having a subgoal function is found by propagating value from goal in a compressed state space. Reinforcement learning is accelerated by establishing subrewards in the state.
引用
收藏
页码:2560 / 2565
页数:6
相关论文
共 7 条
  • [1] GREFENSTETTE JJ, 1988, MACH LEARN, V3, P224
  • [2] MATARIC MJ, 1994, P 11 INT C MACH LEAR, P181
  • [3] McGovern A, 2001, P 18 INT C MACH LEAR, P361, DOI 10.5555/645530.655681
  • [4] MIYAZAKI K, 1999, P 5 INT C INF SYST A, P190
  • [5] NAGATA Y, 2004, NC2003196 IEICE, P13
  • [6] OHIGASHI Y, 2004, NC2003195 IEICE, P7
  • [7] Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447