A novel graphical approach to automatic abstraction in reinforcement learning

被引:14
作者
Taghizadeh, Nasrin [1 ]
Beigy, Hamid [1 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
关键词
Automatic skill acquisition; Spectral graph clustering; Eigenvector centrality; Option pruning; SKILL ACQUISITION;
D O I
10.1016/j.robot.2013.04.010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent researches on automatic skill acquisition in reinforcement learning have focused on subgoal discovery methods. Among them, algorithms based on graph partitioning have achieved higher performance. In this paper, we propose a new automatic skill acquisition framework based on graph partitioning approach. The main steps of this framework are identifying subgoals and discovering useful skills. We propose two subgoal discovery algorithms, which use spectral analysis on the transition graph of the learning agent. The first proposed algorithm, incorporates k'-means algorithm with spectral clustering. In the second algorithm, eigenvector centrality measure is utilized and options are discovered. Moreover, we propose an algorithm for pruning useless options, which cause additional costs for the learning agent. The experimental results on various problems show significant improvement in the learning performance of the agent. (c) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:821 / 835
页数:15
相关论文
共 40 条
[1]  
[Anonymous], 1997, NUMERICAL LINEAR ALG
[2]  
[Anonymous], 2008, Behavioral Building Blocks for Autonomous Agents: Description, Identification, and Learning
[3]  
[Anonymous], 1970, Problems in Analysis
[4]  
[Anonymous], 1996, THESIS U ROCHESTER
[5]  
[Anonymous], 2008, P 25 INT C MACH LEAR, DOI 10
[6]  
[Anonymous], 2002, ICML
[7]  
[Anonymous], 2001, INT C MACHINE LEARNI
[8]  
Asadpour M., 2007, THESIS ECOLE POLYTEC
[9]   Roles in networks [J].
Canright, G ;
Engo-Monsen, K .
SCIENCE OF COMPUTER PROGRAMMING, 2004, 53 (02) :195-214
[10]  
Canright G. S., 2005, Telektronikk, V101, P65