State Abstraction in Reinforcement Learning by Eliminating Useless Dimensions

Cited by: 1
Authors
Cheng, Zhao [1]
Ray, Laura E. [1]
Affiliations
[1] Dartmouth Coll, Thayer Sch Engn, Hanover, NH 03755 USA
Source
2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA) | 2014
Keywords
reinforcement learning; intelligent agent; state abstraction; complexity reduction
DOI
10.1109/ICMLA.2014.22
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Q-learning and other linear dynamic learning algorithms are subject to Bellman's curse of dimensionality for any realistic learning problem. This paper introduces a framework for satisficing state abstraction (one that reduces state dimensionality, improving convergence and reducing computational and memory requirements) by eliminating useless state dimensions. Statistical parameters that depend on the state and the Q-values identify the relevance of a given state space to a task space and allow the state elements that contribute least to task learning to be discarded. Empirical results from applying state abstraction to a canonical single-agent path planning task and to a more difficult multi-agent foraging problem demonstrate the utility of the proposed methods in improving learning convergence and performance in resource-constrained learning problems.
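The abstract does not state which statistical parameters the authors use, so the following is only an illustrative sketch of the general idea and not the paper's method. It assumes a tabular Q-function stored as a dictionary keyed by (state tuple, action), and it uses a hypothetical relevance score: the average standard deviation of Q-values along one state dimension with the other dimensions and the action held fixed. The names `dimension_relevance` and `drop_dimension` are invented for this example.

```python
from collections import defaultdict
from statistics import mean, pstdev


def dimension_relevance(q, n_dims):
    """Score each state dimension by how much Q varies along it.

    q maps (state_tuple, action) -> Q-value. The score for dimension d is
    the mean population standard deviation of Q over groups of entries that
    differ only in coordinate d. This statistic is an assumption made for
    illustration; the source does not specify its statistic.
    """
    scores = []
    for d in range(n_dims):
        groups = defaultdict(list)
        for (state, action), value in q.items():
            rest = state[:d] + state[d + 1:]  # state with coordinate d removed
            groups[(rest, action)].append(value)
        scores.append(mean(pstdev(vals) for vals in groups.values()))
    return scores


def drop_dimension(q, d):
    """Return a reduced Q-table with dimension d removed.

    Entries that collapse onto the same reduced state are averaged.
    """
    groups = defaultdict(list)
    for (state, action), value in q.items():
        groups[(state[:d] + state[d + 1:], action)].append(value)
    return {key: mean(vals) for key, vals in groups.items()}


# Toy Q-table over a 3 x 4 grid with 2 actions, where Q ignores the y coordinate.
q = {((x, y), a): float(x + a)
     for x in range(3) for y in range(4) for a in range(2)}

scores = dimension_relevance(q, n_dims=2)
least_relevant = scores.index(min(scores))  # dimension whose removal loses least
reduced_q = drop_dimension(q, least_relevant)
```

In this toy table the second coordinate never changes the Q-value, so it receives the lowest score and is the one removed, shrinking the table from 24 entries to 6.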
Pages: 105-110
Page count: 6