State Abstraction in Reinforcement Learning by Eliminating Useless Dimensions

Cited by: 1

Authors:

Cheng, Zhao [1]

Ray, Laura E. [1]

Affiliations:

[1] Dartmouth Coll, Thayer Sch Engn, Hanover, NH 03755 USA

Source:

2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA) | 2014

Keywords:

reinforcement learning; intelligent agent; state abstraction; complexity reduction;

DOI:

10.1109/ICMLA.2014.22

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Q-learning and other linear dynamic learning algorithms are subject to Bellman's curse of dimensionality for any realistic learning problem. This paper introduces a framework for satisficing state abstraction (one that reduces state dimensionality, improving convergence and reducing computational and memory resources) by eliminating useless state dimensions. Statistical parameters that depend on the state and Q-values identify the relevance of a given state space to a task space and allow the state elements that contribute least to task learning to be discarded. Empirical results of applying state abstraction to a canonical single-agent path planning task and to a more difficult multi-agent foraging problem demonstrate the utility of the proposed methods in improving learning convergence and performance in resource-constrained learning problems.
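
The abstract does not say which statistical parameters the authors use, so the following is only an illustrative sketch of the general idea, not the paper's method. It assumes a tabular Q-function stored as a NumPy array of shape (n_1, ..., n_k, n_actions) and uses a stand-in relevance score: the variance of the Q-values along each state dimension, averaged over everything else. The function names `dimension_relevance` and `drop_least_relevant` and the toy Q-table are hypothetical.

```python
import numpy as np


def dimension_relevance(q_table):
    """Score each state dimension by how much Q-values vary along it.

    Assumption (not from the paper): relevance is the variance of Q along
    a dimension, averaged over all other dimensions and actions.
    q_table has shape (n_1, ..., n_k, n_actions).
    """
    n_state_dims = q_table.ndim - 1  # the last axis indexes actions
    return [float(np.var(q_table, axis=axis).mean())
            for axis in range(n_state_dims)]


def drop_least_relevant(q_table):
    """Remove the lowest-scoring state dimension by averaging Q over it."""
    scores = dimension_relevance(q_table)
    axis = int(np.argmin(scores))
    return axis, q_table.mean(axis=axis)


# Toy Q-table: 5 positions x 3 values of an irrelevant feature x 2 actions.
# Q depends on position and action only, never on the middle dimension.
position = np.arange(5.0).reshape(5, 1, 1)
action_gain = np.array([1.0, 2.0]).reshape(1, 1, 2)
q_table = np.broadcast_to(position * action_gain, (5, 3, 2)).copy()

scores = dimension_relevance(q_table)
dropped_axis, reduced_q = drop_least_relevant(q_table)
print("relevance scores:", scores)
print("dropped dimension:", dropped_axis, "reduced shape:", reduced_q.shape)
```

In this toy case the middle dimension never changes the Q-values, so its score is zero and it is the one discarded; the reduced table has one fewer state dimension, which is the memory and convergence benefit the abstract refers to.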

References

Pages: 105-110

Number of pages: 6

18 items in total

[11]

Jonsson A, 2001, ADV NEUR IN, V13, P1054

[12]

Lu Y., 2007, P 15 ACM INT C MULT, P301, DOI 10.1145/1291233.1291297

[13]

Mao T., 2012, INT J INFORM ELECT E, V2, P538

[14]

Parr R., 2008, P 25 INT C MACH LEAR, P752

[15] Cooperative Multi-Robot Reinforcement Learning: A Framework in Hybrid State Space.

Sun, Xueqing; Mao, Tao; Kralik, Jerald D.; Ray, Laura E.

2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009: 1190-1196

[16]

WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698

[17] Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning.

Sun X.; Mao T.; Ray L.; Shi D.; Kralik J.

Journal of Control Theory and Applications, 2011, 9 (03): 440-450

[18]

Xueqing Sun, 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010), P3244, DOI 10.1109/IROS.2010.5652923
