Graph based skill acquisition and transfer Learning for continuous reinforcement learning domains

被引：26

作者：

Shoeleh, Farzaneh ^{[1
]}

Asadpour, Masoud ^{[1
]}

机构：

[1] Univ Tehran, Fac Elect & Comp Engn, Tehran, Iran

来源：

PATTERN RECOGNITION LETTERS | 2017年 / 87卷

基金：

美国国家科学基金会;

关键词：

Reinforcement learning; Skill acquisition; Transfer learning; Graph learning; FRAMEWORK; ABSTRACTION;

D O I：

10.1016/j.patrec.2016.08.009

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Since reinforcement learning algorithms suffer from the curse of dimensionality in continuous domains, generalization is the most challenging issue in this area. Both skill acquisition and transfer learning are successful techniques to overcome such problem that result in big improvements in agent learning performance. In this paper, we propose a novel graph based skill acquisition method, named GSL, and a skill based transfer learning framework, named STL. GSL discovers skills as high-level knowledge using community detection from connectivity graph, a model to capture not only the agent's experience but also the environment's dynamics. STL incorporates skills previously learned from source task to speed up learning on a new target task. The experimental results indicate the effectiveness of the proposed methods in dealing with continuous reinforcement learning problems. (C) 2016 Elsevier B.V. All rights reserved.

引用

页码：104 / 116

页数：13

共 35 条

[1] A random graph model for power law graphs [J].

Aiello, W ;

Chung, F ;

Lu, LY .

EXPERIMENTAL MATHEMATICS, 2001, 10 (01) :53-66

[2]

[Anonymous], 2012, P AAAI C ART INT, DOI DOI 10.1609/AAAI.V26I1.8313

[3]

[Anonymous], 2009, NIPS

[4]

[Anonymous], P ICML WORKSH NEW DE

[5]

[Anonymous], THESIS

[6]

[Anonymous], 2011, Advances in Neural Information Processing Systems

[7]

Asadi M., 2015, Proceedings on the International Conference on Artificial Intelligence (ICAI). The Steering Committee of The World Congress in Computer Science, P22

[8]

Asadi M, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2054

[9] Fast unfolding of communities in large networks [J].

Blondel, Vincent D. ;

Guillaume, Jean-Loup ;

Lambiotte, Renaud ;

Lefebvre, Etienne .

JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,

[10]

Bohlin L., 2014, inMeasuringScholarlyImpact

← 1 2 3 4 →