Graph based skill acquisition and transfer Learning for continuous reinforcement learning domains

被引:25
作者
Shoeleh, Farzaneh [1 ]
Asadpour, Masoud [1 ]
机构
[1] Univ Tehran, Fac Elect & Comp Engn, Tehran, Iran
基金
美国国家科学基金会;
关键词
Reinforcement learning; Skill acquisition; Transfer learning; Graph learning; FRAMEWORK; ABSTRACTION;
D O I
10.1016/j.patrec.2016.08.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since reinforcement learning algorithms suffer from the curse of dimensionality in continuous domains, generalization is the most challenging issue in this area. Both skill acquisition and transfer learning are successful techniques to overcome such problem that result in big improvements in agent learning performance. In this paper, we propose a novel graph based skill acquisition method, named GSL, and a skill based transfer learning framework, named STL. GSL discovers skills as high-level knowledge using community detection from connectivity graph, a model to capture not only the agent's experience but also the environment's dynamics. STL incorporates skills previously learned from source task to speed up learning on a new target task. The experimental results indicate the effectiveness of the proposed methods in dealing with continuous reinforcement learning problems. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:104 / 116
页数:13
相关论文
共 35 条
  • [1] A random graph model for power law graphs
    Aiello, W
    Chung, F
    Lu, LY
    [J]. EXPERIMENTAL MATHEMATICS, 2001, 10 (01) : 53 - 66
  • [2] [Anonymous], 2012, P AAAI C ART INT, DOI DOI 10.1609/AAAI.V26I1.8313
  • [3] [Anonymous], 2009, NIPS
  • [4] [Anonymous], P ICML WORKSH NEW DE
  • [5] [Anonymous], THESIS
  • [6] [Anonymous], 2011, Advances in Neural Information Processing Systems
  • [7] Asadi M., 2015, Proceedings on the International Conference on Artificial Intelligence (ICAI). The Steering Committee of The World Congress in Computer Science, P22
  • [8] Asadi M, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2054
  • [9] Fast unfolding of communities in large networks
    Blondel, Vincent D.
    Guillaume, Jean-Loup
    Lambiotte, Renaud
    Lefebvre, Etienne
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
  • [10] Bohlin L., 2014, inMeasuringScholarlyImpact