Accounting for Task-Difficulty in Active Multi-Task Robot Control Learning

被引:4
作者
Fabisch, Alexander [1 ]
Metzen, Jan Hendrik [1 ,2 ]
Krell, Mario Michael [1 ]
Kirchner, Frank [1 ,2 ]
机构
[1] Univ Bremen, Robot Grp, Bremen, Germany
[2] German Res Ctr Artificial Intelligence DFKI, Robot Innovat Ctr, Bremen, Germany
来源
KUNSTLICHE INTELLIGENZ | 2015年 / 29卷 / 04期
关键词
Contextual policy search; Multi-task learning; Active learning;
D O I
10.1007/s13218-015-0363-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Contextual policy search is a reinforcement learning approach for multi-task learning in the context of robot control learning. It can be used to learn versatilely applicable skills that generalize over a range of tasks specified by a context vector. In this work, we combine contextual policy search with ideas from active learning for selecting the task in which the next trial will be performed. Moreover, we use active training set selection for reducing detrimental effects of exploration in the sampling policy. A core challenge in this approach is that the distribution of the obtained rewards may not be directly comparable between different tasks. We propose the novel approach PUBSVE for estimating a reward baseline and investigate empirically on benchmark problems and simulated robotic tasks to which extent this method can remedy the issue of non-comparable reward.
引用
收藏
页码:369 / 377
页数:9
相关论文
共 22 条
[1]  
da Silva BC, 2014, P 31 INT C MACH LEAR
[2]  
da Silva Bruno Castro, 2012, P 29 INT C MACH LEAR
[3]  
Deisenroth M. P., 2013, FDN TRENDS ROBOTICS, V2, P328
[4]  
Fabisch A, 2014, J MACH LEARN RES, V15, P3371
[5]  
Hansen N., 2010, P 12 ANN C COMP GEN
[6]   Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors [J].
Ijspeert, Auke Jan ;
Nakanishi, Jun ;
Hoffmann, Heiko ;
Pastor, Peter ;
Schaal, Stefan .
NEURAL COMPUTATION, 2013, 25 (02) :328-373
[7]   Reinforcement learning to adjust parametrized motor primitives to new situations [J].
Kober, Jens ;
Wilhelm, Andreas ;
Oztop, Erhan ;
Peters, Jan .
AUTONOMOUS ROBOTS, 2012, 33 (04) :361-379
[8]   Policy search for motor primitives in robotics [J].
Kober, Jens ;
Peters, Jan .
MACHINE LEARNING, 2011, 84 (1-2) :171-203
[9]  
Krell MM, 2015, THESIS
[10]  
Kupcsik A., 2013, P NAT C ART INT AAAI