Structural knowledge transfer by spatial abstraction for reinforcement learning agents

被引:7
作者
Frommberger, Lutz [1 ]
Wolter, Diedrich [1 ]
机构
[1] Univ Bremen, AG Cognit Syst, SFB TR Spatial Cognit 8, Cognit Syst Res Grp, D-28334 Bremen, Germany
关键词
Abstraction; knowledge transfer; reinforcement learning; transfer learning; robot navigation;
D O I
10.1177/1059712310391484
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article we investigate the role of abstraction principles for knowledge transfer in agent control learning tasks. We analyze abstraction from a formal point of view and characterize three distinct facets: aspectualization, coarsening, and conceptual classification. The taxonomy we develop allows us to interrelate existing approaches to abstraction, leading to a code of practice for designing knowledge representations that support knowledge transfer. We detail how aspectualization can be utilized to achieve knowledge transfer in reinforcement learning. We propose the use of so-called structure space aspectualizable knowledge representations that explicate structural properties of the state space and present a posteriori structure space aspectualization (APSST) as a method to extract generally sensible behavior from a learned policy. This new policy can be used for knowledge transfer to support learning new tasks in different environments. Finally, we present a case study that demonstrates transfer of generally sensible navigation skills from simple simulation to a real-world robotic platform.
引用
收藏
页码:507 / 525
页数:19
相关论文
共 38 条
[1]  
[Anonymous], 1995, ADV NEURAL INFORM PR
[2]  
[Anonymous], P 24 INT C MACH LEAR
[3]  
Bertel S., 2004, VISUAL SPATIAL REASO, P255
[4]  
Bittner T., 2001, Spatial Information Theory. Foundations of Geographic Information Science. International Conference, COSIT 2001. Proceedings (Lecture Notes in Computer Science Vol.2205), P28
[5]  
Christopher John Cornish Hellaby Watkins, 1989, Learning from Delayed Rewards
[6]  
Dean T., 1997, AAAIIAAI
[7]  
Dietterich T. G., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P118
[8]  
Dietterich TG, 2000, ADV NEUR IN, V12, P994
[9]   ROBOT SHAPING - DEVELOPING AUTONOMOUS AGENTS THROUGH LEARNING [J].
DORIGO, M ;
COLOMBETTI, M .
ARTIFICIAL INTELLIGENCE, 1994, 71 (02) :321-370
[10]  
Fernandez F., 2006, P 5 INT JOINT C AUT, P720, DOI DOI 10.1145/1160633.1160762