Structural knowledge transfer by spatial abstraction for reinforcement learning agents

Cited by: 7

Authors:

Frommberger, Lutz [1]

Wolter, Diedrich [1]

Affiliations:

[1] Univ Bremen, AG Cognit Syst, SFB TR Spatial Cognit 8, Cognit Syst Res Grp, D-28334 Bremen, Germany

Source:

ADAPTIVE BEHAVIOR | 2010 / Vol. 18 / Issue 06

Keywords:

Abstraction; knowledge transfer; reinforcement learning; transfer learning; robot navigation

DOI:

10.1177/1059712310391484

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this article we investigate the role of abstraction principles for knowledge transfer in agent control learning tasks. We analyze abstraction from a formal point of view and characterize three distinct facets: aspectualization, coarsening, and conceptual classification. The taxonomy we develop allows us to interrelate existing approaches to abstraction, leading to a code of practice for designing knowledge representations that support knowledge transfer. We detail how aspectualization can be utilized to achieve knowledge transfer in reinforcement learning. We propose the use of so-called structure space aspectualizable knowledge representations that explicate structural properties of the state space and present a posteriori structure space aspectualization (APSST) as a method to extract generally sensible behavior from a learned policy. This new policy can be used for knowledge transfer to support learning new tasks in different environments. Finally, we present a case study that demonstrates transfer of generally sensible navigation skills from simple simulation to a real-world robotic platform.


Pages: 507-525

Number of pages: 19
