Simultaneously Learning at Different Levels of Abstraction

被引：0

作者：

Quack, Benjamin ^{[1
,2
]}

Woergoetter, Florentin ^{[1
,2
]}

Agostini, Alejandro ^{[1
,2
]}

机构：

[1] Univ Gottingen, Inst Phys 3, Gottingen, Germany

[2] Univ Gottingen, BCCN, Gottingen, Germany

来源：

2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2015年

关键词：

MANIPULATION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Robotic applications in human environments are usually implemented using a cognitive architecture that integrates techniques of different levels of abstraction, ranging from artificial intelligence techniques for making decisions at a symbolic level to robotic techniques for grounding symbolic actions. In this work we address the problem of simultaneous learning at different levels of abstractions in such an architecture. This problem is important since human environments are highly variable, and many unexpected situations may arise during the execution of a task. The usual approach under this circumstance is to train each level individually to learn how to deal with the new situations. However, this approach is limited since it implies long task interruptions every time a new situation needs to be learned. We propose an architecture where learning takes place simultaneously at all the levels of abstraction. To achieve this, we devise a method that permits higher levels to guide the learning at the levels below for the correct execution of the task. The architecture is instantiated with a logic-based planner and an online planning operator learner, at the highest level, and with online reinforcement learning units that learn action policies for the grounding of the symbolic actions, at the lowest one. A human teacher is involved in the decision-making loop to facilitate learning. The framework is tested in a physically realistic simulation of the Sokoban game.

引用

页码：4600 / 4607

页数：8

共 27 条

[1] Agostini A., 2015, ARTIFICIAL IN PRESS
[2] Learning weakly correlated cause-effects for gardening with a cognitive system
Agostini, Alejandro
Torras, Carme
Woergoetter, Florentin
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 36 : 178 - 194
[3] [Anonymous], 1998, ARTIF INTELL
[4] Toward humanoid manipulation in human-centred environments
Asfour, T.
Azad, P.
Vahrenkamp, N.
Regenstein, K.
Bierbaum, A.
Welke, K.
Schroeder, J.
Dillmann, R.
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2008, 56 (01) : 54 - 65
[5] Bakker B., 2004, Proc. of the 8-th Conf. on Intelligent Autonomous Systems, P438
[6] Botea A, 2003, LECT NOTES COMPUT SC, V2883, P360
[7] Hierarchical reinforcement learning with the MAXQ value function decomposition
Dietterich, TG
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 13 : 227 - 303
[8] Ghallab Malik, 2004, Automated Planning: theory and practice
[9] Grounds M, 2008, LECT NOTES ARTIF INT, V4865, P75, DOI 10.1007/978-3-540-77949-0_6
[10] Hengst B., 2003, THESIS U NEW S WALES

← 1 2 3 →