Relational reinforcement learning

被引:170
作者
Dzeroski, S
De Raedt, L
Driessens, K
机构
[1] Jozef Stefan Inst, Dept Intelligent Syst, SI-1000 Ljubljana, Slovenia
[2] Univ Freiburg, Inst Informat, D-79110 Freiburg, Germany
[3] Katholieke Univ Leuven, Dept Comp Sci, B-3001 Heverlee, Belgium
关键词
reinforcement learning; inductive logic programming; planning;
D O I
10.1023/A:1007694015589
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relational reinforcement learning is presented, a learning technique that combines reinforcement learning with relational learning or inductive logic programming. Due to the use of a more expressive representation language to represent states, actions and Q-functions, relational reinforcement learning can be potentially applied to a new range of learning tasks. One such task that we investigate is planning in the blocks world, where it is assumed that the effects of the actions are unknown to the agent and the agent has to learn a policy. Within this simple domain we show that relational reinforcement learning solves some existing problems with reinforcement learning. In particular, relational reinforcement learning allows us to employ structural representations, to abstract from specific goals pursued and to exploit the results of previous learning phases when addressing new (more complex) situations.
引用
收藏
页码:7 / 52
页数:46
相关论文
共 33 条
[1]  
[Anonymous], P 15 INT C MACH LEAR
[2]  
BAUM EB, 1996, P 13 INT C MACH LEAR
[3]   Top-down induction of first-order logical decision trees [J].
Blockeel, H ;
De Raedt, L .
ARTIFICIAL INTELLIGENCE, 1998, 101 (1-2) :285-297
[4]   Scaling up inductive logic programming by learning from interpretations [J].
Blockeel, H ;
de Raedt, L ;
Jacobs, N ;
Demoen, B .
DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 3 (01) :59-93
[5]  
BLOCKEEL H, 1997, P 7 INT WORKSH IND L, P77
[6]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[7]  
CARBONELL J, 1990, MACHINE LEARNING ART, P191
[8]  
Chapman D, 1991, IJCAI, V91, P726
[9]  
DERAEDT L, 1997, P 7 INT WORKSH IND L, P133
[10]   STRIPS - NEW APPROACH TO APPLICATION OF THEOREM PROVING TO PROBLEM SOLVING [J].
FIKES, RE ;
NILSSON, NJ .
ARTIFICIAL INTELLIGENCE, 1971, 2 (3-4) :189-208