Inductive general game playing

被引:0
作者
Andrew Cropper
Richard Evans
Mark Law
机构
[1] University of Oxford,
[2] Imperial College London,undefined
来源
Machine Learning | 2020年 / 109卷
关键词
Program induction; Program synthesis; Inductive logic programming; General game playing;
D O I
暂无
中图分类号
学科分类号
摘要
General game playing (GGP) is a framework for evaluating an agent’s general intelligence across a wide range of tasks. In the GGP competition, an agent is given the rules of a game (described as a logic program) that it has never seen before. The task is for the agent to play the game, thus generating game traces. The winner of the GGP competition is the agent that gets the best total score over all the games. In this paper, we invert this task: a learner is given game traces and the task is to learn the rules that could produce the traces. This problem is central to inductive general game playing (IGGP). We introduce a technique that automatically generates IGGP tasks from GGP games. We introduce an IGGP dataset which contains traces from 50 diverse games, such as Sudoku, Sokoban, and Checkers. We claim that IGGP is difficult for existing inductive logic programming (ILP) approaches. To support this claim, we evaluate existing ILP systems on our dataset. Our empirical results show that most of the games cannot be correctly learned by existing systems. The best performing system solves only 40% of the tasks perfectly. Our results suggest that IGGP poses many challenges to existing approaches. Furthermore, because we can automatically generate IGGP tasks from GGP games, our dataset will continue to grow with the GGP competition, as new games are added every year. We therefore think that the IGGP problem and dataset will be valuable for motivating and evaluating future research.
引用
收藏
页码:1393 / 1434
页数:41
相关论文
共 90 条
[1]  
Bellodi E(2015)Structure learning of probabilistic logic programs by searching the clause space Theory and Practice of Logic Programming 15 169-212
[2]  
Riguzzi F(2002)Deep blue Artificial Intelligence 134 57-83
[3]  
Campbell M(2012)The YAP prolog system TPLP 12 5-34
[4]  
Joseph Hoane A(2019)Learning efficient logic programs Machine Learning 108 1063-1083
[5]  
Hsu F(2001)Complexity and expressive power of logic programming ACM Computing Surveys (CSUR) 33 374-425
[6]  
Costa VS(2007)Problog: A probabilistic prolog and its application in link discovery IJCAI 7 2462-2467
[7]  
Rocha R(1991)Structure-activity relationship of mutagenic aromatic and heteroaromatic nitro compounds. Correlation with molecular orbital energies and hydrophobicity Journal of Medicinal Chemistry 34 786-797
[8]  
Damas L(2001)Relational reinforcement learning Machine learning 43 7-52
[9]  
Cropper A(2018)Learning explanatory rules from noisy data Journal of Artificial Intelligence 61 1-64
[10]  
Muggleton SH(2011)Potassco: The potsdam answer set solving collection Ai Communications 24 107-124