Single-player Monte-Carlo tree search for SameGame

被引:47
作者
Schadd, Maarten P. D. [1 ]
Winands, Mark H. M. [1 ]
Tak, Mandy J. W. [1 ]
Uiterwijk, Jos W. H. M. [1 ]
机构
[1] Maastricht Univ, Dept Knowledge Engn, Games & AI Grp, Maastricht, Netherlands
关键词
Monte-Carlo tree search; One-player game; Puzzle; Same Game; Cross-entropy method; KNOWLEDGE; UCT; GO;
D O I
10.1016/j.knosys.2011.08.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classic methods such as A* and IDA* are a popular and successful choice for one-player games. However, without an accurate admissible evaluation function, they fail. In this article we investigate whether Monte-Carlo tree search (MCTS) is an interesting alternative for one-player games where A* and IDA* methods do not perform well. Therefore, we propose a new MCTS variant, called single-player Monte-Carlo tree search (SP-MCTS). The selection and backpropagation strategy in SP-MCTS are different from standard MCTS. Moreover, SP-MCTS makes use of randomized restarts. We tested IDA* and SP-MCTS on the puzzle SameGame and used the cross-entropy method to tune the SP-MCTS parameters. It turned out that our SP-MCTS program is able to score a substantial number of points on the standardized test set. (c) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:3 / 11
页数:9
相关论文
共 44 条
[1]  
Allis L. V., 1994, THESIS RIJKSUNIVERSI
[2]  
[Anonymous], 2006, 6062 INRIA
[3]  
[Anonymous], THESIS U ALBERTA EDM
[4]   Limited-Damage A*: A path search algorithm that considers damage as a feasibility criterion [J].
Bayili, Serhat ;
Polat, Faruk .
KNOWLEDGE-BASED SYSTEMS, 2011, 24 (04) :501-512
[5]   Temporal difference learning for heuristic search and game playing [J].
Beal, DF ;
Smith, MC .
INFORMATION SCIENCES, 2000, 122 (01) :3-21
[6]  
Biedl T., 2002, More Games of No Chance, P389
[7]  
Billings D, 2007, COMMUNICATION
[8]   Fast planning through planning graph analysis [J].
Blum, AL ;
Furst, ML .
ARTIFICIAL INTELLIGENCE, 1997, 90 (1-2) :281-300
[9]   Associating domain-dependent knowledge and Monte Carlo approaches within a Go program [J].
Bouzy, B .
INFORMATION SCIENCES, 2005, 175 (04) :247-257
[10]  
Cazenave T, 2009, 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, P456