An analysis of Single-Player Monte Carlo Tree Search performance in Sokoban

被引:8
作者
Crippa, Mattia [1 ]
Lanzi, Pier Luca [1 ]
Marocchi, Fabio [1 ]
机构
[1] Politecn Milan, Milan, Italy
关键词
Artificial intelligence; Puzzle games; Monte Carlo Tree Search; Sokoban;
D O I
10.1016/j.eswa.2021.116224
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We apply the extension of Monte Carlo Tree Search for single player games (SP-MCTS) to Sokoban and compare its performance to a solver integrating Iterative Deepening A* (IDA*) with several problem-specific heuristics. We introduce two extensions of MCTS to deal with some of the challenges that Sokoban poses to MCTS methods, namely, the reduced search space that deadlock situations can cause and the large number of cycles. We also evaluate three domain-independent enhancements that have been shown to improve MCTS performance, namely, UCB1-Tuned, Rapid Action Value Estimation (RAVE), and Node Recycling. We perform a series of experiments to determine the best SP-MCTS configuration and then compare its performance to IDA*. We show that SP-MCTS can solve around 85% of the levels with 1000000 iterations, that is the same performance reached by IDA* with only 10000 nodes. Overall, our results suggest that IDA* is still the best solver for Sokoban, also because it can easily integrate much domain knowledge. At the same time, our results also highlight some interesting directions to design better MCTS solvers for this domain.
引用
收藏
页数:10
相关论文
共 47 条
[1]  
[Anonymous], 2007, P 24 INT C MACH LEAR
[2]   Finite-time analysis of the multiarmed bandit problem [J].
Auer, P ;
Cesa-Bianchi, N ;
Fischer, P .
MACHINE LEARNING, 2002, 47 (2-3) :235-256
[3]   MCTS-Minimax Hybrids with State Evaluations [J].
Baier, Hendrik ;
Winands, Mark H. M. .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 62 :193-231
[4]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[5]  
Cazenave T., 2010, BOARD GAM STUD C PAR
[6]  
Cazenave T, 2009, 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, P456
[7]   Ensemble Determinization in Monte Carlo Tree Search for the Imperfect Information Card Game Magic: The Gathering [J].
Cowling, Peter I. ;
Ward, Colin D. ;
Powley, Edward J. .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (04) :241-257
[8]   Information Set Monte Carlo Tree Search [J].
Cowling, Peter I. ;
Powley, Edward J. ;
Whitehouse, Daniel .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (02) :120-143
[9]  
Culberson J. C., 1996, Advances in Artificial Intelligence. 11th Biennial Conference of the Canadian Society for Computational Studies of Intelligence, AI'96. Proceedings, P402
[10]  
Culberson J. C., 1997, 9792 TR U ALB