Investigations with Monte Carlo Tree Search for Finding Better Multivariate Horner Schemes

被引:2
作者
van den Herik, H. Jaap [1 ]
Kuipers, Jan [2 ]
Vermaseren, Jos A. M. [2 ]
Plaat, Aske [1 ]
机构
[1] Tilburg Univ, Tilburg Ctr Cognit & Commun, NL-5037 AB Tilburg, Netherlands
[2] Nikhef Theory Grp, NL-1098 XG Amsterdam, Netherlands
来源
AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2013 | 2014年 / 449卷
关键词
Artificial intelligence; High energy physics; Horners rule; Monte Carlo Tree Search; Go; Chess; GAME; NUMBER;
D O I
10.1007/978-3-662-44440-5_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
After a computer chess program had defeated the human World Champion in 1997, many researchers turned their attention to the oriental game of Go. It turned out that the minimax approach, so successful in chess, did not work in Go. Instead, after some ten years of intensive research, a new method was developed: MCTS (Monte Carlo Tree Search), with promising results. MCTS works by averaging the results of random play-outs. At first glance it is quite surprising that MCTS works so well. However, deeper analysis revealed the reasons. The success of MCTS in Go caused researchers to apply the method to other domains. In this article we report on experiments with MCTS for finding improved orderings for multivariate Horner schemes, a basic method for evaluating polynomials. We report on initial results, and continue with an investigation into two parameters that guide the MCTS search. Horner's rule turns out to be a fruitful testbed for MCTS, allowing easy experimentation with its parameters. The results reported here provide insight into how and why MCTS works. It will be interesting to see if these insights can be transferred to other domains, for example, back to Go.
引用
收藏
页码:3 / 20
页数:18
相关论文
共 48 条
[1]   THE ORIGIN OF DYNAMIC KOMI [J].
Althoefer, Ingo .
ICGA JOURNAL, 2012, 35 (01) :31-34
[2]  
[Anonymous], 2006, 18 BENELUX CONFARTIF
[3]  
[Anonymous], THESIS U LIMBURG MAA
[4]  
Aoyama T., 2011, ARXIV11102826HEPPH
[5]   Finite-time analysis of the multiarmed bandit problem [J].
Auer, P ;
Cesa-Bianchi, N ;
Fischer, P .
MACHINE LEARNING, 2002, 47 (2-3) :235-256
[6]  
Bouzy Bruno, 2012, Advances in Computer Games. 13th International Conference, ACG 2011. Revised Selected Papers, P96, DOI 10.1007/978-3-642-31866-5_9
[7]  
Bouzy Bruno., 2003, ACG VOLUME 263 IFIP, V263, P159
[8]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[9]  
Brugmann B, 1993, AAAI FALL S GAM PLAY
[10]  
Ceberio M., 2004, SIGSAM Bulletin, V38, P8, DOI 10.1145/980175.980179