Classification of Monte Carlo Tree Search Variants

被引：0

作者：

McGuinness, Cameron ^{[1
]}

机构：

[1] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada

来源：

2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2016年

关键词：

Monte Carlo Tree Search; Agent Case Embedding; classification;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

\Many variations of Monte Carlo tree search have been proposed and tested but relatively little comparison of these variants have occurred. In this study an Agent Case Embedding analysis and agglomorative hierarchical clustering was performed using eight variants of Monte Carlo Tree Search as agents and eight games as cases. This allowed us to compare the variant's abilities on each of the games to determine the type of games each are good at handling as well as which variants are similar to others. This comparison of variants exploits the ability of ACEs to compare different types of objects based on their behavior. By looking at the behavior of MCTS variants on a variety of games we obtain a good notion of the degree to which different MCTS variants exhibit different capabilities. A side effect of comparing MCTS variants with agent-case embeddings is that we also are able to compare the games used to test the MCTS variants.

引用

页码：357 / 363

页数：7

共 13 条

[1]

[Anonymous], 2007, P 24 INT C MACH LEAR

[2] Agent-Case Embeddings for the Analysis of Evolved Systems [J].

Ashlock, Daniel ;

Lee, Colin .

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2013, 17 (02) :227-240

[3] Finite-time analysis of the multiarmed bandit problem [J].

Auer, P ;

Cesa-Bianchi, N ;

Fischer, P .

MACHINE LEARNING, 2002, 47 (2-3) :235-256

[4] Computer go: An AI oriented survey [J].

Bouzy, B ;

Cazenave, T .

ARTIFICIAL INTELLIGENCE, 2001, 132 (01) :39-103

[5] A Survey of Monte Carlo Tree Search Methods [J].

Browne, Cameron B. ;

Powley, Edward ;

Whitehouse, Daniel ;

Lucas, Simon M. ;

Cowling, Peter I. ;

Rohlfshagen, Philipp ;

Tavener, Stephen ;

Perez, Diego ;

Samothrakis, Spyridon ;

Colton, Simon .

IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43

[6]

Chaslot G., 2008, P AAAI C ARTIFICIAL, P216

[7]

Frydenberg F, 2015, IEEE CONF COMPU INTE, P107, DOI 10.1109/CIG.2015.7317937

[8]

Helmbold David P., 2009, Proceedings of the 2009 International Conference on Artificial Intelligence. ICAI 2009, P605

[9] Bandit based Monte-Carlo planning [J].

Kocsis, Levente ;

Szepesvari, Csaba .

MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 :282-293

[10]

Schadd M. P. D., 2011, THESIS

← 1 2 →