Dynamic randomization and domain knowledge in Monte-Carlo Tree Search for Go knowledge-based systems

被引：10

作者：

Chen, Keh-Hsun ^{[1
]}

机构：

[1] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA

来源：

KNOWLEDGE-BASED SYSTEMS | 2012年 / 34卷

关键词：

Monte-Carlo Tree Search; UCT algorithm; Simulation game; Domain knowledge; Go; Search parameters; Move generators; Dynamic randomization; COMPUTER GO; GAME;

D O I：

10.1016/j.knosys.2011.08.007

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper is an extension of the article 1131 presented at IWCG of TAA1 2010. It proposes two dynamic randomization techniques for Monte-Carlo Tree Search (MCTS) in Go. First, during the in-tree phase of a simulation game, the parameters are randomized in selected ranges before each simulation move. Second, during the play-out phase, the priority orders of the simulation move-generators are hierarchically randomized before each play-out move. Essential domain knowledge used in MCTS for Go is discussed. Both dynamic randomization techniques increase diversity while keeping the sanity of the simulation games. Experimental testing has been completely re-conducted more extensively with the latest version of GoIntellect (GI) on all three Go categories of 19 x 19, 13 x 13, and 9 x 9 boards. The results show that dynamic randomization increases the playing strength of GI significantly with 128K simulations per move, the improvement is about seven percentage points in the winning rate against GnuGo on 19 x 19 Go over the version of Cl without dynamic randomization, about three percentage points on 13 x 13 Go, and four percentage points on 9 x 9 Go. (c) 2011 Elsevier B.V. All rights reserved.

引用

页码：21 / 25

页数：5

共 26 条

[1] [Anonymous], 2007, P 24 INT C MACH LEAR
[2] [Anonymous], 2006, 6062 INRIA
[3] [Anonymous], THESIS U WISCONSIN
[4] Finite-time analysis of the multiarmed bandit problem
Auer, P
Cesa-Bianchi, N
Fischer, P
[J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
[5] Associating domain-dependent knowledge and Monte Carlo approaches within a Go program
Bouzy, B
[J]. INFORMATION SCIENCES, 2005, 175 (04) : 247 - 257
[6] Computer go: An AI oriented survey
Bouzy, B
Cazenave, T
[J]. ARTIFICIAL INTELLIGENCE, 2001, 132 (01) : 39 - 103
[7] Adding Expert Knowledge and Exploration in Monte-Carlo Tree Search
Chaslot, Guillaume
Fiter, Christophe
Hoock, Jean-Baptiste
Rimmel, Arpad
Teytaud, Olivier
[J]. ADVANCES IN COMPUTER GAMES, 2010, 6048 : 1 - +
[8] Chaslot GMJB, 2008, ICGA J, V31, P145
[9] PROGRESSIVE STRATEGIES FOR MONTE-CARLO TREE SEARCH
Chaslot, Guillaume M. J-B.
Winands, Mark H. M.
Van den Herik, H. Jaap
Uiterwijk, Jos W. H. M.
Bouzy, Bruno
[J]. NEW MATHEMATICS AND NATURAL COMPUTATION, 2008, 4 (03) : 343 - 357
[10] Static analysis of life and death in the game of Go
Chen, K
Chen, ZX
[J]. INFORMATION SCIENCES, 1999, 121 (1-2) : 113 - 134

← 1 2 3 →