Dynamic randomization and domain knowledge in Monte-Carlo Tree Search for Go knowledge-based systems

被引:10
作者
Chen, Keh-Hsun [1 ]
机构
[1] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA
关键词
Monte-Carlo Tree Search; UCT algorithm; Simulation game; Domain knowledge; Go; Search parameters; Move generators; Dynamic randomization; COMPUTER GO; GAME;
D O I
10.1016/j.knosys.2011.08.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is an extension of the article 1131 presented at IWCG of TAA1 2010. It proposes two dynamic randomization techniques for Monte-Carlo Tree Search (MCTS) in Go. First, during the in-tree phase of a simulation game, the parameters are randomized in selected ranges before each simulation move. Second, during the play-out phase, the priority orders of the simulation move-generators are hierarchically randomized before each play-out move. Essential domain knowledge used in MCTS for Go is discussed. Both dynamic randomization techniques increase diversity while keeping the sanity of the simulation games. Experimental testing has been completely re-conducted more extensively with the latest version of GoIntellect (GI) on all three Go categories of 19 x 19, 13 x 13, and 9 x 9 boards. The results show that dynamic randomization increases the playing strength of GI significantly with 128K simulations per move, the improvement is about seven percentage points in the winning rate against GnuGo on 19 x 19 Go over the version of Cl without dynamic randomization, about three percentage points on 13 x 13 Go, and four percentage points on 9 x 9 Go. (c) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:21 / 25
页数:5
相关论文
共 26 条
  • [1] [Anonymous], 2007, P 24 INT C MACH LEAR
  • [2] [Anonymous], 2006, 6062 INRIA
  • [3] [Anonymous], THESIS U WISCONSIN
  • [4] Finite-time analysis of the multiarmed bandit problem
    Auer, P
    Cesa-Bianchi, N
    Fischer, P
    [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
  • [5] Associating domain-dependent knowledge and Monte Carlo approaches within a Go program
    Bouzy, B
    [J]. INFORMATION SCIENCES, 2005, 175 (04) : 247 - 257
  • [6] Computer go: An AI oriented survey
    Bouzy, B
    Cazenave, T
    [J]. ARTIFICIAL INTELLIGENCE, 2001, 132 (01) : 39 - 103
  • [7] Adding Expert Knowledge and Exploration in Monte-Carlo Tree Search
    Chaslot, Guillaume
    Fiter, Christophe
    Hoock, Jean-Baptiste
    Rimmel, Arpad
    Teytaud, Olivier
    [J]. ADVANCES IN COMPUTER GAMES, 2010, 6048 : 1 - +
  • [8] Chaslot GMJB, 2008, ICGA J, V31, P145
  • [9] PROGRESSIVE STRATEGIES FOR MONTE-CARLO TREE SEARCH
    Chaslot, Guillaume M. J-B.
    Winands, Mark H. M.
    Van den Herik, H. Jaap
    Uiterwijk, Jos W. H. M.
    Bouzy, Bruno
    [J]. NEW MATHEMATICS AND NATURAL COMPUTATION, 2008, 4 (03) : 343 - 357
  • [10] Static analysis of life and death in the game of Go
    Chen, K
    Chen, ZX
    [J]. INFORMATION SCIENCES, 1999, 121 (1-2) : 113 - 134