EvoMCTS: A Scalable Approach for General Game Learning

被引:4
作者
Benbassat, Amit [1 ]
Sipper, Moshe [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
基金
以色列科学基金会;
关键词
Board games; genetic programming; Monte Carlo methods; search; CARLO TREE-SEARCH; NEURAL-NETWORKS; CHECKERS;
D O I
10.1109/TCIAIG.2014.2306914
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present the application of genetic programming as a generic game learning approach to zero-sum, deterministic, full-knowledge board games by evolving board-state evaluation functions to be used in conjunction with Monte Carlo tree search (MCTS). Our method involves evolving board-evaluation functions that are then used to guide the MCTS playout strategy. We examine several variants of Reversi, Dodgem, and Hex using strongly typed genetic programming, explicitly defined introns, and a selective directional crossover method. Our results show a proficiency that surpasses that of baseline handcrafted players using equal and in some cases greater amounts of search, with little domain knowledge and no expert domain knowledge. Moreover, our results exhibit scalability.
引用
收藏
页码:382 / 394
页数:13
相关论文
共 51 条
  • [1] Alhejali A. M., 2013, P IEEE C COMP INT GA, P57
  • [2] [Anonymous], 2006, 6062 INRIA
  • [3] [Anonymous], 2013, Computational Intelligence in Games (CIG), 2013 IEEE Conference on
  • [4] [Anonymous], P IEEE C COMP INT GA
  • [5] [Anonymous], 2003, Genetic programming IV: routine human-competitive machine intelligence
  • [6] [Anonymous], 2008, 23 NAT C ARTIF INTEL
  • [7] A hierarchical approach to computer Hex
    Anshelevich, VV
    [J]. ARTIFICIAL INTELLIGENCE, 2002, 134 (1-2) : 101 - 120
  • [8] Monte Carlo Tree Search in Hex
    Arneson, Broderick
    Hayward, Ryan B.
    Henderson, Philip
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2010, 2 (04) : 251 - 258
  • [9] MOHEX WINS HEX TOURNAMENT
    Arneson, Broderick
    Hayward, Ryan
    Henderson, Philip
    [J]. ICGA JOURNAL, 2009, 32 (02) : 114 - 116
  • [10] Ashlock W, 2005, IEEE C EVOL COMPUTAT, P1172