Self-Adaptation of Playing Strategies in General Game Playing

被引:45
作者
Swiechowski, Maciej [1 ]
Mandziuk, Jacek [2 ]
机构
[1] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
[2] Warsaw Univ Technol, Fac Math & Informat Sci, PL-00662 Warsaw, Poland
关键词
Game tree search; general game playing (GGP); Monte Carlo methods; statistical learning;
D O I
10.1109/TCIAIG.2013.2275163
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The termgeneral game playing (GGP) refers to a sub-field of AI which aims at developing agents able to effectively play many games from a particular class (finite, deterministic). It is also the name of the annual competition proposed by Stanford Logic Group at Stanford University (Stanford, CA, USA), which provides a framework for testing and evaluating GGP agents. In this paper, we present our GGP player which managed to win four out of seven games in the 2012 preliminary round and advanced to the final phase. Our system (named MINI-Player) relies on a pool of playing strategies and autonomously picks the ones which seem to be best suited to a given game. The chosen strategies are combined with one another and incorporated into the upper confidence bounds applied to trees (UCT) algorithm. The effectiveness of our player is evaluated on a set of games from the 2012 GGP Competition as well as a few other, single-player games. The paper discusses the efficacy of proposed playing strategies and evaluates the mechanism of their switching. The proposed idea of dynamically assigning search strategies during play is both novel and promising.
引用
收藏
页码:367 / 381
页数:15
相关论文
共 30 条
  • [1] [Anonymous], ICCA J
  • [2] [Anonymous], 1993, PHD DISSERTATION
  • [3] [Anonymous], 1950, London, DOI DOI 10.1080/14786445008521796
  • [4] [Anonymous], 2007, Proceedings of the AAAI National Conference on Artificial Intelligence
  • [5] CADIAPLAYER: A Simulation-Based General Game Player
    Bjornsson, Yngvi
    Finnsson, Hilmar
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2009, 1 (01) : 4 - 15
  • [6] Brugmann B., 1993, THESIS MAX PLANCK I
  • [7] Clune J., 2007, P 22 AAAI C ART INT, P994
  • [8] TOWARD AN IDEAL TRAINER
    EPSTEIN, SL
    [J]. MACHINE LEARNING, 1994, 15 (03) : 251 - 277
  • [9] Finnsson Hilmar., 2009, IJCAI WORKSHOP GEN G, P21
  • [10] Genesereth M, 2005, AI MAG, V26, P62