Exploring a Learning Architecture for General Game Playing

被引:1
|
作者
Gunawan, Alvaro [1 ]
Ruan, Ji [1 ]
Thielscher, Michael [2 ]
Narayanan, Ajit [1 ]
机构
[1] Auckland Univ Technol, Auckland, New Zealand
[2] Univ New South Wales, Sydney, Australia
来源
AI 2020: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 12576卷
关键词
General Game Playing; Machine learning; Reinforcement learning; Neural networks; GO;
D O I
10.1007/978-3-030-64984-5_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
General Game Playing (GGP) is a platform for developing general Artificial Intelligence algorithms to play a large variety of games that are unknown to players in advance. This paper describes and analyses GGPZero, a learning architecture for GGP, inspired by the success of AlphaGo and AlphaZero. GGPZero takes as input a previously unknown game description and constructs a deep neural network to be trained using self-play together with Monte-Carlo Tree Search. The general architecture of GGPZero is similar to that of Goldwaser and Thielscher (2020) [4] with the main differences in the choice of the GGP reasoner and the neural network construction; furthermore, we explore additional experimental evaluation strategies. Our main contributions are: confirming the feasibility of deep reinforcement for GGP, analysing the impact of the type and depth of the underlying neural network, and investigating simulation vs. time limitations on training.
引用
收藏
页码:294 / 306
页数:13
相关论文
共 50 条
  • [21] General game playing with stochastic CSP
    Koriche, Frederic
    Lagrue, Sylvain
    Piette, Eric
    Tabary, Sebastien
    CONSTRAINTS, 2016, 21 (01) : 95 - 114
  • [22] Gamer, a General Game Playing Agent
    Kissmann, Peter
    Edelkamp, Stefan
    KUNSTLICHE INTELLIGENZ, 2011, 25 (01): : 49 - 52
  • [23] General game playing with stochastic CSP
    Frédéric Koriche
    Sylvain Lagrue
    Éric Piette
    Sébastien Tabary
    Constraints, 2016, 21 : 95 - 114
  • [24] General Game Playing with Stochastic CSP
    Koriche, Frederic
    Lagrue, Sylvain
    Piette, Eric
    Tabary, Sebastien
    PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, CP 2015, 2015, 9255 : 726 - 727
  • [25] Neuroevolution for General Video Game Playing
    Samothrakis, Spyridon
    Perez-Liebana, Diego
    Lucas, Simon M.
    Fasli, Maria
    2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2015, : 200 - 207
  • [26] General Game Playing with Imperfect Information
    Schofield, Michael
    Thielscher, Michael
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 66 : 901 - 935
  • [27] Towards General Cooperative Game Playing
    Marinheiro, Joao
    Cardoso, Henrique Lopes
    TRANSACTIONS ON COMPUTATIONAL COLLECTIVE INTELLIGENCE XXVIII, 2018, 10780 : 164 - 192
  • [28] Special Issue on General Game Playing
    Thielscher, Michael
    KUNSTLICHE INTELLIGENZ, 2011, 25 (01): : 5 - 7
  • [29] Coevolving strategies for general game playing
    Reisinger, Joseph
    Bahceci, Erkin
    Karpov, Igor
    Miikkulainen, Risto
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND GAMES, 2007, : 320 - 327
  • [30] The International General Game Playing Competition
    Genesereth, Michael
    Bjoernsson, Yngvi
    AI MAGAZINE, 2013, 34 (02) : 107 - 111