Exploring a Learning Architecture for General Game Playing

被引：1

作者：

Gunawan, Alvaro ^{[1
]}

Ruan, Ji ^{[1
]}

Thielscher, Michael ^{[2
]}

Narayanan, Ajit ^{[1
]}

机构：

[1] Auckland Univ Technol, Auckland, New Zealand

[2] Univ New South Wales, Sydney, Australia

来源：

AI 2020: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 12576卷

关键词：

General Game Playing; Machine learning; Reinforcement learning; Neural networks; GO;

D O I：

10.1007/978-3-030-64984-5_23

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

General Game Playing (GGP) is a platform for developing general Artificial Intelligence algorithms to play a large variety of games that are unknown to players in advance. This paper describes and analyses GGPZero, a learning architecture for GGP, inspired by the success of AlphaGo and AlphaZero. GGPZero takes as input a previously unknown game description and constructs a deep neural network to be trained using self-play together with Monte-Carlo Tree Search. The general architecture of GGPZero is similar to that of Goldwaser and Thielscher (2020) [4] with the main differences in the choice of the GGP reasoner and the neural network construction; furthermore, we explore additional experimental evaluation strategies. Our main contributions are: confirming the feasibility of deep reinforcement for GGP, analysing the impact of the type and depth of the underlying neural network, and investigating simulation vs. time limitations on training.

引用

页码：294 / 306

页数：13

共 50 条

[21] General game playing with stochastic CSP
Koriche, Frederic
Lagrue, Sylvain
Piette, Eric
Tabary, Sebastien
CONSTRAINTS, 2016, 21 (01) : 95 - 114
[22] Gamer, a General Game Playing Agent
Kissmann, Peter
Edelkamp, Stefan
KUNSTLICHE INTELLIGENZ, 2011, 25 (01): : 49 - 52
[23] General game playing with stochastic CSP
Frédéric Koriche
Sylvain Lagrue
Éric Piette
Sébastien Tabary
Constraints, 2016, 21 : 95 - 114
[24] General Game Playing with Stochastic CSP
Koriche, Frederic
Lagrue, Sylvain
Piette, Eric
Tabary, Sebastien
PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, CP 2015, 2015, 9255 : 726 - 727
[25] Neuroevolution for General Video Game Playing
Samothrakis, Spyridon
Perez-Liebana, Diego
Lucas, Simon M.
Fasli, Maria
2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2015, : 200 - 207
[26] General Game Playing with Imperfect Information
Schofield, Michael
Thielscher, Michael
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 66 : 901 - 935
[27] Towards General Cooperative Game Playing
Marinheiro, Joao
Cardoso, Henrique Lopes
TRANSACTIONS ON COMPUTATIONAL COLLECTIVE INTELLIGENCE XXVIII, 2018, 10780 : 164 - 192
[28] Special Issue on General Game Playing
Thielscher, Michael
KUNSTLICHE INTELLIGENZ, 2011, 25 (01): : 5 - 7
[29] Coevolving strategies for general game playing
Reisinger, Joseph
Bahceci, Erkin
Karpov, Igor
Miikkulainen, Risto
2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND GAMES, 2007, : 320 - 327
[30] The International General Game Playing Competition
Genesereth, Michael
Bjoernsson, Yngvi
AI MAGAZINE, 2013, 34 (02) : 107 - 111

← 1 2 3 4 5 →