Towards a Deep Reinforcement Learning Approach for Tower Line Wars

被引：7

作者：

Andersen, Per-Arne ^{[1
]}

Goodwin, Morten ^{[1
]}

Granmo, Ole-Christoffer ^{[1
]}

机构：

[1] Univ Agder, Grimstad, Norway

来源：

ARTIFICIAL INTELLIGENCE XXXIV, AI 2017 | 2017年 / 10630卷

关键词：

Reinforcement Learning; Q-Learning; Deep Learning; Game environment;

D O I：

10.1007/978-3-319-71078-5_8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

There have been numerous breakthroughs with reinforcement learning in the recent years, perhaps most notably on Deep Reinforcement Learning successfully playing and winning relatively advanced computer games. There is undoubtedly an anticipation that Deep Reinforcement Learning will play a major role when the first AI masters the complicated game plays needed to beat a professional Real-Time Strategy game player. For this to be possible, there needs to be a game environment that targets and fosters AI research, and specifically Deep Reinforcement Learning. Some game environments already exist, however, these are either overly simplistic such as Atari 2600 or complex such as Starcraft II from Blizzard Entertainment. We propose a game environment in between Atari 2600 and Starcraft II, particularly targeting Deep Reinforcement Learning algorithm research. The environment is a variant of Tower Line Wars from War-craft III, Blizzard Entertainment. Further, as a proof of concept that the environment can harbor Deep Reinforcement algorithms, we propose and apply a Deep Q-Reinforcement architecture. The architecture simplifies the state space so that it is applicable to Q-learning, and in turn improves performance compared to current state-of-the-art methods. Our experiments show that the proposed architecture can learn to play the environment well, and score 33% better than standard Deep Q-learning-which in turn proves the usefulness of the game environment.

引用

页码：101 / 114

页数：14

共 13 条

[1]

[Anonymous], 2015, CORR

[2]

[Anonymous], 2013, P ADV NEUR INF PROC

[3]

[Anonymous], 2017, HYBRID REWARD ARCHIT

[4]

[Anonymous], 2015, ARXIV150804582

[5]

[Anonymous], 2015, CORR

[6]

[Anonymous], 2015, ARXIV150907481

[7]

[Anonymous], 2016, CORR

[8]

Bellemare MarcG., 2012, CORR

[9] Reinforcement Learning: A Tutorial Survey and Recent Advances [J].

Gosavi, Abhijit .

INFORMS JOURNAL ON COMPUTING, 2009, 21 (02) :178-192

[10]

Sutton R., 1998, Introduction to reinforcement learning

← 1 2 →