Towards a Deep Reinforcement Learning Approach for Tower Line Wars

被引:7
作者
Andersen, Per-Arne [1 ]
Goodwin, Morten [1 ]
Granmo, Ole-Christoffer [1 ]
机构
[1] Univ Agder, Grimstad, Norway
来源
ARTIFICIAL INTELLIGENCE XXXIV, AI 2017 | 2017年 / 10630卷
关键词
Reinforcement Learning; Q-Learning; Deep Learning; Game environment;
D O I
10.1007/978-3-319-71078-5_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There have been numerous breakthroughs with reinforcement learning in the recent years, perhaps most notably on Deep Reinforcement Learning successfully playing and winning relatively advanced computer games. There is undoubtedly an anticipation that Deep Reinforcement Learning will play a major role when the first AI masters the complicated game plays needed to beat a professional Real-Time Strategy game player. For this to be possible, there needs to be a game environment that targets and fosters AI research, and specifically Deep Reinforcement Learning. Some game environments already exist, however, these are either overly simplistic such as Atari 2600 or complex such as Starcraft II from Blizzard Entertainment. We propose a game environment in between Atari 2600 and Starcraft II, particularly targeting Deep Reinforcement Learning algorithm research. The environment is a variant of Tower Line Wars from War-craft III, Blizzard Entertainment. Further, as a proof of concept that the environment can harbor Deep Reinforcement algorithms, we propose and apply a Deep Q-Reinforcement architecture. The architecture simplifies the state space so that it is applicable to Q-learning, and in turn improves performance compared to current state-of-the-art methods. Our experiments show that the proposed architecture can learn to play the environment well, and score 33% better than standard Deep Q-learning-which in turn proves the usefulness of the game environment.
引用
收藏
页码:101 / 114
页数:14
相关论文
共 13 条
[1]  
[Anonymous], 2015, CORR
[2]  
[Anonymous], 2013, P ADV NEUR INF PROC
[3]  
[Anonymous], 2017, HYBRID REWARD ARCHIT
[4]  
[Anonymous], 2015, ARXIV150804582
[5]  
[Anonymous], 2015, CORR
[6]  
[Anonymous], 2015, ARXIV150907481
[7]  
[Anonymous], 2016, CORR
[8]  
Bellemare MarcG., 2012, CORR
[9]   Reinforcement Learning: A Tutorial Survey and Recent Advances [J].
Gosavi, Abhijit .
INFORMS JOURNAL ON COMPUTING, 2009, 21 (02) :178-192
[10]  
Sutton R., 1998, Introduction to reinforcement learning