Benchmarking End-to-End Behavioural Cloning on Video Games

被引：0

作者：

Kanervisto, Anssi ^{[1
]}

Pussinen, Joonas ^{[1
]}

Hautamaki, Ville ^{[1
]}

机构：

[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland

来源：

2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020) | 2020年

基金：

芬兰科学院;

关键词：

video game; behavioral cloning; imitation learning; reinforcement learning; learning environment; neural networks; LEVEL;

D O I：

10.1109/cog47356.2020.9231600

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Behavioural cloning, where a computer is taught to perform a task based on demonstrations, has been successfully applied to various video games and robotics tasks, with and without reinforcement learning. This also includes end-to-end approaches, where a computer plays a video game like humans do: by looking at the image displayed on the screen, and sending keystrokes to the game. As a general approach to playing video games, this has many inviting properties: no need for specialized modifications to the game, no lengthy training sessions and the ability to re-use the same tools across different games. However, related work includes game-specific engineering to achieve the results. We take a step towards a general approach and study the general applicability of behavioural cloning on twelve video games, including six modern video games (published after 2010), by using human demonstrations as training data. Our results show that these agents cannot match humans in raw performance but do learn basic dynamics and rules. We also demonstrate how the quality of the data matters, and how recording data from humans is subject to a state-action mismatch, due to human reflexes.

引用

页码：558 / 565

页数：8

共 34 条

[1] [Anonymous], 2017, ARXIV170510998
[2] [Anonymous], 2011, AISTATS
[3] The Arcade Learning Environment: An Evaluation Platform for General Agents
Bellemare, Marc G.
Naddaf, Yavar
Veness, Joel
Bowling, Michael
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 : 253 - 279
[4] Berner C., 2019, 191206680 ARXIV
[5] Bojarski Mariusz, 2016, arXiv
[6] Bontrager Philip., 2019, Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, V15, P10
[7] Chen Z., 2017, ARXIV170205663
[8] de Haan P, 2019, ADV NEUR IN, V32
[9] Delalleau O., 2019, DISCRETE CONTINUOUS
[10] Dulac-Arnold G., 2015, arXiv preprint arXiv:1512.07679, P2

← 1 2 3 4 →