Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning

Cited by: 1

Authors:

Kanagawa, Yuji [1]

Kaneko, Tomoyuki [2]

Affiliations:

[1] University of Tokyo, Graduate School of Arts and Sciences, Tokyo, Japan

[2] University of Tokyo, Interfaculty Initiative in Information Studies, Tokyo, Japan

Source:

2019 IEEE CONFERENCE ON GAMES (COG) | 2019

Keywords:

roguelike games; reinforcement learning; generalization; domain adaptation; neural networks; ENVIRONMENT;

DOI:

10.1109/cig.2019.8848075

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we propose Rogue-Gym, a simple and classic style roguelike game built for evaluating generalization in reinforcement learning (RL). Combined with the recent progress of deep neural networks, RL has successfully trained human-level agents without human knowledge in many games such as those for Atari 2600. However, it has been pointed out that agents trained with RL methods often overfit the training environment, and they work poorly in slightly different environments. To investigate this problem, some research environments with procedural content generation have been proposed. Following these studies, we propose the use of roguelikes as a benchmark for evaluating the generalization ability of RL agents. In our Rogue-Gym, agents need to explore dungeons that are structured differently each time they start a new game. Thanks to the very diverse structures of the dungeons, we believe that the generalization benchmark of Rogue-Gym is sufficiently fair. In our experiments, we evaluate a standard reinforcement learning method, PPO, with and without enhancements for generalization. The results show that some enhancements believed to be effective fail to mitigate the overfitting in Rogue-Gym, although others slightly improve the generalization ability.

Pages: 8

