Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning

被引：1

作者：

Kanagawa, Yuji ^{[1
]}

Kaneko, Tomoyuki ^{[2
]}

机构：

[1] Univ Tokyo, Grad Sch Arts & Sci, Tokyo, Japan

[2] Univ Tokyo, Interfac Initiat Informat Studies, Tokyo, Japan

来源：

2019 IEEE CONFERENCE ON GAMES (COG) | 2019年

关键词：

roguelike games; reinforcement learning; generalization; domain adaptation; neural networks; ENVIRONMENT;

D O I：

10.1109/cig.2019.8848075

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose Rogue-Gym, a simple and classic style roguelike game built for evaluating generalization in reinforcement learning (RL). Combined with the recent progress of deep neural networks, RL has successfully trained human-level agents without human knowledge in many games such as those for Atari 2600. However, it has been pointed out that agents trained with RL methods often overfit the training environment, and they work poorly in slightly different environments. To investigate this problem, some research environments with procedural content generation have been proposed. Following these studies, we propose the use of roguelikes as a benchmark for evaluating the generalization ability of RL agents. In our Rogue-Gym, agents need to explore dungeons that are structured differently each time they start a new game. Thanks to the very diverse structures of the dungeons, we believe that the generalization benchmark of Rogue-Gym is sufficiently fair. In our experiments, we evaluate a standard reinforcement learning method, PPO, with and without enhancements for generalization. The results show that some enhancements believed to be effective fail to mitigate the overfitting in Rogue-Gym, although others slightly improve the generalization ability.

引用

页数：8

共 50 条

[31] A reinforcement learning method for collaborative generalization of soundings and depth contours
Song, Zikang
Jia, Shuaidong
Liang, Zhicheng
Zhang, Lihua
Liang, Chuan
Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2024, 53 (07): : 1345 - 1354
[32] Visual Grounding for Object-Level Generalization in Reinforcement Learning
Jiang, Haobin
Lu, Zongqing
COMPUTER VISION - ECCV 2024, PT XXX, 2025, 15088 : 55 - 72
[33] Improving Policy Generalization for Teacher-Student Reinforcement Learning
Xudong, Gong
Hongda, Jia
Xing, Zhou
Dawei, Feng
Bo, Ding
Jie, Xu
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT II, 2020, 12275 : 39 - 47
[34] Generalization Enhancement of Visual Reinforcement Learning through Internal States
Yang, Hanlin
Zhu, William
Zhu, Xianchao
SENSORS, 2024, 24 (14)
[35] Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization
Wen, Zheng
Van Roy, Benjamin
MATHEMATICS OF OPERATIONS RESEARCH, 2017, 42 (03) : 762 - 782
[36] Chiplet-Gym: Optimizing Chiplet-Based AI Accelerator Design With Reinforcement Learning
Mishty, Kaniz
Sadi, Mehdi
IEEE TRANSACTIONS ON COMPUTERS, 2025, 74 (01) : 43 - 56
[37] Mbt-gym: Reinforcement learning for model-based limit order book trading
Jerome, Joseph
Sanchez-Betancourt, Leandro
Savani, Rahul
Herdegen, Martin
PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, ICAIF 2023, 2023, : 619 - 627
[38] Safe-Control-Gym: A Unified Benchmark Suite for Safe Learning-Based Control and Reinforcement Learning in Robotics
Yuan, Zhaocong
Hall, Adam W.
Zhou, Siqi
Brunke, Lukas
Greeff, Melissa
Panerati, Jacopo
Schoellig, Angela P.
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 11142 - 11149
[39] A New Solution to the PID18 Challenge: Reinforcement-Learning-based PI Control
Wu, Yuting
Xing, Lantao
Liu, Xiao-Kang
Guo, Fanghong
2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 5755 - 5760
[40] Cooperative Multi-Agent Jamming of Multiple Rogue Drones Using Reinforcement Learning
Valianti, Panayiota
Malialis, Kleanthis
Kolios, Panayiotis
Ellinas, Georgios
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 12345 - 12359

← 1 2 3 4 5 →