Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning

被引:1
|
作者
Kanagawa, Yuji [1 ]
Kaneko, Tomoyuki [2 ]
机构
[1] Univ Tokyo, Grad Sch Arts & Sci, Tokyo, Japan
[2] Univ Tokyo, Interfac Initiat Informat Studies, Tokyo, Japan
来源
2019 IEEE CONFERENCE ON GAMES (COG) | 2019年
关键词
roguelike games; reinforcement learning; generalization; domain adaptation; neural networks; ENVIRONMENT;
D O I
10.1109/cig.2019.8848075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose Rogue-Gym, a simple and classic style roguelike game built for evaluating generalization in reinforcement learning (RL). Combined with the recent progress of deep neural networks, RL has successfully trained human-level agents without human knowledge in many games such as those for Atari 2600. However, it has been pointed out that agents trained with RL methods often overfit the training environment, and they work poorly in slightly different environments. To investigate this problem, some research environments with procedural content generation have been proposed. Following these studies, we propose the use of roguelikes as a benchmark for evaluating the generalization ability of RL agents. In our Rogue-Gym, agents need to explore dungeons that are structured differently each time they start a new game. Thanks to the very diverse structures of the dungeons, we believe that the generalization benchmark of Rogue-Gym is sufficiently fair. In our experiments, we evaluate a standard reinforcement learning method, PPO, with and without enhancements for generalization. The results show that some enhancements believed to be effective fail to mitigate the overfitting in Rogue-Gym, although others slightly improve the generalization ability.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] A reinforcement learning method for collaborative generalization of soundings and depth contours
    Song, Zikang
    Jia, Shuaidong
    Liang, Zhicheng
    Zhang, Lihua
    Liang, Chuan
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2024, 53 (07): : 1345 - 1354
  • [32] Visual Grounding for Object-Level Generalization in Reinforcement Learning
    Jiang, Haobin
    Lu, Zongqing
    COMPUTER VISION - ECCV 2024, PT XXX, 2025, 15088 : 55 - 72
  • [33] Improving Policy Generalization for Teacher-Student Reinforcement Learning
    Xudong, Gong
    Hongda, Jia
    Xing, Zhou
    Dawei, Feng
    Bo, Ding
    Jie, Xu
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT II, 2020, 12275 : 39 - 47
  • [34] Generalization Enhancement of Visual Reinforcement Learning through Internal States
    Yang, Hanlin
    Zhu, William
    Zhu, Xianchao
    SENSORS, 2024, 24 (14)
  • [35] Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization
    Wen, Zheng
    Van Roy, Benjamin
    MATHEMATICS OF OPERATIONS RESEARCH, 2017, 42 (03) : 762 - 782
  • [36] Chiplet-Gym: Optimizing Chiplet-Based AI Accelerator Design With Reinforcement Learning
    Mishty, Kaniz
    Sadi, Mehdi
    IEEE TRANSACTIONS ON COMPUTERS, 2025, 74 (01) : 43 - 56
  • [37] Mbt-gym: Reinforcement learning for model-based limit order book trading
    Jerome, Joseph
    Sanchez-Betancourt, Leandro
    Savani, Rahul
    Herdegen, Martin
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, ICAIF 2023, 2023, : 619 - 627
  • [38] Safe-Control-Gym: A Unified Benchmark Suite for Safe Learning-Based Control and Reinforcement Learning in Robotics
    Yuan, Zhaocong
    Hall, Adam W.
    Zhou, Siqi
    Brunke, Lukas
    Greeff, Melissa
    Panerati, Jacopo
    Schoellig, Angela P.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 11142 - 11149
  • [39] A New Solution to the PID18 Challenge: Reinforcement-Learning-based PI Control
    Wu, Yuting
    Xing, Lantao
    Liu, Xiao-Kang
    Guo, Fanghong
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 5755 - 5760
  • [40] Cooperative Multi-Agent Jamming of Multiple Rogue Drones Using Reinforcement Learning
    Valianti, Panayiota
    Malialis, Kleanthis
    Kolios, Panayiotis
    Ellinas, Georgios
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 12345 - 12359