Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning

被引:1
|
作者
Kanagawa, Yuji [1 ]
Kaneko, Tomoyuki [2 ]
机构
[1] Univ Tokyo, Grad Sch Arts & Sci, Tokyo, Japan
[2] Univ Tokyo, Interfac Initiat Informat Studies, Tokyo, Japan
来源
2019 IEEE CONFERENCE ON GAMES (COG) | 2019年
关键词
roguelike games; reinforcement learning; generalization; domain adaptation; neural networks; ENVIRONMENT;
D O I
10.1109/cig.2019.8848075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose Rogue-Gym, a simple and classic style roguelike game built for evaluating generalization in reinforcement learning (RL). Combined with the recent progress of deep neural networks, RL has successfully trained human-level agents without human knowledge in many games such as those for Atari 2600. However, it has been pointed out that agents trained with RL methods often overfit the training environment, and they work poorly in slightly different environments. To investigate this problem, some research environments with procedural content generation have been proposed. Following these studies, we propose the use of roguelikes as a benchmark for evaluating the generalization ability of RL agents. In our Rogue-Gym, agents need to explore dungeons that are structured differently each time they start a new game. Thanks to the very diverse structures of the dungeons, we believe that the generalization benchmark of Rogue-Gym is sufficiently fair. In our experiments, we evaluate a standard reinforcement learning method, PPO, with and without enhancements for generalization. The results show that some enhancements believed to be effective fail to mitigate the overfitting in Rogue-Gym, although others slightly improve the generalization ability.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] A new approach for supervised learning based influence value reinforcement learning
    Valdivia, Andre
    Herrera Quispe, Jose
    Barrios-Aranibar, Dennis
    2ND INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING (ICMLSC 2018), 2015, : 24 - 28
  • [42] Hybrid Reinforcement Learning and Uneven Generalization of Learning Space Method for Robot Obstacle Avoidance
    Li, Jianghao
    Bi, Weihong
    Li, Mingda
    PROCEEDINGS OF 2013 CHINESE INTELLIGENT AUTOMATION CONFERENCE: INTELLIGENT AUTOMATION & INTELLIGENT TECHNOLOGY AND SYSTEMS, 2013, 255 : 175 - 182
  • [43] Towards better generalization in quadrotor landing using deep reinforcement learning
    Jiawei Wang
    Teng Wang
    Zichen He
    Wenzhe Cai
    Changyin Sun
    Applied Intelligence, 2023, 53 : 6195 - 6213
  • [44] Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
    Ada, Suzan Ece
    Oztop, Erhan
    Ugur, Emre
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3116 - 3123
  • [45] Generalization of Reinforcement Learning through Artificial Potential Fields for agricultural UGVs
    Ricioppo, Petre
    Celestini, Davide
    Capello, Elisa
    PROCEEDINGS OF 2023 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR AGRICULTURE AND FORESTRY, METROAGRIFOR, 2023, : 386 - 391
  • [46] Towards better generalization in quadrotor landing using deep reinforcement learning
    Wang, Jiawei
    Wang, Teng
    He, Zichen
    Cai, Wenzhe
    Sun, Changyin
    APPLIED INTELLIGENCE, 2023, 53 (06) : 6195 - 6213
  • [47] Gym-ANM: Reinforcement learning environments for active network management tasks in electricity distribution systems
    Henry, Robin
    Ernst, Damien
    ENERGY AND AI, 2021, 5
  • [48] Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learning
    Nyberg, Jakob
    Johnson, Pontus
    2024 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2024, : 282 - 289
  • [49] Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T
    Colas, Jaron T.
    Dundon, Neil M.
    Gerraty, Raphael T.
    Saragosa-Harris, Natalie M.
    Szymula, Karol P.
    Tanwisuth, Koranis
    Tyszka, J. Michael
    van Geen, Camilla
    Ju, Harang
    Toga, Arthur W.
    Gold, Joshua, I
    Bassett, Dani S.
    Hartley, Catherine A.
    Shohamy, Daphna
    Grafton, Scott T.
    O'Doherty, John P.
    HUMAN BRAIN MAPPING, 2022, 43 (15) : 4750 - 4790
  • [50] Multi-band Environments for Optical Reinforcement Learning Gym for Resource Allocation in Elastic Optical Networks
    Morales, Patricia
    Franco, Patricia
    Lozada, Astrid
    Jara, Nicolas
    Calderon, Felipe
    Pinto-Rios, Juan
    Leiva, Ariel
    2021 INTERNATIONAL CONFERENCE ON OPTICAL NETWORK DESIGN AND MODELLING (ONDM), 2021,