Exploiting semantic segmentation to boost reinforcement learning in video game environments

Cited by: 1
Authors
Montalvo, Javier [1]
Garcia-Martin, Alvaro [1]
Bescos, Jesus [1]
Affiliations
[1] Univ Autonoma Madrid, VPULab, Ciudad Univ Cantoblanco, E-28049 Madrid, Spain
Keywords
Semantic segmentation; Reinforcement learning; Domain adaptation; Synthetic data
DOI
10.1007/s11042-022-13695-1
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this work we explore enhancing the performance of reinforcement learning algorithms in video game environments by feeding them better, more relevant data. For this purpose, we use semantic segmentation to transform the images that would be used as input for the reinforcement learning algorithm from their original domain to a simplified semantic domain with just silhouettes and class labels instead of textures and colors, and then we train the reinforcement learning algorithm with these simplified images. We have conducted different experiments to study multiple aspects: the feasibility of our proposal and its potential benefits for model generalization and transfer learning. Experiments have been performed with the Super Mario Bros video game as the testing environment. Our results show multiple advantages for this method. First, using semantic segmentation enables reaching higher performance than the baseline reinforcement learning algorithm, in fewer episodes and without modifying the actual algorithm; second, it yields noticeable performance improvements when training on multiple levels at the same time; and finally, it allows applying transfer learning to models trained on visually different environments. We conclude that using semantic segmentation can certainly help reinforcement learning algorithms that work with visual data, by refining it. Our results also suggest that other computer vision techniques may be beneficial for data preprocessing. Models and code will be available on GitHub upon acceptance.
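The preprocessing idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the palette, the class names, and the functions `toy_segmenter` and `semantic_observation` are hypothetical, and a nearest-colour lookup stands in for the trained segmentation network the paper would use. The point shown is only that two visually different renderings of the same scene can collapse to the same class-label observation before the reinforcement learning agent sees them.

```python
from typing import Callable, List, Sequence, Tuple

Pixel = Tuple[int, int, int]
Frame = List[List[Pixel]]
LabelMap = List[List[int]]

# Hypothetical palette: one representative RGB colour per semantic class.
# These values are illustrative assumptions, not taken from the paper.
PALETTE: Sequence[Pixel] = (
    (92, 148, 252),  # class 0: background
    (200, 76, 12),   # class 1: ground / blocks
    (228, 52, 52),   # class 2: player
    (0, 168, 0),     # class 3: enemy / hazard
)


def toy_segmenter(frame: Frame) -> LabelMap:
    """Stand-in for a trained segmentation network: nearest-colour lookup."""

    def nearest_class(pixel: Pixel) -> int:
        # Squared Euclidean distance in RGB space to each palette colour.
        return min(
            range(len(PALETTE)),
            key=lambda k: sum((a - b) ** 2 for a, b in zip(pixel, PALETTE[k])),
        )

    return [[nearest_class(p) for p in row] for row in frame]


def semantic_observation(
    frame: Frame, segmenter: Callable[[Frame], LabelMap] = toy_segmenter
) -> LabelMap:
    """Replace textures and colours with class labels before the agent sees the frame."""
    return segmenter(frame)


# Two renderings of the same 2x2 scene with slightly different colours.
frame_a: Frame = [[(92, 148, 252), (228, 52, 52)], [(200, 76, 12), (200, 76, 12)]]
frame_b: Frame = [[(100, 140, 240), (220, 60, 60)], [(190, 80, 20), (210, 70, 10)]]

obs_a = semantic_observation(frame_a)
obs_b = semantic_observation(frame_b)
print(obs_a)           # [[0, 2], [1, 1]]
print(obs_a == obs_b)  # True
```

In a real pipeline this transformation would typically sit in an observation wrapper around the game environment, so the reinforcement learning algorithm itself stays unchanged, which matches the claim in the abstract.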
Pages: 10961-10979
Page count: 19
Related papers
50 items in total
  • [41] Video Semantic Segmentation leveraging Dense Optical Flow
    Lup, Vasile
    Nedevschi, Sergiu
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2020), 2020, : 369 - 376
  • [42] Video Semantic Segmentation via Sparse Temporal Transformer
    Li, Jiangtong
    Wang, Wentao
    Chen, Junjie
    Niu, Li
    Si, Jianlou
    Qian, Chen
    Zhang, Liqing
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 59 - 68
  • [43] FASSVid: Fast and Accurate Semantic Segmentation for Video Sequences
    Portillo-Portillo, Jose
    Sanchez-Perez, Gabriel
    Toscano-Medina, Linda K.
    Hernandez-Suarez, Aldo
    Olivares-Mercado, Jesus
    Perez-Meana, Hector
    Velarde-Alvarado, Pablo
    Sandoval Orozco, Ana Lucila
    Garcia Villalba, Luis Javier
    ENTROPY, 2022, 24 (07)
  • [44] Reactive Reinforcement Learning in Asynchronous Environments
    Travnik, Jaden B.
    Mathewson, Kory W.
    Sutton, Richard S.
    Pilarski, Patrick M.
    FRONTIERS IN ROBOTICS AND AI, 2018, 5
  • [45] SegTrans: Semantic Segmentation With Transfer Learning for MLS Point Clouds
    Shen, Shuo
    Xia, Yan
    Eich, Andreas
    Xu, Yusheng
    Yang, Bisheng
    Stilla, Uwe
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [46] A Reinforced Active Learning Algorithm for Semantic Segmentation in Complex Imaging
    Usmani, Usman Ahmad
    Watada, Junzo
    Jaafar, Jafreezal
    Aziz, Izzatdin Abdul
    Roy, Arunava
    IEEE ACCESS, 2021, 9 : 168415 - 168432
  • [47] Reinforcement Learning in Latent Heterogeneous Environments
    Chen, Elynn Y.
    Song, Rui
    Jordan, Michael I.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (548) : 3113 - 3126
  • [48] Review of Multimodal Environments for Reinforcement Learning
    Volovikova, Z. A.
    Kuznetsova, M. P.
    Skrynnik, A. A.
    Panov, A. I.
    DOKLADY MATHEMATICS, 2024, 110 (Suppl 1) : S110 - S116
  • [49] Contrastive Learning-Based Domain Adaptation for Semantic Segmentation
    Bhagwatkar, Rishika
    Kemekar, Saurabh
    Domatoti, Vinay
    Khan, Khursheed Munir
    Singh, Anamika
    2022 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2022, : 239 - 244
  • [50] Learning With Style: Continual Semantic Segmentation Across Tasks and Domains
    Toldo, Marco
    Michieli, Umberto
    Zanuttigh, Pietro
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (11) : 7434 - 7450