Exploiting semantic segmentation to boost reinforcement learning in video game environments

被引：1

作者：

Montalvo, Javier ^{[1
]}

Garcia-Martin, Alvaro ^{[1
]}

Bescos, Jesus ^{[1
]}

机构：

[1] Univ Autonoma Madrid, VPULab, Ciudad Univ Cantoblanco, E-28049 Madrid, Spain

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 82卷 / 07期

关键词：

Semantic segmentation; Reinforcement learning; Domain adaptation; Synthetic data;

D O I：

10.1007/s11042-022-13695-1

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this work we explore enhancing performance of reinforcement learning algorithms in video game environments by feeding it better, more relevant data. For this purpose, we use semantic segmentation to transform the images that would be used as input for the reinforcement learning algorithm from their original domain to a simplified semantic domain with just silhouettes and class labels instead of textures and colors, and then we train the reinforcement learning algorithm with these simplified images. We have conducted different experiments to study multiple aspects: feasibility of our proposal, and potential benefits to model generalization and transfer learning. Experiments have been performed with the Super Mario Bros video game as the testing environment. Our results show multiple advantages for this method. First, it proves that using semantic segmentation enables reaching higher performance than the baseline reinforcement learning algorithm without modifying the actual algorithm, and in fewer episodes; second, it shows noticeable performance improvements when training on multiple levels at the same time; and finally, it allows to apply transfer learning for models trained on visually different environments. We conclude that using semantic segmentation can certainly help reinforcement learning algorithms that work with visual data, by refining it. Our results also suggest that other computer vision techniques may also be beneficial for data prepossessing. Models and code will be available on github upon acceptance.

引用

页码：10961 / 10979

页数：19

共 50 条

[41] Video Semantic Segmentation leveraging Dense Optical Flow
Lup, Vasile
Nedevschi, Sergiu
2020 IEEE 16TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2020), 2020, : 369 - 376
[42] Video Semantic Segmentation via Sparse Temporal Transformer
Li, Jiangtong
Wang, Wentao
Chen, Junjie
Niu, Li
Si, Jianlou
Qian, Chen
Zhang, Liqing
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 59 - 68
[43] FASSVid: Fast and Accurate Semantic Segmentation for Video Sequences
Portillo-Portillo, Jose
Sanchez-Perez, Gabriel
Toscano-Medina, Linda K.
Hernandez-Suarez, Aldo
Olivares-Mercado, Jesus
Perez-Meana, Hector
Velarde-Alvarado, Pablo
Sandoval Orozco, Ana Lucila
Garcia Villalba, Luis Javier
ENTROPY, 2022, 24 (07)
[44] Reactive Reinforcement Learning in Asynchronous Environments
Travnik, Jaden B.
Mathewson, Kory W.
Sutton, Richard S.
Pilarski, Patrick M.
FRONTIERS IN ROBOTICS AND AI, 2018, 5
[45] SegTrans: Semantic Segmentation With Transfer Learning for MLS Point Clouds
Shen, Shuo
Xia, Yan
Eich, Andreas
Xu, Yusheng
Yang, Bisheng
Stilla, Uwe
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[46] A Reinforced Active Learning Algorithm for Semantic Segmentation in Complex Imaging
Usmani, Usman Ahmad
Watada, Junzo
Jaafar, Jafreezal
Aziz, Izzatdin Abdul
Roy, Arunava
IEEE ACCESS, 2021, 9 : 168415 - 168432
[47] Reinforcement Learning in Latent Heterogeneous Environments
Chen, Elynn Y.
Song, Rui
Jordan, Michael I.
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (548) : 3113 - 3126
[48] Review of Multimodal Environments for Reinforcement Learning
Z. A. Volovikova
M. P. Kuznetsova
A. A. Skrynnik
A. I. Panov
Doklady Mathematics, 2024, 110 (Suppl 1) : S110 - S116
[49] Contrastive Learning-Based Domain Adaptation for Semantic Segmentation
Bhagwatkar, Rishika
Kemekar, Saurabh
Domatoti, Vinay
Khan, Khursheed Munir
Singh, Anamika
2022 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2022, : 239 - 244
[50] Learning With Style: Continual Semantic Segmentation Across Tasks and Domains
Toldo, Marco
Michieli, Umberto
Zanuttigh, Pietro
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (11) : 7434 - 7450

← 1 2 3 4 5 →