Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function

被引:0
作者
Ruman, Marko [1 ]
Guy, Tatiana V. [1 ,2 ]
机构
[1] Czech Acad Sci, Inst Informat Theory & Automat, Dept Adapt Syst, Prague 18200, Czech Republic
[2] Czech Univ Life Sci, Fac Econ & Management, Dept Informat Engn, Prague 16500, Czech Republic
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Training; Decision making; Games; Network architecture; Generative adversarial networks; Deep reinforcement learning; Knowledge transfer; Standards; Deep learning; Markov decision process; reinforcement learning; transfer learning; knowledge transfer;
D O I
10.1109/ACCESS.2024.3497589
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning has demonstrated superhuman performance in complex decision-making tasks, but it struggles with generalization and knowledge reuse-key aspects of true intelligence. This article introduces a novel approach that modifies Cycle Generative Adversarial Networks specifically for reinforcement learning, enabling effective one-to-one knowledge transfer between two tasks. Our method enhances the loss function with two new components: model loss, which captures dynamic relationships between source and target tasks, and Q-loss, which identifies states significantly influencing the target decision policy. Tested on the 2-D Atari game Pong, our method achieved 100% knowledge transfer in identical tasks and either 100% knowledge transfer or a 30% reduction in training time for a rotated task, depending on the network architecture. In contrast, using standard Generative Adversarial Networks or Cycle Generative Adversarial Networks led to worse performance than training from scratch in the majority of cases. The results demonstrate that the proposed method ensured enhanced knowledge generalization in deep reinforcement learning.
引用
收藏
页码:177204 / 177218
页数:15
相关论文
共 50 条
  • [1] Deep Multitask Multiagent Reinforcement Learning With Knowledge Transfer
    Mai, Yuxiang
    Zang, Yifan
    Yin, Qiyue
    Ni, Wancheng
    Huang, Kaiqi
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (03) : 566 - 576
  • [2] GRL: Knowledge graph completion with GAN-based reinforcement learning
    Wang, Qi
    Ji, Yuede
    Hao, Yongsheng
    Cao, Jie
    KNOWLEDGE-BASED SYSTEMS, 2020, 209
  • [3] GAN-Based Planning Model in Deep Reinforcement Learning
    Chen, Song
    Jiang, Junpeng
    Zhang, Xiaofang
    Wu, Jinjin
    Lu, Gongzheng
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 323 - 334
  • [4] GAN-based Intrinsic Exploration for Sample Efficient Reinforcement Learning
    Kamar, Dogay
    Ure, Nazim Kemal
    Unal, Gozde
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 264 - 272
  • [5] A lightweight GAN-based fault diagnosis method based on knowledge distillation and deep transfer learning
    Zhong, Hongyu
    Yu, Samson
    Trinh, Hieu
    Yuan, Rui
    Lv, Yong
    Wang, Yanan
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (03)
  • [6] Deep Reinforcement Learning Task Assignment Based on Domain Knowledge
    Liu, Jiayi
    Wang, Gang
    Guo, Xiangke
    Wang, Siyuan
    Fu, Qiang
    IEEE ACCESS, 2022, 10 : 114402 - 114413
  • [7] Navigation of Mobile Robots Based on Deep Reinforcement Learning: Reward Function Optimization and Knowledge Transfer
    Weijie Li
    Ming Yue
    Jinyong Shangguan
    Ye Jin
    International Journal of Control, Automation and Systems, 2023, 21 : 563 - 574
  • [8] Navigation of Mobile Robots Based on Deep Reinforcement Learning: Reward Function Optimization and Knowledge Transfer
    Li, Weijie
    Yue, Ming
    Shangguan, Jinyong
    Jin, Ye
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (02) : 563 - 574
  • [9] Efficient deep reinforcement learning under task variations via knowledge transfer for drone control
    Jang, Sooyoung
    Kim, Hyung-Il
    ICT EXPRESS, 2024, 10 (03): : 576 - 582
  • [10] GAN-based Deep Distributional Reinforcement Learning for Resource Management in Network Slicing
    Hua, Yuxiu
    Li, Rongpeng
    Zhao, Zhifeng
    Zhang, Honggang
    Chen, Xianfu
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,