Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function

被引:0
作者
Ruman, Marko [1 ]
Guy, Tatiana V. [1 ,2 ]
机构
[1] Czech Acad Sci, Inst Informat Theory & Automat, Dept Adapt Syst, Prague 18200, Czech Republic
[2] Czech Univ Life Sci, Fac Econ & Management, Dept Informat Engn, Prague 16500, Czech Republic
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Training; Decision making; Games; Network architecture; Generative adversarial networks; Deep reinforcement learning; Knowledge transfer; Standards; Deep learning; Markov decision process; reinforcement learning; transfer learning; knowledge transfer;
D O I
10.1109/ACCESS.2024.3497589
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning has demonstrated superhuman performance in complex decision-making tasks, but it struggles with generalization and knowledge reuse-key aspects of true intelligence. This article introduces a novel approach that modifies Cycle Generative Adversarial Networks specifically for reinforcement learning, enabling effective one-to-one knowledge transfer between two tasks. Our method enhances the loss function with two new components: model loss, which captures dynamic relationships between source and target tasks, and Q-loss, which identifies states significantly influencing the target decision policy. Tested on the 2-D Atari game Pong, our method achieved 100% knowledge transfer in identical tasks and either 100% knowledge transfer or a 30% reduction in training time for a rotated task, depending on the network architecture. In contrast, using standard Generative Adversarial Networks or Cycle Generative Adversarial Networks led to worse performance than training from scratch in the majority of cases. The results demonstrate that the proposed method ensured enhanced knowledge generalization in deep reinforcement learning.
引用
收藏
页码:177204 / 177218
页数:15
相关论文
共 50 条
  • [21] Enhancement of Image Classification Using Transfer Learning and GAN-Based Synthetic Data Augmentation
    Chatterjee, Subhajit
    Hazra, Debapriya
    Byun, Yung-Cheol
    Kim, Yong-Woon
    MATHEMATICS, 2022, 10 (09)
  • [22] Knowledge Reasoning Method Based on Deep Transfer Reinforcement Learning: DTRLpath
    Lin, Shiming
    Ye, Ling
    Zhuang, Yijie
    Lu, Lingyun
    Zheng, Shaoqiu
    Huang, Chenxi
    Kwee, Ng Yin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 299 - 317
  • [23] Intelligent Anti-Jamming Based on Deep Reinforcement Learning and Transfer Learning
    Janiar, Siavash Barqi
    Wang, Ping
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 8825 - 8834
  • [24] MPR-RL: Multi-Prior Regularized Reinforcement Learning for Knowledge Transfer
    Yang, Quantao
    Stork, Johannes A.
    Stoyanov, Todor
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7652 - 7659
  • [25] Bandwidth Improvement for Patch Antenna via Knowledge-Based Deep Reinforcement Learning
    Su, Yue
    Yin, Yifan
    Li, Shunli
    Zhao, Hongxin
    Yin, Xiaoxing
    IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2024, 23 (12): : 4094 - 4098
  • [26] Towards Post-disaster Damage Assessment using Deep Transfer Learning and GAN-based Data Augmentation
    Banerjee, Sourasekhar
    Patel, Yashwant Singh
    Kumar, Pushkar
    Bhuyan, Monowar
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, ICDCN 2023, 2023, : 372 - 377
  • [27] Reusing Source Task Knowledge via Transfer Approximator in Reinforcement Transfer Learning
    Cheng, Qiao
    Wang, Xiangke
    Niu, Yifeng
    Shen, Lincheng
    SYMMETRY-BASEL, 2019, 11 (01):
  • [28] An Online Search Method for Representative Risky Fault Chains Based on Reinforcement Learning and Knowledge Transfer
    Zhang, Zhimei
    Yao, Rui
    Huang, Shaowei
    Chen, Ying
    Mei, Shengwei
    Sun, Kai
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (03) : 1856 - 1867
  • [29] Deep Reinforcement Learning with Knowledge Transfer for Online Rides Order Dispatching
    Wang, Zhaodong
    Qin, Zhiwei
    Tang, Xiaocheng
    Ye, Jieping
    Zhu, Hongtu
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 617 - 626
  • [30] RL-Routing: An SDN Routing Algorithm Based on Deep Reinforcement Learning
    Chen, Yi-Ren
    Rezapour, Amir
    Tzeng, Wen-Guey
    Tsai, Shi-Chun
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04): : 3185 - 3199