Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function

Cited by: 0

Authors:

Ruman, Marko [1]

Guy, Tatiana V. [1,2]

Affiliations:

[1] Czech Acad Sci, Inst Informat Theory & Automat, Dept Adapt Syst, Prague 18200, Czech Republic

[2] Czech Univ Life Sci, Fac Econ & Management, Dept Informat Engn, Prague 16500, Czech Republic

Source:

IEEE ACCESS | 2024, Vol. 12

Keywords:

Training; Decision making; Games; Network architecture; Generative adversarial networks; Deep reinforcement learning; Knowledge transfer; Standards; Deep learning; Markov decision process; Reinforcement learning; Transfer learning

DOI:

10.1109/ACCESS.2024.3497589

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Deep reinforcement learning has demonstrated superhuman performance in complex decision-making tasks, but it struggles with generalization and knowledge reuse, both key aspects of true intelligence. This article introduces a novel approach that modifies Cycle Generative Adversarial Networks specifically for reinforcement learning, enabling effective one-to-one knowledge transfer between two tasks. The method extends the loss function with two new components: a model loss, which captures dynamic relationships between the source and target tasks, and a Q-loss, which identifies states that significantly influence the target decision policy. Tested on the 2-D Atari game Pong, the method achieved 100% knowledge transfer in identical tasks and either 100% knowledge transfer or a 30% reduction in training time for a rotated task, depending on the network architecture. In contrast, standard Generative Adversarial Networks and Cycle Generative Adversarial Networks performed worse than training from scratch in the majority of cases. The results demonstrate that the proposed method improves knowledge generalization in deep reinforcement learning.
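The abstract names the loss components but gives no formulas. As a rough illustration only, the Python sketch below shows one way a cycle-consistency loss could be combined with a model loss and a Q-loss. Every function name, the L1 distance, the exact form of each term, and the weights `w_cyc`, `w_model`, and `w_q` are assumptions made for this sketch and are not taken from the article.

```python
from typing import Callable, Sequence

State = Sequence[float]
StateMap = Callable[[State], State]


def l1(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean absolute difference between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def cycle_loss(s_tgt: State, to_src: StateMap, to_tgt: StateMap) -> float:
    """Standard cycle consistency: target -> source -> target should return s_tgt."""
    return l1(to_tgt(to_src(s_tgt)), s_tgt)


def model_loss(s_tgt, action, s_tgt_next, to_src, src_model) -> float:
    """ASSUMED form: the source dynamics model, applied to the mapped state,
    should predict the mapped next state of the target transition."""
    return l1(src_model(to_src(s_tgt), action), to_src(s_tgt_next))


def q_loss(s_tgt, to_src, to_tgt, src_q) -> float:
    """ASSUMED form: source Q-values should be unchanged when the mapped
    state is cycled through the target task and back."""
    s_src = to_src(s_tgt)
    s_src_cycled = to_src(to_tgt(s_src))
    return l1(src_q(s_src), src_q(s_src_cycled))


def total_loss(adv, cyc, mod, q, w_cyc=10.0, w_model=1.0, w_q=1.0) -> float:
    """Weighted sum of the adversarial term and the three auxiliary terms.
    The weights are hypothetical placeholders."""
    return adv + w_cyc * cyc + w_model * mod + w_q * q


if __name__ == "__main__":
    # Toy setting: the target state is the source state with reversed order.
    reverse = lambda v: list(reversed(v))
    src_model = lambda s, a: [x + a for x in s]   # hypothetical source dynamics
    src_q = lambda s: [sum(s), -sum(s)]           # hypothetical source Q-function

    s, a = [1.0, 2.0, 3.0], 1.0
    s_next = reverse(src_model(reverse(s), a))    # target transition
    cyc = cycle_loss(s, reverse, reverse)
    mod = model_loss(s, a, s_next, reverse, src_model)
    q = q_loss(s, reverse, reverse, src_q)
    print(total_loss(0.5, cyc, mod, q))
```

In this toy setting the correct correspondence (reversal) drives all three auxiliary terms to zero, whereas an incorrect mapping such as the identity leaves a positive cycle loss. In a real setup the mappings would be generator networks trained by gradient descent, which this sketch does not attempt.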


Pages: 177204-177218

Page count: 15
