Task Relabelling for Multi-task Transfer using Successor Features

Cited by: 0

Authors:

Balla, Martin [1]

Perez-Liebana, Diego [1]

Affiliations:

[1] Queen Mary Univ London, London, England

Source:

2022 IEEE CONFERENCE ON GAMES, COG | 2022

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC);

Keywords:

Reinforcement Learning; Successor Features; Multi-task Learning; Transfer Learning; REINFORCEMENT; LEVEL; GAME;

DOI:

10.1109/CoG51982.2022.9893550

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203; 0835;

Abstract:

Deep Reinforcement Learning has recently been very successful, with various works on complex domains. Most works are concerned with learning a single policy that solves the target task but is fixed, in the sense that if the environment changes, the agent is unable to adapt to it. Successor Features (SFs) provide a mechanism for learning policies that are not tied to any particular reward function. In this work we investigate how SFs may be pre-trained without observing any reward in a custom environment that features resource collection, traps and crafting. After pre-training, we expose the SF agents to various target tasks and evaluate how well they transfer to new tasks. Transfer is done without any further training of the SF agents, simply by providing a task vector. For training the SFs, we propose a task relabelling method which greatly improves the agent's performance.
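As a minimal illustration of transfer "just by providing a task vector", the sketch below uses the standard successor-feature decomposition, in which the action value is the dot product of the successor features and a task vector, Q(s, a) = ψ(s, a) · w. It is not the authors' implementation and does not cover the task relabelling method. The feature dimensions, the numbers in `PSI`, and the names `q_values`, `greedy_action`, `collect_task` and `craft_task` are all hypothetical, chosen only to echo the resource, trap and crafting setting described in the abstract.

```python
from typing import List, Sequence


def q_values(psi: Sequence[Sequence[float]], w: Sequence[float]) -> List[float]:
    """Return Q(s, a) = psi(s, a) . w for every action a in one state."""
    return [sum(p * wi for p, wi in zip(psi_a, w)) for psi_a in psi]


def greedy_action(psi: Sequence[Sequence[float]], w: Sequence[float]) -> int:
    """Pick the action with the highest Q-value under task vector w."""
    q = q_values(psi, w)
    return max(range(len(q)), key=q.__getitem__)


# Hypothetical successor features for a single state with three actions.
# Assumed feature order: [resource collected, trap hit, item crafted].
PSI = [
    [2.0, 0.5, 0.0],  # action 0: collects resources, some trap risk
    [0.5, 0.0, 1.0],  # action 1: heads towards crafting
    [0.0, 0.0, 0.0],  # action 2: does nothing useful
]

# Two hypothetical target tasks, expressed only as task vectors.
collect_task = [1.0, -1.0, 0.0]  # reward resources, penalise traps
craft_task = [0.0, -1.0, 2.0]    # reward crafting, penalise traps

if __name__ == "__main__":
    print(greedy_action(PSI, collect_task))
    print(greedy_action(PSI, craft_task))
```

The same `PSI` is reused for both tasks; only the task vector changes, which is why no further training is needed to switch behaviour in this toy setting.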

Pages: 353-360

Page count: 8