Task Relabelling for Multi-task Transfer using Successor Features

被引:0
|
作者
Balla, Martin [1 ]
Perez-Liebana, Diego [1 ]
机构
[1] Queen Mary Univ London, London, England
来源
2022 IEEE CONFERENCE ON GAMES, COG | 2022年
基金
英国工程与自然科学研究理事会;
关键词
Reinforcement Learning; Successor Features; Multi-task Learning; Transfer Learning; REINFORCEMENT; LEVEL; GAME;
D O I
10.1109/CoG51982.2022.9893550
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Deep Reinforcement Learning has been very successful recently with various works on complex domains. Most works are concerned with learning a single policy that solves the target task, but is fixed in the sense that if the environment changes the agent is unable to adapt to it. Successor Features (SFs) proposes a mechanism that allows learning policies that are not tied to any particular reward function. In this work we investigate how SFs may be pre-trained without observing any reward in a custom environment that features resource collection, traps and crafting. After pre-training we expose the SF agents to various target tasks and see how well they can transfer to new tasks. Transferring is done without any further training on the SF agents, instead just by providing a task vector. For training the SFs we propose a task relabelling method which greatly improves the agent's performance.
引用
收藏
页码:353 / 360
页数:8
相关论文
共 50 条
  • [1] Multi-task Transfer with Practice
    Pattnaik, Upasana
    Lee, Minwoo
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [2] MULTI-TASK DISTILLATION: TOWARDS MITIGATING THE NEGATIVE TRANSFER IN MULTI-TASK LEARNING
    Meng, Ze
    Yao, Xin
    Sun, Lifeng
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 389 - 393
  • [3] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [4] Multi-task gradient descent for multi-task learning
    Lu Bai
    Yew-Soon Ong
    Tiantian He
    Abhishek Gupta
    Memetic Computing, 2020, 12 : 355 - 369
  • [5] Multilingual multi-task quantum transfer learning
    Buonaiuto, Giuseppe
    Guarasci, Raffaele
    De Pietro, Giuseppe
    Esposito, Massimo
    QUANTUM MACHINE INTELLIGENCE, 2025, 7 (01)
  • [6] Multi-task clustering through instances transfer
    Zhang, Xiaotong
    Zhang, Xianchao
    Liu, Han
    Liu, Xinyue
    NEUROCOMPUTING, 2017, 251 : 145 - 155
  • [7] Automating Knowledge Transfer with Multi-Task Optimization
    Scott, Eric O.
    De Jong, Kenneth A.
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 2252 - 2259
  • [8] Multi-Task Learning Using Shared and Task Specific Information
    Srijith, P. K.
    Shevade, Shirish
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 125 - 132
  • [9] Task-conditioned adaptation of visual features in multi-task policy learning
    Marza, Pierre
    Matignon, Laetitia
    Simonin, Olivier
    Wolf, Christian
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 17847 - 17856
  • [10] Predicting Glaucoma Progression using Multi-task Learning with Heterogeneous Features
    Maya, Shigeru
    Morino, Kai
    Yamanishi, Kenji
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 261 - 270