Unsupervised Task Clustering for Multi-task Reinforcement Learning

被引:3
|
作者
Ackermann, Johannes [1 ]
Richter, Oliver [2 ]
Wattenhofer, Roger [2 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] Swiss Fed Inst Technol, Zurich, Switzerland
关键词
MIXTURES;
D O I
10.1007/978-3-030-86486-6_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-learning, transfer learning and multi-task learning have recently laid a path towards more generally applicable reinforcement learning agents that are not limited to a single task. However, most existing approaches implicitly assume a uniform similarity between tasks. We argue that this assumption is limiting in settings where the relationship between tasks is unknown a-priori. In this work, we propose a general approach to automatically cluster together similar tasks during training. Our method, inspired by the expectation-maximization algorithm, succeeds at finding clusters of related tasks and uses these to improve sample complexity. We achieve this by designing an agent with multiple policies. In the expectation step, we evaluate the performance of the policies on all tasks and assign each task to the best performing policy. In the maximization step, each policy trains by sampling tasks from its assigned set. This method is intuitive, simple to implement and orthogonal to other multi-task learning algorithms. We show the generality of our approach by evaluating on simple discrete and continuous control tasks, as well as complex bipedal walker tasks and Atari games. Results show improvements in sample complexity as well as a more general applicability when compared to other approaches.
引用
收藏
页码:222 / 237
页数:16
相关论文
共 50 条
  • [41] Local Learning-based Multi-task Clustering
    Zhong, Guo
    Pun, Chi-Man
    KNOWLEDGE-BASED SYSTEMS, 2022, 255
  • [42] UNSUPERVISED MULTI-TASK LEARNING FOR 3D SUBTOMOGRAM IMAGE ALIGNMENT, CLUSTERING AND SEGMENTATION
    Zhu, Haoyi
    Wang, Chuting
    Wang, Yuanxin
    Fan, Zhaoxin
    Uddin, Mostofa Rafid
    Gao, Xin
    Zhang, Jing
    Zeng, Xiangrui
    Xu, Min
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2751 - 2755
  • [43] Unsupervised Multi-Task Domain Adaptation
    Yang, Shih-Min
    Yeh, Mei-Chen
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1679 - 1685
  • [44] Unsupervised Multi-task Learning Dialogue Management Extended Abstract
    Sushravya, G. M.
    Sengupta, Shubhashis
    PROCEEDINGS OF THE 6TH ACM IKDD CODS AND 24TH COMAD, 2019, : 196 - 202
  • [45] Unsupervised Joint Multi-Task Learning of Vision Geometry Tasks
    Jha, Prabhash Kumar
    Tsanev, Doychin
    Lukic, Luka
    2021 IEEE INTELLIGENT VEHICLES SYMPOSIUM WORKSHOPS (IV WORKSHOPS), 2021, : 215 - 221
  • [46] Decision making on robot with multi-task using deep reinforcement learning for each task
    Shimoguchi, Yuya
    Kurashige, Kentarou
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3460 - 3465
  • [47] Computational task offloading algorithm based on deep reinforcement learning and multi-task dependency
    Zhang, Xiaoqi
    Lin, Tengxiang
    Lin, Cheng-Kuan
    Chen, Zhen
    Cheng, Hongju
    THEORETICAL COMPUTER SCIENCE, 2024, 993
  • [48] Multi-task Deep Reinforcement Learning Optimal Dispatchng Based on Grid Operation Scenario Clustering
    Deng B.
    Chen J.
    Ding Q.
    Pan Z.
    Yu T.
    Wang K.
    Hou J.
    Dianwang Jishu/Power System Technology, 2023, 47 (03): : 978 - 987
  • [49] Learning Sparse Task Relations in Multi-Task Learning
    Zhang, Yu
    Yang, Qiang
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2914 - 2920
  • [50] Task Variance Regularized Multi-Task Learning
    Mao, Yuren
    Wang, Zekai
    Liu, Weiwei
    Lin, Xuemin
    Hu, Wenbin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 8615 - 8629