Unsupervised Task Clustering for Multi-task Reinforcement Learning

被引:3
|
作者
Ackermann, Johannes [1 ]
Richter, Oliver [2 ]
Wattenhofer, Roger [2 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] Swiss Fed Inst Technol, Zurich, Switzerland
关键词
MIXTURES;
D O I
10.1007/978-3-030-86486-6_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-learning, transfer learning and multi-task learning have recently laid a path towards more generally applicable reinforcement learning agents that are not limited to a single task. However, most existing approaches implicitly assume a uniform similarity between tasks. We argue that this assumption is limiting in settings where the relationship between tasks is unknown a-priori. In this work, we propose a general approach to automatically cluster together similar tasks during training. Our method, inspired by the expectation-maximization algorithm, succeeds at finding clusters of related tasks and uses these to improve sample complexity. We achieve this by designing an agent with multiple policies. In the expectation step, we evaluate the performance of the policies on all tasks and assign each task to the best performing policy. In the maximization step, each policy trains by sampling tasks from its assigned set. This method is intuitive, simple to implement and orthogonal to other multi-task learning algorithms. We show the generality of our approach by evaluating on simple discrete and continuous control tasks, as well as complex bipedal walker tasks and Atari games. Results show improvements in sample complexity as well as a more general applicability when compared to other approaches.
引用
收藏
页码:222 / 237
页数:16
相关论文
共 50 条
  • [1] Unsupervised Human Activity Representation Learning with Multi-task Deep Clustering
    Ma, Haojie
    Zhang, Zhijie
    Li, Wenzhong
    Lu, Sanglu
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2021, 5 (01):
  • [2] Multi-task reinforcement learning in humans
    Momchil S. Tomov
    Eric Schulz
    Samuel J. Gershman
    Nature Human Behaviour, 2021, 5 : 764 - 773
  • [3] Multi-task reinforcement learning in humans
    Tomov, Momchil S.
    Schulz, Eric
    Gershman, Samuel J.
    NATURE HUMAN BEHAVIOUR, 2021, 5 (06) : 764 - +
  • [4] Multi-Task Reinforcement Learning for Quadrotors
    Xing, Jiaxu
    Geles, Ismail
    Song, Yunlong
    Aljalbout, Elie
    Scaramuzza, Davide
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2112 - 2119
  • [5] Sparse Multi-Task Reinforcement Learning
    Calandriello, Daniele
    Lazaric, Alessandro
    Restelli, Marcello
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [6] Sparse multi-task reinforcement learning
    Calandriello, Daniele
    Lazaric, Alessandro
    Restelli, Marcello
    INTELLIGENZA ARTIFICIALE, 2015, 9 (01) : 5 - 20
  • [7] Convex Multi-Task Learning by Clustering
    Barzilai, Aviad
    Crammer, Koby
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 65 - 73
  • [8] Network Clustering for Multi-task Learning
    Mu, Zhiying
    Gao, Dehong
    Guo, Sensen
    NEURAL PROCESSING LETTERS, 2025, 57 (01)
  • [9] Multi-task Learning with Modular Reinforcement Learning
    Xue, Jianyong
    Alexandre, Frederic
    FROM ANIMALS TO ANIMATS 16, 2022, 13499 : 127 - 138
  • [10] Learning a navigation task in changing environments by multi-task reinforcement learning
    Grossmann, A
    Poli, R
    ADVANCES IN ROBOT LEARNING, PROCEEDINGS, 2000, 1812 : 23 - 43