Unsupervised Task Clustering for Multi-task Reinforcement Learning

Cited by: 3

Authors:

Ackermann, Johannes [1]

Richter, Oliver [2]

Wattenhofer, Roger [2]

Affiliations:

[1] Tech Univ Munich, Munich, Germany

[2] Swiss Fed Inst Technol, Zurich, Switzerland

Source:

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES | 2021 / Vol. 12975

Keywords:

MIXTURES;

DOI:

10.1007/978-3-030-86486-6_14

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Meta-learning, transfer learning and multi-task learning have recently laid a path towards more generally applicable reinforcement learning agents that are not limited to a single task. However, most existing approaches implicitly assume a uniform similarity between tasks. We argue that this assumption is limiting in settings where the relationship between tasks is unknown a-priori. In this work, we propose a general approach to automatically cluster together similar tasks during training. Our method, inspired by the expectation-maximization algorithm, succeeds at finding clusters of related tasks and uses these to improve sample complexity. We achieve this by designing an agent with multiple policies. In the expectation step, we evaluate the performance of the policies on all tasks and assign each task to the best performing policy. In the maximization step, each policy trains by sampling tasks from its assigned set. This method is intuitive, simple to implement and orthogonal to other multi-task learning algorithms. We show the generality of our approach by evaluating on simple discrete and continuous control tasks, as well as complex bipedal walker tasks and Atari games. Results show improvements in sample complexity as well as a more general applicability when compared to other approaches.
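The expectation and maximization steps described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes each "policy" is a single scalar, each "task" is a hidden target value, and the return is a negative squared distance, whereas the paper trains reinforcement learning policies and evaluates them on real environments. All function names and parameters here (`evaluate`, `e_step`, `m_step`, `cluster_tasks`, `steps`, `lr`) are hypothetical.

```python
import random

def evaluate(theta, target):
    """Toy return of a scalar 'policy' theta on a task with a hidden target."""
    return -(theta - target) ** 2

def e_step(policies, tasks):
    """Expectation step: assign each task to the best-performing policy."""
    assignments = [[] for _ in policies]
    for task_id, target in enumerate(tasks):
        best = max(range(len(policies)),
                   key=lambda k: evaluate(policies[k], target))
        assignments[best].append(task_id)
    return assignments

def m_step(policies, tasks, assignments, rng, steps=50, lr=0.1):
    """Maximization step: train each policy on tasks sampled from its set."""
    new_policies = list(policies)
    for k, assigned in enumerate(assignments):
        if not assigned:
            continue  # assumption: a policy with no tasks is left unchanged
        for _ in range(steps):
            target = tasks[rng.choice(assigned)]
            # Gradient ascent on the toy return -(theta - target)^2.
            new_policies[k] += lr * (-2.0) * (new_policies[k] - target)
    return new_policies

def cluster_tasks(tasks, initial_policies, rounds=10, seed=0):
    """Alternate the two steps for a fixed number of rounds."""
    rng = random.Random(seed)
    policies = list(initial_policies)
    for _ in range(rounds):
        assignments = e_step(policies, tasks)
        policies = m_step(policies, tasks, assignments, rng)
    return policies, e_step(policies, tasks)

# Two groups of related toy tasks, centered near -1 and +1.
tasks = [-1.1, -1.0, -0.9, 0.9, 1.0, 1.1]
policies, assignments = cluster_tasks(tasks, initial_policies=[-0.1, 0.1])
print(assignments)
```

In this toy setting the two policies specialize on the two groups of targets, so tasks that are close to each other end up assigned to the same policy. In a realistic setting, `evaluate` would be replaced by averaged returns from rollouts and `m_step` by updates of whatever reinforcement learning algorithm is in use.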


Pages: 222-237

Page count: 16

