Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

被引：0

作者：

Oh, Junhyuk ^{[1
]}

Singh, Satinder ^{[1
]}

Lee, Honglak ^{[1
,2
]}

Kohli, Pushmeet ^{[3
]}

机构：

[1] Univ Michigan, Ann Arbor, MI 48109 USA

[2] Google Brain, Mountain View, CA USA

[3] Microsoft Res, Mountain View, CA USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017年 / 70卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As a step towards developing zero-shot task generalization capabilities in reinforcement learning (RL), we introduce a new RL problem where the agent should learn to execute sequences of instructions after learning useful skills that solve subtasks. In this problem, we consider two types of generalizations: to previously unseen instructions and to longer sequences of instructions. For generalization over unseen instructions, we propose a new objective which encourages learning correspondences between similar subtasks by making analogies. For generalization over sequential instructions, we present a hierarchical architecture where a meta controller learns to use the acquired skills for executing the instructions. To deal with delayed reward, we propose a new neural architecture in the meta controller that learns when to update the subtask, which makes learning more efficient. Experimental results on a stochastic 3D domain show that the proposed ideas are crucial for generalization to longer instructions as well as unseen instructions.Y

引用

页数：10

共 50 条

[21] PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction
Bai, Fengshuo
Zhang, Hongming
Tao, Tianyang
Wu, Zhiheng
Wang, Yanna
Xu, Bo
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6728 - 6736
[22] Multi-task Deep Reinforcement Learning: a Combination of Rainbow and DisTraL
Andalibi, Milad
Setoodeh, Peyman
Mansourieh, Ali
Asemani, Mohammad Hassan
2020 6TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2020,
[23] Multi-task reinforcement learning in humans
Momchil S. Tomov
Eric Schulz
Samuel J. Gershman
Nature Human Behaviour, 2021, 5 : 764 - 773
[24] Multi-Task Reinforcement Learning for Quadrotors
Xing, Jiaxu
Geles, Ismail
Song, Yunlong
Aljalbout, Elie
Scaramuzza, Davide
IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2112 - 2119
[25] Multi-task reinforcement learning in humans
Tomov, Momchil S.
Schulz, Eric
Gershman, Samuel J.
NATURE HUMAN BEHAVIOUR, 2021, 5 (06) : 764 - +
[26] Hierarchical Reinforcement Learning for Zero-shot Generalization with Subtask Dependencies
Sohn, Sungryull
Oh, Junhyuk
Lee, Honglak
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[27] Sparse Multi-Task Reinforcement Learning
Calandriello, Daniele
Lazaric, Alessandro
Restelli, Marcello
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
[28] Sparse multi-task reinforcement learning
Calandriello, Daniele
Lazaric, Alessandro
Restelli, Marcello
INTELLIGENZA ARTIFICIALE, 2015, 9 (01) : 5 - 20
[29] Decision making on robot with multi-task using deep reinforcement learning for each task
Shimoguchi, Yuya
Kurashige, Kentarou
2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3460 - 3465
[30] Computational task offloading algorithm based on deep reinforcement learning and multi-task dependency
Zhang, Xiaoqi
Lin, Tengxiang
Lin, Cheng-Kuan
Chen, Zhen
Cheng, Hongju
THEORETICAL COMPUTER SCIENCE, 2024, 993

← 1 2 3 4 5 →