Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

被引:0
|
作者
Oh, Junhyuk [1 ]
Singh, Satinder [1 ]
Lee, Honglak [1 ,2 ]
Kohli, Pushmeet [3 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Google Brain, Mountain View, CA USA
[3] Microsoft Res, Mountain View, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a step towards developing zero-shot task generalization capabilities in reinforcement learning (RL), we introduce a new RL problem where the agent should learn to execute sequences of instructions after learning useful skills that solve subtasks. In this problem, we consider two types of generalizations: to previously unseen instructions and to longer sequences of instructions. For generalization over unseen instructions, we propose a new objective which encourages learning correspondences between similar subtasks by making analogies. For generalization over sequential instructions, we present a hierarchical architecture where a meta controller learns to use the acquired skills for executing the instructions. To deal with delayed reward, we propose a new neural architecture in the meta controller that learns when to update the subtask, which makes learning more efficient. Experimental results on a stochastic 3D domain show that the proposed ideas are crucial for generalization to longer instructions as well as unseen instructions.Y
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Pareto Multi-task Deep Learning
    Riccio, Salvatore D.
    Dyankov, Deyan
    Jansen, Giorgio
    Di Fatta, Giuseppe
    Nicosia, Giuseppe
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 132 - 141
  • [42] Federated reinforcement learning for robot motion planning with zero-shot generalization
    Yuan, Zhenyuan
    Xu, Siyuan
    Zhu, Minghui
    AUTOMATICA, 2024, 166
  • [43] Task Aligned Generative Meta-learning for Zero-shot Learning
    Liu, Zhe
    Li, Yun
    Yao, Lina
    Wang, Xianzhi
    Long, Guodong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8723 - 8731
  • [44] Zero-Shot Text-to-SQL Learning with Auxiliary Task
    Chang, Shuaichen
    Liu, Pengfei
    Tang, Yun
    Huang, Jing
    He, Xiaodong
    Zhou, Bowen
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7488 - 7495
  • [45] Multi-Task Reinforcement Learning with Soft Modularization
    Yang, Ruihan
    Xu, Huazhe
    Wu, Yi
    Wang, Xiaolong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [46] Multi-Task Multi-Agent Reinforcement Learning With Interaction and Task Representations
    Li, Chao
    Dong, Shaokang
    Yang, Shangdong
    Hu, Yujing
    Ding, Tianyu
    Li, Wenbin
    Gao, Yang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [47] Adversarial Online Multi-Task Reinforcement Learning
    Nguyen, Quan
    Mehta, Nishant A.
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 1124 - 1165
  • [48] Adversarial Online Multi-Task Reinforcement Learning
    Nguyen, Quan
    Mehta, Nishant A.
    Proceedings of Machine Learning Research, 2023, 201 : 1124 - 1165
  • [49] Multi-task Batch Reinforcement Learning with Metric Learning
    Li, Jiachen
    Quan Vuong
    Liu, Shuang
    Liu, Minghua
    Ciosek, Kamil
    Christensen, Henrik
    Su, Hao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [50] Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning
    Yoo, Minjong
    Cho, Sangwoo
    Woo, Honguk
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,