Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

被引:0
|
作者
Oh, Junhyuk [1 ]
Singh, Satinder [1 ]
Lee, Honglak [1 ,2 ]
Kohli, Pushmeet [3 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Google Brain, Mountain View, CA USA
[3] Microsoft Res, Mountain View, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a step towards developing zero-shot task generalization capabilities in reinforcement learning (RL), we introduce a new RL problem where the agent should learn to execute sequences of instructions after learning useful skills that solve subtasks. In this problem, we consider two types of generalizations: to previously unseen instructions and to longer sequences of instructions. For generalization over unseen instructions, we propose a new objective which encourages learning correspondences between similar subtasks by making analogies. For generalization over sequential instructions, we present a hierarchical architecture where a meta controller learns to use the acquired skills for executing the instructions. To deal with delayed reward, we propose a new neural architecture in the meta controller that learns when to update the subtask, which makes learning more efficient. Experimental results on a stochastic 3D domain show that the proposed ideas are crucial for generalization to longer instructions as well as unseen instructions.Y
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
    Wu, Zheng
    Xie, Yichen
    Lian, Wenzhao
    Wang, Changhao
    Guo, Yanjiang
    Chen, Jianyu
    Schaal, Stefan
    Tomizuka, Masayoshi
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 7169 - 7175
  • [32] Multi-task Learning with Modular Reinforcement Learning
    Xue, Jianyong
    Alexandre, Frederic
    FROM ANIMALS TO ANIMATS 16, 2022, 13499 : 127 - 138
  • [33] A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
    Kirk R.
    Zhang A.
    Grefenstette E.
    Rocktäschel T.
    Journal of Artificial Intelligence Research, 2023, 76 : 201 - 264
  • [34] Learning a navigation task in changing environments by multi-task reinforcement learning
    Grossmann, A
    Poli, R
    ADVANCES IN ROBOT LEARNING, PROCEEDINGS, 2000, 1812 : 23 - 43
  • [35] A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
    Kirk, Robert
    Zhang, Amy
    Grefenstette, Edward
    Rocktaeschel, Tim
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 76 : 201 - 264
  • [36] Multi-Task Neural Sequence Labeling for Zero-Shot Cross-Language Boilerplate Removal
    Wu, Yu-Hao
    Chang, Chia-Hui
    2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021), 2021, : 326 - 334
  • [37] Scalable Parallel Task Scheduling for Autonomous Driving Using Multi-Task Deep Reinforcement Learning
    Qi, Qi
    Zhang, Lingxin
    Wang, Jingyu
    Sun, Haifeng
    Zhuang, Zirui
    Liao, Jianxin
    Yu, F. Richard
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13861 - 13874
  • [38] A Hybrid Multi-Task Learning Approach for Optimizing Deep Reinforcement Learning Agents
    Varghese, Nelson Vithayathil
    Mahmoud, Qusay H.
    IEEE ACCESS, 2021, 9 : 44681 - 44703
  • [39] Study on deep reinforcement learning for multi-task scheduling in cloud manufacturing
    Xiao, Jiuhong
    Cai, Yishuai
    Chen, Yong
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2025,
  • [40] Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control
    Xu, Zhiyuan
    Wu, Kun
    Che, Zhengping
    Tang, Jian
    Ye, Jieping
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33