Fast Task Adaptation Based on the Combination of Model-Based and Gradient-Based Meta Learning

Cited by: 12
Authors
Xu, Zhixiong [1]
Chen, Xiliang [1]
Cao, Lei [1]
Affiliations
[1] Army Engineering University, Institute of Command and Control Engineering, Nanjing 210000, People's Republic of China
Keywords
Task analysis; Adaptation models; Reinforcement learning; Trajectory; Games; Data models; Training; Fast adaptation; gradient; metalearning; model-based
DOI
10.1109/TCYB.2020.3028378
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Deep reinforcement learning (DRL) has recently attained remarkable results in various domains, including games, robotics, and recommender systems. Nevertheless, fast adaptation remains an urgent problem in the practical application of DRL. To this end, this article proposes a new and versatile metalearning approach called fast task adaptation via metalearning (FTAML), which leverages the strengths of model-based methods and gradient-based metalearning methods to train the initial parameters of the model, such that the model can efficiently master unseen tasks with only a small amount of data from those tasks. The proposed algorithm makes it possible to separate task optimization from task identification. Specifically, the model-based learner helps to identify the pattern of a task, while the gradient-based metalearner consistently improves performance with only a few gradient update steps by using the task embedding produced by the model-based learner. In addition, the choice of network for the model-based learner is discussed, and the performance of networks with different depths is explored. Finally, simulation results on reinforcement learning problems demonstrate that the proposed approach outperforms the compared metalearning algorithms and delivers new state-of-the-art performance on a variety of challenging control tasks.
Pages: 5209-5218
Page count: 10