Variational Skill Embeddings for Meta Reinforcement Learning

Cited by: 4
Authors
Chien, Jen-Tzung [1]
Lai, Weiwei [1]
Affiliations
[1] Natl Yang Ming Chiao Tung Univ, Inst Elect & Comp Engn, Hsinchu, Taiwan
Source
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023
Keywords
DOI
10.1109/IJCNN54540.2023.10191425
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Meta reinforcement learning (meta-RL) aims to learn useful prior knowledge across tasks that generalizes to unseen but similar tasks with only a small number of adaptation steps. Traditionally, gradient-based meta-RL used gradients to learn the parameters of an adaptive policy from different tasks, an approach that likely lacked sample efficiency. More recently, context-based meta-RL improved efficiency by learning embeddings of trajectories based on a context representation. The learned policy can be adapted to new tasks, but its performance is bounded by the simple context encoder. To address this limitation, this paper presents a novel regularized meta-RL method in which policy generalization is enhanced through a context-based meta-RL in which a conditional variational autoencoder, consisting of a context-skill encoder and a soft actor-critic decoder, is implemented. The proposed method pursues model regularization by discovering skill patterns shared across tasks within context-based meta-RL. Experiments on a number of benchmark tasks show the merit of variational skill embeddings for regularized meta-RL.
Pages: 8