State Representation Learning for Goal-Conditioned Reinforcement Learning

被引:0
|
作者
Steccanella, Lorenzo [1 ]
Jonsson, Anders [1 ]
机构
[1] Univ Pompeu Fabra, Dept Informat & Commun Technol, Barcelona, Spain
基金
欧盟地平线“2020”;
关键词
Representation learning; Goal-conditioned reinforcement learning; Reward shaping; Reinforcement learning;
D O I
10.1007/978-3-031-26412-2_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel state representation for reward-free Markov decision processes. The idea is to learn, in a self-supervised manner, an embedding space where distances between pairs of embedded states correspond to the minimum number of actions needed to transition between them. Compared to previous methods, our approach does not require any domain knowledge, learning from offline and unlabeled data. We show how this representation can be leveraged to learn goalconditioned policies, providing a notion of similarity between states and goals and a useful heuristic distance to guide planning and reinforcement learning algorithms. Finally, we empirically validate our method in classic control domains and multi-goal environments, demonstrating that our method can successfully learn representations in large and/or continuous domains.
引用
收藏
页码:84 / 99
页数:16
相关论文
共 50 条
  • [31] Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
    Ding, Hongyu
    Tang, Yuanze
    Wu, Qing
    Wang, Bo
    Chen, Chunlin
    Wang, Zhi
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (12) : 2233 - 2247
  • [32] Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: A Short Survey
    Colas C.
    Karch T.
    Sigaud O.
    Oudeyer P.-Y.
    Journal of Artificial Intelligence Research, 2022, 74 : 1159 - 1199
  • [33] Goal-Conditioned Reinforcement Learning within a Human-Robot Disassembly Environment
    Elguea-Aguinaco, Inigo
    Serrano-Munoz, Antonio
    Chrysostomou, Dimitrios
    Inziarte-Hidalgo, Ibai
    Bogh, Simon
    Arana-Arexolaleiba, Nestor
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [34] A Controllable Agent by Subgoals in Path Planning Using Goal-Conditioned Reinforcement Learning
    Lee, Gyeong Taek
    Kim, Kangjin
    IEEE ACCESS, 2023, 11 : 33812 - 33825
  • [35] MURM: Utilization of Multi-Views for Goal-Conditioned Reinforcement Learning in Robotic Manipulation
    Jang, Seongwon
    Jeong, Hyemi
    Yang, Hyunseok
    ROBOTICS, 2023, 12 (04)
  • [36] Deep Reinforcement Learning Based on Local GNN for Goal-Conditioned Deformable Object Rearranging
    Deng, Yuhong
    Xia, Chongkun
    Wang, Xueqian
    Chen, Lipeng
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 1131 - 1138
  • [37] Offline Goal-Conditioned Reinforcement Learning for Safety-Critical Tasks with Recovery Policy
    Cao, Chenyang
    Yan, Zichen
    Lu, Renhao
    Tan, Junbo
    Wang, Xueqian
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 2838 - 2844
  • [38] Goal-Conditioned Q-learning as Knowledge Distillation
    Levine, Alexander
    Feizi, Soheil
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8500 - 8509
  • [39] Robotic Control in Adversarial and Sparse Reward Environments: A Robust Goal-Conditioned Reinforcement Learning Approach
    He X.
    Lv C.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (01): : 244 - 253
  • [40] Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
    Li, Yunfei
    Gao, Tian
    Yang, Jiaqi
    Xu, Huazhe
    Wu, Yi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,