State Representation Learning for Goal-Conditioned Reinforcement Learning

被引:0
|
作者
Steccanella, Lorenzo [1 ]
Jonsson, Anders [1 ]
机构
[1] Univ Pompeu Fabra, Dept Informat & Commun Technol, Barcelona, Spain
基金
欧盟地平线“2020”;
关键词
Representation learning; Goal-conditioned reinforcement learning; Reward shaping; Reinforcement learning;
D O I
10.1007/978-3-031-26412-2_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel state representation for reward-free Markov decision processes. The idea is to learn, in a self-supervised manner, an embedding space where distances between pairs of embedded states correspond to the minimum number of actions needed to transition between them. Compared to previous methods, our approach does not require any domain knowledge, learning from offline and unlabeled data. We show how this representation can be leveraged to learn goalconditioned policies, providing a notion of similarity between states and goals and a useful heuristic distance to guide planning and reinforcement learning algorithms. Finally, we empirically validate our method in classic control domains and multi-goal environments, demonstrating that our method can successfully learn representations in large and/or continuous domains.
引用
收藏
页码:84 / 99
页数:16
相关论文
共 50 条
  • [1] Contrastive Learning as Goal-Conditioned Reinforcement Learning
    Eysenbach, Benjamin
    Zhang, Tianjun
    Levine, Sergey
    Salakhutdinov, Ruslan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] Compact Goal Representation Learning via Information Bottleneck in Goal-Conditioned Reinforcement Learning
    Zou, Qiming
    Suzuki, Einoshin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 2368 - 2381
  • [3] Weakly Supervised Disentangled Representation for Goal-Conditioned Reinforcement Learning
    Qian, Zhifeng
    You, Mingyu
    Zhou, Hongjun
    He, Bin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 2202 - 2209
  • [4] Representation-Based Robustness in Goal-Conditioned Reinforcement Learning
    Yin, Xiangyu
    Wu, Sihao
    Liu, Jiaxu
    Fang, Meng
    Zhao, Xingyu
    Huang, Xiaowei
    Ruan, Wenjie
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21761 - 21769
  • [5] Goal-Conditioned Reinforcement Learning with Imagined Subgoals
    Chane-Sane, Elliot
    Schmid, Cordelia
    Laptev, Ivan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [6] Goal-conditioned offline reinforcement learning through state space partitioning
    Wang, Mianchu
    Jin, Yue
    Montana, Giovanni
    MACHINE LEARNING, 2024, 113 (05) : 2435 - 2465
  • [7] Goal-conditioned offline reinforcement learning through state space partitioning
    Mianchu Wang
    Yue Jin
    Giovanni Montana
    Machine Learning, 2024, 113 : 2435 - 2465
  • [8] Sample-Efficient Goal-Conditioned Reinforcement Learning via Predictive Information Bottleneck for Goal Representation Learning
    Zou, Qiming
    Suzuki, Einoshin
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9523 - 9529
  • [9] Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
    Hansen-Estruch, Philippe
    Zhang, Amy
    Nair, Ashvin
    Yin, Patrick
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [10] Curriculum Goal-Conditioned Imitation for Offline Reinforcement Learning
    Feng, Xiaoyun
    Jiang, Li
    Yu, Xudong
    Xu, Haoran
    Sun, Xiaoyan
    Wang, Jie
    Zhan, Xianyuan
    Chan, Wai Kin
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (01) : 102 - 112