State Representation Learning for Goal-Conditioned Reinforcement Learning

被引:0
|
作者
Steccanella, Lorenzo [1 ]
Jonsson, Anders [1 ]
机构
[1] Univ Pompeu Fabra, Dept Informat & Commun Technol, Barcelona, Spain
基金
欧盟地平线“2020”;
关键词
Representation learning; Goal-conditioned reinforcement learning; Reward shaping; Reinforcement learning;
D O I
10.1007/978-3-031-26412-2_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel state representation for reward-free Markov decision processes. The idea is to learn, in a self-supervised manner, an embedding space where distances between pairs of embedded states correspond to the minimum number of actions needed to transition between them. Compared to previous methods, our approach does not require any domain knowledge, learning from offline and unlabeled data. We show how this representation can be leveraged to learn goalconditioned policies, providing a notion of similarity between states and goals and a useful heuristic distance to guide planning and reinforcement learning algorithms. Finally, we empirically validate our method in classic control domains and multi-goal environments, demonstrating that our method can successfully learn representations in large and/or continuous domains.
引用
收藏
页码:84 / 99
页数:16
相关论文
共 50 条
  • [41] Generating Goal-conditioned Sub-goals for Hierarchical Learning
    Choi, Jinwoo
    Seo, Seung-Woo
    2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
  • [42] Learning Domain Invariant Representations in Goal-conditioned Block MDPs
    Han, Beining
    Zheng, Chongyi
    Chan, Harris
    Paster, Keiran
    Zhang, Michael R.
    Ba, Jimmy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [43] Goal-Conditioned Reinforcement Learning With Adaptive Intrinsic Curiosity and Universal Value Network Fitting for Robotic Manipulation
    Sun, Zihao
    Yuan, Xianfeng
    Xu, Qingyang
    Pang, Bao
    Song, Yong
    Song, Rui
    Li, Yibin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2025, 21 (03) : 2244 - 2253
  • [44] A Fully Controllable UAV Using Curriculum Learning and Goal-Conditioned Reinforcement Learning: From Straight Forward to Round Trip Missions
    Kim, Hyeonmin
    Choi, Jongkwan
    Do, Hyungrok
    Lee, Gyeong Taek
    DRONES, 2025, 9 (01)
  • [45] Real-time path planning of controllable UAV by subgoals using goal-conditioned reinforcement learning
    Lee, GyeongTaek
    Kim, KangJin
    Jang, Jaeyeon
    APPLIED SOFT COMPUTING, 2023, 146
  • [46] Ricci Planner: Zero-Shot Transfer for Goal-Conditioned Reinforcement Learning via Geometric Flow
    Song, Wongeun
    Lee, Jungwoo
    IEEE ACCESS, 2024, 12 : 24027 - 24038
  • [47] Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability
    Zhu, Hanlin
    Zhang, Amy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [48] Learning to Rearrange Deformable Cables, Fabrics, and Bags with Goal-Conditioned Transporter Networks
    Seita, Daniel
    Florence, Pete
    Tompson, Jonathan
    Coumans, Erwin
    Sindhwani, Vikas
    Goldberg, Ken
    Zeng, Andy
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4568 - 4575
  • [49] Using Goal-Conditioned Reinforcement Learning With Deep Imitation to Control Robot Arm in Flexible Flat Cable Assembly Task
    Li, Jingchen
    Shi, Haobin
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6217 - 6228
  • [50] Learning Multi-Object Dense Descriptor for Autonomous Goal-Conditioned Grasping
    Yang, Shuo
    Zhang, Wei
    Song, Ran
    Cheng, Jiyu
    Li, Yibin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 4110 - 4117