A Controllable Agent by Subgoals in Path Planning Using Goal-Conditioned Reinforcement Learning

被引:4
|
作者
Lee, Gyeong Taek [1 ,2 ]
Kim, Kangjin [3 ,4 ]
机构
[1] State Univ New Jersey, Rutgers Univ, Dept Ind & Syst Engn, Piscataway, NJ 08854 USA
[2] AImtory, Seoul 06249, South Korea
[3] Brigham & Womens Hosp, Dept Med, Channing Div Network Med, Boston, MA 02115 USA
[4] Harvard Med Sch, Boston, MA 02115 USA
关键词
Trajectory; Training; Behavioral sciences; Robots; Reinforcement learning; Task analysis; Memory; Controllable agent; path planning; goal-conditioned reinforcement learning; bidirectional memory editing; MEMORY; ENVIRONMENTS;
D O I
10.1109/ACCESS.2023.3264264
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The aim of path planning is to search for a path from the starting point to the goal. Numerous studies, however, have dealt with a single predefined goal. That is, an agent who has completed learning cannot reach other goals that have not been visited in the training. In the present study, we propose a novel reinforcement learning (RL) framework for an agent reachable to any subgoal as well as the final goal in path planning. To do this, we utilize goal-conditioned RL and propose bidirectional memory editing to obtain various bidirectional trajectories of the agent. Bidirectional memory editing can generate various behavior and subgoals of the agent from the limited trajectory. Then, the generated subgoals and behaviors of the agent are trained on the policy network so that the agent can reach any subgoals from any starting point. In addition, we present reward shaping for the short path of the agent to reach the goal. In the experimental result, the agent was able to reach the various goals that had never been visited by the agent during the training. We confirmed that the agent could perform difficult missions, such as a round trip, and the agent used the shorter route with reward shaping.
引用
收藏
页码:33812 / 33825
页数:14
相关论文
共 50 条
  • [1] Real-time path planning of controllable UAV by subgoals using goal-conditioned reinforcement learning
    Lee, GyeongTaek
    Kim, KangJin
    Jang, Jaeyeon
    APPLIED SOFT COMPUTING, 2023, 146
  • [2] Goal-Conditioned Reinforcement Learning with Imagined Subgoals
    Chane-Sane, Elliot
    Schmid, Cordelia
    Laptev, Ivan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [3] Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning
    Li, Jinning
    Tang, Chen
    Tomizuka, Masayoshi
    Zhan, Wei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10216 - 10223
  • [4] Contrastive Learning as Goal-Conditioned Reinforcement Learning
    Eysenbach, Benjamin
    Zhang, Tianjun
    Levine, Sergey
    Salakhutdinov, Ruslan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] Goal-Conditioned Reinforcement Learning With Disentanglement-Based Reachability Planning
    Qian, Zhifeng
    You, Mingyu
    Zhou, Hongjun
    Xu, Xuanhui
    He, Bin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08): : 4721 - 4728
  • [6] State Representation Learning for Goal-Conditioned Reinforcement Learning
    Steccanella, Lorenzo
    Jonsson, Anders
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 84 - 99
  • [7] Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
    Hansen-Estruch, Philippe
    Zhang, Amy
    Nair, Ashvin
    Yin, Patrick
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [8] Goal-Conditioned Reinforcement Learning for Ultrasound Navigation Guidance
    Amadou, Abdoul Aziz
    Singh, Vivek
    Ghesu, Florin C.
    Kim, Young-Ho
    Stanciulescu, Laura
    Sai, Harshitha P.
    Sharma, Puneet
    Young, Alistair
    Rajani, Ronak
    Rhode, Kawal
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XI, 2024, 15011 : 319 - 329
  • [9] Goal-Conditioned Predictive Coding for Offline Reinforcement Learning
    Zeng, Zilai
    Zhang, Ce
    Wang, Shijie
    Sun, Chen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] Curriculum Goal-Conditioned Imitation for Offline Reinforcement Learning
    Feng, Xiaoyun
    Jiang, Li
    Yu, Xudong
    Xu, Haoran
    Sun, Xiaoyan
    Wang, Jie
    Zhan, Xianyuan
    Chan, Wai Kin
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (01) : 102 - 112