A Guided-to-Autonomous Policy Learning method of Deep Reinforcement Learning in Path Planning

被引:0
|
作者
Zhao, Wang [1 ]
Zhang, Ye [1 ]
Li, Haoyu [1 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
path planning; Deep Reinforcement Learning; training efficiency; composite optimization; Guided-to-Autonomous Policy Learning;
D O I
10.1109/ICCA62789.2024.10591821
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study introduces a Guided-to-Autonomous Policy Learning (GAPL) method that improves the training efficiency and composite optimization of Deep Reinforcement Learning (DRL) in path planning. Under this method, firstly, we introduce the concept of guiding rewards as a reward enhancement mechanism, which, based on Rapidly-exploring Random Trees (RRT) and Artificial Potential Field (APF) algorithm, effectively addresses the challenge of training efficiency. We then propose the Guided-to-Autonomous Reward Transition (GART) model to solve the combined challenges of balancing training efficiency with composite optimization problems, which lies in the evolutionary refinement of the reward structure, initially dominated by guiding rewards, transiting progressively toward a focus on rewards that emphasize composite optimization, specifically minimizing the distance and time to the end point. Simulated experiments in static obstacle settings and mixed dynamic-static obstacle environments demonstrate that: 1) guiding rewards play a significant role in enhancing training efficiency; 2) the GAPL method yields superior composite optimization outcomes for path planning compared to conventional methods, and it effectively addresses the issue of training efficiency in conventional DRL method.
引用
收藏
页码:665 / 672
页数:8
相关论文
共 50 条
  • [41] Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments
    Zijian HU
    Xiaoguang GAO
    Kaifang WAN
    Yiwei ZHAI
    Qianglong WANG
    Chinese Journal of Aeronautics, 2021, (12) : 187 - 204
  • [42] A decentralized path planning model based on deep reinforcement learning
    Guo, Dong
    Ji, Shouwen
    Yao, Yanke
    Chen, Cheng
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 117
  • [43] Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments
    Zijian HU
    Xiaoguang GAO
    Kaifang WAN
    Yiwei ZHAI
    Qianglong WANG
    Chinese Journal of Aeronautics, 2021, 34 (12) : 187 - 204
  • [44] Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments
    Hu, Zijian
    Gao, Xiaoguang
    Wan, Kaifang
    Zhai, Yiwei
    Wang, Qianglong
    CHINESE JOURNAL OF AERONAUTICS, 2021, 34 (12) : 187 - 204
  • [45] Deep Reinforcement Learning for Indoor Mobile Robot Path Planning
    Gao, Junli
    Ye, Weijie
    Guo, Jing
    Li, Zhongjuan
    SENSORS, 2020, 20 (19) : 1 - 15
  • [46] A path planning method based on deep reinforcement learning for AUV in complex marine environment
    Zhang, An
    Wang, Weixiang
    Bi, Wenhao
    Huang, Zhanjun
    OCEAN ENGINEERING, 2024, 313
  • [47] Path Planning for the Robotic Manipulator in Dynamic Environments Based on a Deep Reinforcement Learning Method
    Jie Liu
    Hwa Jen Yap
    Anis Salwa Mohd Khairuddin
    Journal of Intelligent & Robotic Systems, 111 (1)
  • [48] Object Detection with Deep Neural Networks for Reinforcement Learning in the Task of Autonomous Vehicles Path Planning at the Intersection
    Yudin, D. A.
    Skrynnik, A.
    Krishtopik, A.
    Belkin, I
    Panov, A., I
    OPTICAL MEMORY AND NEURAL NETWORKS, 2019, 28 (04) : 283 - 295
  • [49] iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
    Maw, Aye Aye
    Tyan, Maxim
    Nguyen, Tuan Anh
    Lee, Jae-Woo
    APPLIED SCIENCES-BASEL, 2021, 11 (09):
  • [50] Path Planning Based on Deep Reinforcement Learning for Autonomous Underwater Vehicles Under Ocean Current Disturbance
    Chu, Zhenzhong
    Wang, Fulun
    Lei, Tingjun
    Luo, Chaomin
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 108 - 120