A Guided-to-Autonomous Policy Learning method of Deep Reinforcement Learning in Path Planning

被引:0
|
作者
Zhao, Wang [1 ]
Zhang, Ye [1 ]
Li, Haoyu [1 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
path planning; Deep Reinforcement Learning; training efficiency; composite optimization; Guided-to-Autonomous Policy Learning;
D O I
10.1109/ICCA62789.2024.10591821
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study introduces a Guided-to-Autonomous Policy Learning (GAPL) method that improves the training efficiency and composite optimization of Deep Reinforcement Learning (DRL) in path planning. Under this method, firstly, we introduce the concept of guiding rewards as a reward enhancement mechanism, which, based on Rapidly-exploring Random Trees (RRT) and Artificial Potential Field (APF) algorithm, effectively addresses the challenge of training efficiency. We then propose the Guided-to-Autonomous Reward Transition (GART) model to solve the combined challenges of balancing training efficiency with composite optimization problems, which lies in the evolutionary refinement of the reward structure, initially dominated by guiding rewards, transiting progressively toward a focus on rewards that emphasize composite optimization, specifically minimizing the distance and time to the end point. Simulated experiments in static obstacle settings and mixed dynamic-static obstacle environments demonstrate that: 1) guiding rewards play a significant role in enhancing training efficiency; 2) the GAPL method yields superior composite optimization outcomes for path planning compared to conventional methods, and it effectively addresses the issue of training efficiency in conventional DRL method.
引用
收藏
页码:665 / 672
页数:8
相关论文
共 50 条
  • [1] Explainable Deep Reinforcement Learning for UAV autonomous path planning
    He, Lei
    Aouf, Nabil
    Song, Bifeng
    AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118
  • [2] A UAV Path Planning Method Based on Deep Reinforcement Learning
    Li, Yibing
    Zhang, Sitong
    Ye, Fang
    Jiang, Tao
    Li, Yingsong
    2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
  • [3] Deep reinforcement learning for adaptive path planning and control of an autonomous underwater vehicle
    Hadi, Behnaz
    Khosravi, Alireza
    Sarhadi, Pouria
    APPLIED OCEAN RESEARCH, 2022, 129
  • [4] An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning
    Guo, Siyu
    Zhang, Xiuguo
    Zheng, Yisong
    Du, Yiquan
    SENSORS, 2020, 20 (02)
  • [5] Path planning of autonomous UAVs using reinforcement learning
    Chronis, Christos
    Anagnostopoulos, Georgios
    Politi, Elena
    Garyfallou, Antonios
    Varlamis, Iraklis
    Dimitrakopoulos, George
    12TH EASN INTERNATIONAL CONFERENCE ON "INNOVATION IN AVIATION & SPACE FOR OPENING NEW HORIZONS", 2023, 2526
  • [6] Path Planning for Autonomous Balloon Navigation with Reinforcement Learning
    He, Yingzhe
    Guo, Kai
    Wang, Chisheng
    Fu, Keyi
    Zheng, Jiehao
    ELECTRONICS, 2025, 14 (01):
  • [7] Improved Robot Path Planning Method Based on Deep Reinforcement Learning
    Han, Huiyan
    Wang, Jiaqi
    Kuang, Liqun
    Han, Xie
    Xue, Hongxin
    SENSORS, 2023, 23 (12)
  • [8] Path planning of manipulator based on deep reinforcement learning and screw method
    Wang Y.
    Wang Y.-H.
    Yin Z.-Z.
    Wan P.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (03): : 516 - 524
  • [9] A path planning method based on deep reinforcement learning for crowd evacuation
    Meng X.
    Liu H.
    Li W.
    Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (6) : 2925 - 2939
  • [10] Path Following with Deep Reinforcement Learning for Autonomous Cars
    Alomari, Khaled
    Mendoza, Ricardo Carrillo
    Goehring, Daniel
    Rojas, Raul
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ROBOTICS, COMPUTER VISION AND INTELLIGENT SYSTEMS (ROBOVIS), 2021, : 173 - 181