A Guided-to-Autonomous Policy Learning method of Deep Reinforcement Learning in Path Planning

被引:0
|
作者
Zhao, Wang [1 ]
Zhang, Ye [1 ]
Li, Haoyu [1 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian, Peoples R China
来源
2024 IEEE 18TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA 2024 | 2024年
基金
中国国家自然科学基金;
关键词
path planning; Deep Reinforcement Learning; training efficiency; composite optimization; Guided-to-Autonomous Policy Learning;
D O I
10.1109/ICCA62789.2024.10591821
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study introduces a Guided-to-Autonomous Policy Learning (GAPL) method that improves the training efficiency and composite optimization of Deep Reinforcement Learning (DRL) in path planning. Under this method, firstly, we introduce the concept of guiding rewards as a reward enhancement mechanism, which, based on Rapidly-exploring Random Trees (RRT) and Artificial Potential Field (APF) algorithm, effectively addresses the challenge of training efficiency. We then propose the Guided-to-Autonomous Reward Transition (GART) model to solve the combined challenges of balancing training efficiency with composite optimization problems, which lies in the evolutionary refinement of the reward structure, initially dominated by guiding rewards, transiting progressively toward a focus on rewards that emphasize composite optimization, specifically minimizing the distance and time to the end point. Simulated experiments in static obstacle settings and mixed dynamic-static obstacle environments demonstrate that: 1) guiding rewards play a significant role in enhancing training efficiency; 2) the GAPL method yields superior composite optimization outcomes for path planning compared to conventional methods, and it effectively addresses the issue of training efficiency in conventional DRL method.
引用
收藏
页码:665 / 672
页数:8
相关论文
共 50 条
  • [21] Deep Reinforcement Learning for Indoor Mobile Robot Path Planning
    Gao, Junli
    Ye, Weijie
    Guo, Jing
    Li, Zhongjuan
    SENSORS, 2020, 20 (19) : 1 - 15
  • [22] A path planning method based on deep reinforcement learning for AUV in complex marine environment
    Zhang, An
    Wang, Weixiang
    Bi, Wenhao
    Huang, Zhanjun
    OCEAN ENGINEERING, 2024, 313
  • [23] Path Planning Based on Deep Reinforcement Learning for Autonomous Underwater Vehicles Under Ocean Current Disturbance
    Chu, Zhenzhong
    Wang, Fulun
    Lei, Tingjun
    Luo, Chaomin
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 108 - 120
  • [24] Advanced planning for autonomous vehicles using reinforcement learning and deep inverse reinforcement learning
    You, Changxi
    Lu, Jianbo
    Filev, Dimitar
    Tsiotras, Panagiotis
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 114 : 1 - 18
  • [25] Efficient Deep Reinforcement Learning for Optimal Path Planning
    Ren, Jing
    Huang, Xishi
    Huang, Raymond N.
    ELECTRONICS, 2022, 11 (21)
  • [26] Robot path planning based on deep reinforcement learning
    Long, Yinxin
    He, Huajin
    2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS), 2020, : 151 - 154
  • [27] Robot Path Planning Based on Deep Reinforcement Learning
    Zhang, Rui
    Jiang, Yuhao
    Wu Fenghua
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1697 - 1701
  • [28] Obstacle avoidance planning of autonomous vehicles using deep reinforcement learning
    Qian, Yubin
    Feng, Song
    Hu, Wenhao
    Wang, Wanqiu
    ADVANCES IN MECHANICAL ENGINEERING, 2022, 14 (12)
  • [29] H-MAS Architecture and Reinforcement Learning method for autonomous robot path planning
    Lamini, Chaymaa
    Fathi, Youssef
    Benhlima, Said
    2017 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2017,
  • [30] Benchmarking Off-Policy Deep Reinforcement Learning Algorithms for UAV Path Planning
    Garg, Shaswat
    Masnavi, Houman
    Fidan, Baris
    Janabi-Sharifi, Farrokh
    Mantegh, Iraj
    2024 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2024, : 317 - 323