A Guided-to-Autonomous Policy Learning method of Deep Reinforcement Learning in Path Planning

被引:0
|
作者
Zhao, Wang [1 ]
Zhang, Ye [1 ]
Li, Haoyu [1 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
path planning; Deep Reinforcement Learning; training efficiency; composite optimization; Guided-to-Autonomous Policy Learning;
D O I
10.1109/ICCA62789.2024.10591821
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study introduces a Guided-to-Autonomous Policy Learning (GAPL) method that improves the training efficiency and composite optimization of Deep Reinforcement Learning (DRL) in path planning. Under this method, firstly, we introduce the concept of guiding rewards as a reward enhancement mechanism, which, based on Rapidly-exploring Random Trees (RRT) and Artificial Potential Field (APF) algorithm, effectively addresses the challenge of training efficiency. We then propose the Guided-to-Autonomous Reward Transition (GART) model to solve the combined challenges of balancing training efficiency with composite optimization problems, which lies in the evolutionary refinement of the reward structure, initially dominated by guiding rewards, transiting progressively toward a focus on rewards that emphasize composite optimization, specifically minimizing the distance and time to the end point. Simulated experiments in static obstacle settings and mixed dynamic-static obstacle environments demonstrate that: 1) guiding rewards play a significant role in enhancing training efficiency; 2) the GAPL method yields superior composite optimization outcomes for path planning compared to conventional methods, and it effectively addresses the issue of training efficiency in conventional DRL method.
引用
收藏
页码:665 / 672
页数:8
相关论文
共 50 条
  • [21] H-MAS Architecture and Reinforcement Learning method for autonomous robot path planning
    Lamini, Chaymaa
    Fathi, Youssef
    Benhlima, Said
    2017 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2017,
  • [22] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
    Liu, Yanglong
    Chen, Zuguo
    Li, Yonggang
    Lu, Ming
    Chen, Chaoyang
    Zhang, Xuzhuo
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 20 (08) : 2669 - 2680
  • [23] EPPE: An Efficient Progressive Policy Enhancement framework of deep reinforcement learning in path planning
    Zhao, Wang
    Zhang, Ye
    Xie, Zikang
    NEUROCOMPUTING, 2024, 596
  • [24] Benchmarking Off-Policy Deep Reinforcement Learning Algorithms for UAV Path Planning
    Garg, Shaswat
    Masnavi, Houman
    Fidan, Baris
    Janabi-Sharifi, Farrokh
    Mantegh, Iraj
    2024 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2024, : 317 - 323
  • [25] Acquisition of Automated Guided Vehicle Route Planning Policy Using Deep Reinforcement Learning
    Kamoshida, Ryota
    Kazama, Yoriko
    2017 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED LOGISTICS AND TRANSPORT (ICALT), 2017, : 1 - 6
  • [26] Autonomous Vehicle Driving Path Control with Deep Reinforcement Learning
    Tiong, Teckchai
    Saad, Ismail
    Teo, Kenneth Tze Kin
    bin Lago, Herwansyah
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 84 - 92
  • [27] Path Following for Autonomous Mobile Robots with Deep Reinforcement Learning
    Cao, Yu
    Ni, Kan
    Kawaguchi, Takahiro
    Hashimoto, Seiji
    SENSORS, 2024, 24 (02)
  • [28] Path planning of autonomous underwater vehicle in unknown environment based on improved deep reinforcement learning
    Tang, Zhicheng
    Cao, Xiang
    Zhou, Zihan
    Zhang, Zhoubin
    Xu, Chen
    Dou, Jianbin
    OCEAN ENGINEERING, 2024, 301
  • [29] Path Planning of Autonomous Mobile Robot in Comprehensive Unknown Environment Using Deep Reinforcement Learning
    Bai, Zekun
    Pang, Hui
    He, Zhaonian
    Zhao, Bin
    Wang, Tong
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (12): : 22153 - 22166
  • [30] An End-to-End Reinforcement Learning Method for Automated Guided Vehicle Path Planning
    Sun Yu
    Li Haisheng
    INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2020, 2020, 11574