A LARGE-SCALE PATH PLANNING ALGORITHM FOR UNDERWATER ROBOTS BASED ON DEEP REINFORCEMENT LEARNING

被引:0
作者
Wang, Wenhui [1 ]
Li, Leqing [1 ]
Ye, Fumeng [1 ]
Peng, Yumin [1 ]
Ma, Yiming [1 ]
机构
[1] CSG PGC Power Storage Res Inst, Guangzhou 510000, Peoples R China
关键词
DDPG algorithm; reward function; deep reinforcement learning; underwater robot; large-scale path planning; VEHICLE;
D O I
10.2316/J.2024.206-1035
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To ensure the effect and improve the accuracy of large-scale path planning for underwater robots, a large-scale algorithm for planning the path for underwater robots based on deep reinforcement learning is proposed. Deep reinforcement learning is analysed, and the idea, structure, network update method, and training process of deep deterministic policy gradients (DDPG) algorithm are described. A fitness learning model of the robot which under water is confirmed to describe the mathematical relationship between the geographical location and operating speed of the underwater robots. On this basis, DDPG algorithm is applied in large-scale path planning of underwater robots. TensorFlow is used to build Actor and Critic neural network structures, and design environment state models, action state spaces, and reward functions. In deep reinforcement learning, the large-scale navigation planning for the underwater robot, through exploration-online trial and error, finds the optimal search strategy, and considers obtaining the maximum expected reward during the path planning procedure, achieving the large-scale path planning for the underwater robot. According to the experimental results, the proposed algorithm demonstrates good performance in large-scale path planning for underwater robots and effectively improves both the accuracy and efficiency of the planning process.
引用
收藏
页码:204 / 210
页数:7
相关论文
共 20 条
  • [1] An Energy Efficient Solution for Fuel Cell Heat Recovery in Zero-Emission Ferry Boats: Deep Deterministic Policy Gradient
    Ahmadi, Hoda
    Rafiei, Mehdi
    Afshari Igder, Moseyeb
    Gheisarnejad, Meysam
    Khooban, Mohammad-Hassan
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (08) : 7571 - 7581
  • [2] Aloba L.T., 2019, Shipbuilding and Marine Infrastructure, V1, P74
  • [3] [Быкова В.С. Bykova V.S.], 2021, [Гироскопия и навигация, Giroskopiya i navigatsiya, Giroskopiya i navigatsiya], V29, P97, DOI 10.17285/0869-7035.0058
  • [4] A fuzzy-based potential field hierarchical reinforcement learning approach for target hunting by multi-AUV in 3-D underwater environments
    Cao, Xiang
    Zuo, Fen
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2021, 94 (05) : 1334 - 1343
  • [5] Non-Communication Decentralized Multi-Robot Collision Avoidance in Grid Map Workspace with Double Deep Q-Network
    Chen, Lin
    Zhao, Yongting
    Zhao, Huanjun
    Zheng, Bin
    [J]. SENSORS, 2021, 21 (03) : 1 - 15
  • [6] Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay
    Cicek, Dogan C.
    Duran, Enes
    Saglam, Baturay
    Mutlu, Furkan B.
    Kozat, Suleyman S.
    [J]. 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 1255 - 1262
  • [7] Krishnan A., 2020, International Journal of Robotics and Automation, V6, P1
  • [8] Constrained path planning of autonomous underwater vehicle using selectively-hybridized particle swarm optimization algorithms
    Lim, Hui Sheng
    Fan, Shuangshuang
    Chin, Christopher K. H.
    Chai, Shuhong
    Bose, Neil
    Kim, Eonjoo
    [J]. IFAC PAPERSONLINE, 2019, 52 (21): : 315 - 322
  • [9] Sliding Mode Control with Gaussian Process Regression for Underwater Robots
    Lima, Gabriel S.
    Trimpe, Sebastian
    Bessa, Wallace M.
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 99 (3-4) : 487 - 498
  • [10] Multi-autonomous underwater vehicles collaboratively search for intelligent targets in an unknown environment in the presence of interception
    Ma, Xi-wen
    Chen, Yan-li
    Bai, Gui-qiang
    Sha, Yong-bai
    Liu, Jun
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2021, 235 (09) : 1539 - 1554