A 2D Optimal Path Planning Algorithm for Autonomous Underwater Vehicle Driving in Unknown Underwater Canyons

被引:27
|
作者
Sun, Yushan [1 ]
Luo, Xiaokun [1 ]
Ran, Xiangrui [1 ]
Zhang, Guocheng [1 ]
机构
[1] Harbin Engn Univ, Sch Naval Engn, Harbin 150001, Peoples R China
关键词
autonomous underwater vehicle; 2D optimal path planning; deep reinforcement learning; unknown underwater canyons environment;
D O I
10.3390/jmse9030252
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
This research aims to solve the safe navigation problem of autonomous underwater vehicles (AUVs) in deep ocean, which is a complex and changeable environment with various mountains. When an AUV reaches the deep sea navigation, it encounters many underwater canyons, and the hard valley walls threaten its safety seriously. To solve the problem on the safe driving of AUV in underwater canyons and address the potential of AUV autonomous obstacle avoidance in uncertain environments, an improved AUV path planning algorithm based on the deep deterministic policy gradient (DDPG) algorithm is proposed in this work. This method refers to an end-to-end path planning algorithm that optimizes the strategy directly. It takes sensor information as input and driving speed and yaw angle as outputs. The path planning algorithm can reach the predetermined target point while avoiding large-scale static obstacles, such as valley walls in the simulated underwater canyon environment, as well as sudden small-scale dynamic obstacles, such as marine life and other vehicles. In addition, this research aims at the multi-objective structure of the obstacle avoidance of path planning, modularized reward function design, and combined artificial potential field method to set continuous rewards. This research also proposes a new algorithm called deep SumTree-deterministic policy gradient algorithm (SumTree-DDPG), which improves the random storage and extraction strategy of DDPG algorithm experience samples. According to the importance of the experience samples, the samples are classified and stored in combination with the SumTree structure, high-quality samples are extracted continuously, and SumTree-DDPG algorithm finally improves the speed of the convergence model. Finally, this research uses Python language to write an underwater canyon simulation environment and builds a deep reinforcement learning simulation platform on a high-performance computer to conduct simulation learning training for AUV. Data simulation verified that the proposed path planning method can guide the under-actuated underwater robot to navigate to the target without colliding with any obstacles. In comparison with the DDPG algorithm, the stability, training's total reward, and robustness of the improved Sumtree-DDPG algorithm planner in this study are better.
引用
收藏
页码:1 / 27
页数:24
相关论文
共 50 条
  • [21] Autonomous Underwater Vehicle Path Planning Method of Soft Actor-Critic Based on Game Training
    Wang, Zhuo
    Lu, Hao
    Qin, Hongde
    Sui, Yancheng
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (12)
  • [22] Underwater Dynamic Target Tracking of Autonomous Underwater Vehicle Based on MPC Algorithm
    Wei, Yali
    Zhu, Daqi
    Chu, Zhenzhong
    2018 IEEE 8TH INTERNATIONAL CONFERENCE ON UNDERWATER SYSTEM TECHNOLOGY: THEORY AND APPLICATIONS (USYS), 2018,
  • [23] Autonomous underwater vehicle path planning based on actor-multi-critic reinforcement learning
    Wang, Zhuo
    Zhang, Shiwei
    Feng, Xiaoning
    Sui, Yancheng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2021, 235 (10) : 1787 - 1796
  • [24] H∞ control for path tracking of autonomous underwater vehicle motion
    Wang, Lin-Lin
    Wang, Hong-Jian
    Pan, Li-Xin
    ADVANCES IN MECHANICAL ENGINEERING, 2015, 7 (05) : 1 - 18
  • [25] Path-following optimal control of autonomous underwater vehicle based on deep reinforcement learning
    Wang, Zhanyuan
    Li, Yulong
    Ma, Caipeng
    Yan, Xun
    Jiang, Dapeng
    OCEAN ENGINEERING, 2023, 268
  • [26] Generative adversarial interactive imitation learning for path following of autonomous underwater vehicle
    Jiang, Dong
    Huang, Jie
    Fang, Zheng
    Cheng, Chunxi
    Sha, Qixin
    He, Bo
    Li, Guangliang
    OCEAN ENGINEERING, 2022, 260
  • [27] Event-Based Path-Planning and Path-Following in Unknown Environments for Underactuated Autonomous Underwater Vehicles
    Ulyanov, Sergey
    Bychkov, Igor
    Maksimkin, Nikolay
    APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 22
  • [28] A survey on path planning for persistent autonomy of autonomous underwater vehicles
    Zeng, Zheng
    Lian, Lian
    Sammut, Karl
    He, Fangpo
    Tang, Youhong
    Lammas, Andrew
    OCEAN ENGINEERING, 2015, 110 : 303 - 313
  • [29] Research on Autonomous Underwater Vehicle Path Optimization Using a Field Theory-Guided A* Algorithm
    Xu, Zhiyuan
    Shen, Yong
    Xie, Zhexue
    Liu, Yihua
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (10)
  • [30] Optimal Underwater Coverage of a Cellular Region by Autonomous Underwater Vehicle Using Line Sweep Motion
    Choi, Myoung Hwan
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2012, 7 (06) : 1023 - 1033