Deep reinforcement learning for adaptive path planning and control of an autonomous underwater vehicle

被引：70

作者：

Hadi, Behnaz ^{[1
]}

Khosravi, Alireza ^{[1
]}

Sarhadi, Pouria ^{[2
]}

机构：

[1] Babol Noshirvani Univ Technol, Dept Elect & Comp Engn, Babol, Iran

[2] Univ Hertfordshire, Sch Phys Engn & Comp Sci, Hatfield, England

来源：

APPLIED OCEAN RESEARCH | 2022年 / 129卷

关键词：

Autonomous underwater vehicle (AUV); Deep reinforcement learning (DRL); Motion planning; Obstacle avoidance; Adaptive actor-critic network; ANTIWINDUP COMPENSATOR; OPTIMIZATION; DESIGN; AUV;

D O I：

10.1016/j.apor.2022.103326

中图分类号：

P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

Research into intelligent motion planning methods has been driven by the growing autonomy of autonomous underwater vehicles (AUV) in complex unknown environments. Deep reinforcement learning (DRL) algorithms with actor-critic structures are optimal adaptive solutions that render online solutions for completely unknown systems. The present study proposes an adaptive motion planning and obstacle avoidance technique based on deep reinforcement learning for an AUV. The research employs a twin-delayed deep deterministic policy algorithm, which is suitable for Markov processes with continuous actions. Environmental observations are the vehicle's sensor navigation information. Motion planning is carried out without having any knowledge of the environment. A comprehensive reward function has been developed for control purposes. The proposed system is robust to the disturbances caused by ocean currents. The simulation results show that the motion planning system can precisely guide an AUV with six-degrees-of-freedom dynamics towards the target. In addition, the intelligent agent has appropriate generalization power.

引用

页数：14

共 44 条

[11]

Fossen T.I., 1999, Guidance and Control of Ocean Vehicles

[12] ADAPTIVE-CONTROL OF NONLINEAR-SYSTEMS - A CASE-STUDY OF UNDERWATER ROBOTIC SYSTEMS [J].