Deep reinforcement learning for adaptive path planning and control of an autonomous underwater vehicle

被引:70
作者
Hadi, Behnaz [1 ]
Khosravi, Alireza [1 ]
Sarhadi, Pouria [2 ]
机构
[1] Babol Noshirvani Univ Technol, Dept Elect & Comp Engn, Babol, Iran
[2] Univ Hertfordshire, Sch Phys Engn & Comp Sci, Hatfield, England
关键词
Autonomous underwater vehicle (AUV); Deep reinforcement learning (DRL); Motion planning; Obstacle avoidance; Adaptive actor-critic network; ANTIWINDUP COMPENSATOR; OPTIMIZATION; DESIGN; AUV;
D O I
10.1016/j.apor.2022.103326
中图分类号
P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Research into intelligent motion planning methods has been driven by the growing autonomy of autonomous underwater vehicles (AUV) in complex unknown environments. Deep reinforcement learning (DRL) algorithms with actor-critic structures are optimal adaptive solutions that render online solutions for completely unknown systems. The present study proposes an adaptive motion planning and obstacle avoidance technique based on deep reinforcement learning for an AUV. The research employs a twin-delayed deep deterministic policy algorithm, which is suitable for Markov processes with continuous actions. Environmental observations are the vehicle's sensor navigation information. Motion planning is carried out without having any knowledge of the environment. A comprehensive reward function has been developed for control purposes. The proposed system is robust to the disturbances caused by ocean currents. The simulation results show that the motion planning system can precisely guide an AUV with six-degrees-of-freedom dynamics towards the target. In addition, the intelligent agent has appropriate generalization power.
引用
收藏
页数:14
相关论文
共 44 条
[11]  
Fossen T.I., 1999, Guidance and Control of Ocean Vehicles
[12]   ADAPTIVE-CONTROL OF NONLINEAR-SYSTEMS - A CASE-STUDY OF UNDERWATER ROBOTIC SYSTEMS [J].
FOSSEN, TI ;
SAGATUN, SI .
JOURNAL OF ROBOTIC SYSTEMS, 1991, 8 (03) :393-412
[13]  
Fujimoto S, 2018, PR MACH LEARN RES, V80
[14]  
Garau B, 2005, IEEE INT CONF ROBOT, P194
[15]  
Gaskett C., 1999, P AUSTR C ROBOTICS A
[16]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[17]   An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning [J].
Guo, Siyu ;
Zhang, Xiuguo ;
Zheng, Yisong ;
Du, Yiquan .
SENSORS, 2020, 20 (02)
[18]   A Review of the Path Planning and Formation Control for Multiple Autonomous Underwater Vehicles [J].
Hadi, Behnaz ;
Khosravi, Alireza ;
Sarhadi, Pouria .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 101 (04)
[19]   Deep Reinforcement Learning Controller for 3D Path Following and Collision Avoidance by Autonomous Underwater Vehicles [J].
Havenstrom, Simen Theie ;
Rasheed, Adil ;
San, Omer .
FRONTIERS IN ROBOTICS AND AI, 2021, 7
[20]   Reinforcement learning and optimal adaptive control: An overview and implementation examples [J].
Khan, Said G. ;
Herrmann, Guido ;
Lewis, Frank L. ;
Pipe, Tony ;
Melhuish, Chris .
ANNUAL REVIEWS IN CONTROL, 2012, 36 (01) :42-59