Deep reinforcement learning for adaptive path planning and control of an autonomous underwater vehicle

被引:70
作者
Hadi, Behnaz [1 ]
Khosravi, Alireza [1 ]
Sarhadi, Pouria [2 ]
机构
[1] Babol Noshirvani Univ Technol, Dept Elect & Comp Engn, Babol, Iran
[2] Univ Hertfordshire, Sch Phys Engn & Comp Sci, Hatfield, England
关键词
Autonomous underwater vehicle (AUV); Deep reinforcement learning (DRL); Motion planning; Obstacle avoidance; Adaptive actor-critic network; ANTIWINDUP COMPENSATOR; OPTIMIZATION; DESIGN; AUV;
D O I
10.1016/j.apor.2022.103326
中图分类号
P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Research into intelligent motion planning methods has been driven by the growing autonomy of autonomous underwater vehicles (AUV) in complex unknown environments. Deep reinforcement learning (DRL) algorithms with actor-critic structures are optimal adaptive solutions that render online solutions for completely unknown systems. The present study proposes an adaptive motion planning and obstacle avoidance technique based on deep reinforcement learning for an AUV. The research employs a twin-delayed deep deterministic policy algorithm, which is suitable for Markov processes with continuous actions. Environmental observations are the vehicle's sensor navigation information. Motion planning is carried out without having any knowledge of the environment. A comprehensive reward function has been developed for control purposes. The proposed system is robust to the disturbances caused by ocean currents. The simulation results show that the motion planning system can precisely guide an AUV with six-degrees-of-freedom dynamics towards the target. In addition, the intelligent agent has appropriate generalization power.
引用
收藏
页数:14
相关论文
共 44 条
[1]   Docking Control of an Autonomous Underwater Vehicle Using Reinforcement Learning [J].
Anderlini, Enrico ;
Parker, Gordon G. ;
Thomas, Giles .
APPLIED SCIENCES-BASEL, 2019, 9 (17)
[2]   Reinforcement Learning Based Obstacle Avoidance for Autonomous Underwater Vehicle [J].
Bhopale, Prashant ;
Kazi, Faruk ;
Singh, Navdeep .
JOURNAL OF MARINE SCIENCE AND APPLICATION, 2019, 18 (02) :228-238
[3]   RL-based path planning for an over-actuated floating vehicle under disturbances [J].
Blekas, Konstantinos ;
Vlachos, Kostas .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 101 :93-102
[4]   Reinforcement learning for control: Performance, stability, and deep approximators [J].
Busoniu, Lucian ;
de Bruin, Tim ;
Tolic, Domagoj ;
Kober, Jens ;
Palunko, Ivana .
ANNUAL REVIEWS IN CONTROL, 2018, 46 :8-28
[5]  
Carlucho I., 2018, OCEANS 2018 MTSIEEE
[6]   Sparus II AUV-A Hovering Vehicle for Seabed Inspection [J].
Carreras, Marc ;
David Hernandez, Juan ;
Vidal, Eduard ;
Palomeras, Narcis ;
Ribas, David ;
Ridao, Pere .
IEEE JOURNAL OF OCEANIC ENGINEERING, 2018, 43 (02) :344-355
[7]   Concise deep reinforcement learning obstacle avoidance for underactuated unmanned marine vessels [J].
Cheng, Yin ;
Zhang, Weidong .
NEUROCOMPUTING, 2018, 272 :63-73
[8]  
Cruz N.A., 2011, Autonomous Underwater Vehicles
[9]   Adaptive Neural Network Control of AUVs With Control Input Nonlinearities Using Reinforcement Learning [J].
Cui, Rongxin ;
Yang, Chenguang ;
Li, Yang ;
Sharma, Sanjay .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (06) :1019-1029
[10]  
Fossen T.I., 2011, Handbook of Marine Craft Hydrodynamics and Motion Control, DOI DOI 10.1002/9781119994138