Adversarial deep reinforcement learning based robust depth tracking control for underactuated autonomous underwater vehicle

被引：30

作者：

Wang, Zhao ^{[1
]}

Xiang, Xianbo ^{[1
,2
]}

Duan, Yu ^{[1
]}

Yang, Shaolong ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Sch Naval Architecture & Ocean Engn, Wuhan 430074, Peoples R China

[2] Huazhong Univ Sci & Technol, State Key Lab Intelligent Mfg Equipment & Technol, Wuhan 430074, Peoples R China

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2024年 / 130卷

基金：

中国国家自然科学基金;

关键词：

Autonomous underwater vehicle; Robust control; Depth tracking; Adversarial deep reinforcement learning; Comparative experimental validation; GUIDANCE LAW; PATHS;

D O I：

10.1016/j.engappai.2023.107728

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, an adversarial deep reinforcement learning-based control method is proposed to address the issue of robust depth tracking of an underactuated autonomous underwater vehicle in the presence of intrinsic coupled dynamics and external disturbances. First, long-short-term-memory neural network is presented to memorize and predict the changes in the state of vehicle, and a cascaded multilayer perception projects the output into action space of vehicle. Subsequently, adversarial deep reinforcement learning scheme is applied to the training of control agent by introducing an adversary which counteracts the control behavior, whereby the agent is enabled to learn the control strategy in different distributions of state transition. For evaluation of the performance, a control agent is pre-trained in simulation environment based on the reliable digital model of a real vehicle, and the simulation environment is paced at one iteration per second to align with real-time operations to ensure the portability of training result. Furthermore, the training cost is also extremely reduced. Finally, experiments are conducted with time-varying disturbances to further prove the feasibility of the proposed learning-based control scheme on a prototype of underwater vehicle in towing tank. Moreover, comparative experiment results show the better robustness performance of the learning-based control agent than that of classic line-of-sight based proportional-integral-derivative and adaptive line-of-sight based proportional-integral-derivative controllers in different scenarios.

引用

页数：16

共 47 条

[1] Survey on traditional and AI based estimation techniques for hydrodynamic coefficients of autonomous underwater vehicle [J].

Ahmed, Faheem ;

Xiang, Xianbo ;

Jiang, Chaicheng ;

Xiang, Gong ;

Yang, Shaolong .

OCEAN ENGINEERING, 2023, 268

[2] Robust Model Predictive Control Based on Active Disturbance Rejection Control for a Robotic Autonomous Underwater Vehicle [J].

Arcos-Legarda, Jaime ;

Gutierrez, Alvaro .

JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (05)

[3] Deep Reinforcement Learning A brief survey [J].

Arulkumaran, Kai ;

Deisenroth, Marc Peter ;

Brundage, Miles ;

Bharath, Anil Anthony .

IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38

[4] Nonlinear model predictive control for hydrobatics: Experiments with an underactuated AUV [J].

Bhat, Sriharsha ;

Panteli, Chariklia ;

Stenius, Ivan ;

Dimarogonas, Dimos V. .

JOURNAL OF FIELD ROBOTICS, 2023, 40 (07) :1840-1859

[5] High-resolution visual seafloor mapping and classification using long range capable AUV for ship-free benthic surveys [J].

Bodenmann, Adrian ;

Cappelletto, Jose ;

Massot-Campos, Miquel ;

Newborough, Darryl ;

Chaney, Ed ;

Marlow, Rachel ;

Templeton, Robert ;

Phillips, Alexander B. ;

Bett, Brian J. ;

Wardell, Catherine ;

Thornton, Blair .

2023 IEEE UNDERWATER TECHNOLOGY, UT, 2023,

[6] Comparison of two second-order sliding mode control algorithms for an articulated intervention AUV: Theory and experimental results [J].

Borlaug, Ida-Louise G. ;

Pettersen, Kristin Y. ;

Gravdahl, Jan Tommy .

OCEAN ENGINEERING, 2021, 222

[7]

Staudemeyer RC, 2019, Arxiv, DOI [arXiv:1909.09586, 10.48550/arXiv.1909.09586]

[8] Reinforcement learning based model-free optimized trajectory tracking strategy design for an AUV [J].

Duan, Kairong ;

Fong, Simon ;

Chen, C. L. Philip .

NEUROCOMPUTING, 2022, 469 :289-297

[9] Robust Adaptive Neural Network Tracking Control With Optimized Super-Twisting Sliding-Mode Technique for Induction Motor Drive System [J].

El-Sousy, Fayez F. M. ;

Amin, Mahmoud M. ;

Mohammed, Osama A. .

IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2022, 58 (03) :4134-4157

[10] Reinforcement learning-based saturated adaptive robust neural-network control of underactuated autonomous underwater vehicles [J].

Elhaki, Omid ;

Shojaei, Khoshnam ;

Mehrmohammadi, Parisa .

EXPERT SYSTEMS WITH APPLICATIONS, 2022, 197

← 1 2 3 4 5 →