Motion control of autonomous underwater vehicle based on physics-informed offline reinforcement learning

被引：1

作者：

Li, Xinmao ^{[1
,2
]}

Geng, Lingbo ^{[1
]}

Liu, Kaizhou ^{[1
]}

Zhao, Yifeng ^{[1
,2
]}

Du, Weifeng ^{[1
]}

机构：

[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

来源：

OCEAN ENGINEERING | 2024年 / 313卷

基金：

中国国家自然科学基金;

关键词：

Autonomous underwater vehicle; Offline reinforcement learning; Physics-informed reinforcement learning; Physics informed neural network; Motion control; TRAJECTORY TRACKING;

D O I：

10.1016/j.oceaneng.2024.119432

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

Online reinforcement learning (RL) methods for autonomous underwater vehicles (AUV) are time-consuming and unsafe due to the need for real-world interaction. Offline RL methods can improve efficiency and safety by training with dynamic models, but an accurate model for AUV is difficult to obtain due to its highly nonlinear dynamics. These limit the application of RL methods in AUV control. To solve this issue, we propose physicsinformed model-based conservative offline policy optimization (PICOPO). It offers the advantages of small dataset, strong generalizability and high safety by combining the physics-informed dynamic modelling method and the offline RL technique. First, the PICOPO constructs a physics-informed model based on a small offline dataset to serve as the digital twins (DT) of the actual AUV. This DT can forecast the long-term motion states of AUV with high-precision. The RL-based controller is then trained offline within this DT, eliminating the need for real-world interaction and allowing direct deployment to the AUV without fine-tuning. In this paper, simulations and field tests are carried out to evaluate the proposed method. Our results demonstrate that PICOPO achieves accurate motion control with just 2000 samples and enables zero-shot sim-to-real transfer, showcasing strong generalizability across various motion control tasks.

引用

页数：14

共 31 条

[1] Adaptive low-level control of autonomous underwater vehicles using deep reinforcement learning
Carlucho, Ignacio
De Paula, Mariano
Wang, Sen
Petillot, Yvan
Acosta, Gerardo G.
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 107 : 71 - 86
[2] Deisenroth M., 2011, ICML
[3] Learning-based robust optimal tracking controller design for unmanned underwater vehicles with full-state and input constraints
Dong, Botao
Shi, Yi
Xie, Wei
Chen, Weixing
Zhang, Weidong
[J]. OCEAN ENGINEERING, 2023, 271
[4] AUV position tracking and trajectory control based on fast-deployed deep reinforcement learning method
Fang, Yuan
Huang, Zhenwei
Pu, Jinyun
Zhang, Jinsong
[J]. OCEAN ENGINEERING, 2022, 245
[5] A data-driven tracking control framework using physics-informed neural networks and deep reinforcement learning for dynamical systems
Faria, R. R.
Capron, B. D. O.
Secchi, A. R.
De Souza, M. B.
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
[6] Fossen T.I., 2011, HDB MARINE CRAFT HYD, DOI [10.1002/9781119994138, DOI 10.1002/9781119994138]
[7] Haarnoja T, 2019, Arxiv, DOI [arXiv:1812.05905, 10.48550/arxiv.1812.05905, DOI 10.48550/ARXIV.1812.05905]
[8] Adaptive Neural Network Control of a Marine Vessel With Constraints Using the Asymmetric Barrier Lyapunov Function
He, Wei
Yin, Zhao
Sun, Changyin
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (07) : 1641 - 1651
[9] A general motion controller based on deep reinforcement learning for an autonomous underwater vehicle with unknown disturbances
Huang, Fei
Xu, Jian
Wu, Di
Cui, Yunfei
Yan, Zheping
Xing, Wen
Zhang, Xun
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
[10] A general motion control architecture for an autonomous underwater vehicle with actuator faults and unknown disturbances through deep reinforcement learning
Huang, Fei
Xu, Jian
Yin, Liangang
Wu, Di
Cui, Yunfei
Yan, Zheping
Chen, Tao
[J]. OCEAN ENGINEERING, 2022, 263

← 1 2 3 4 →