Motion control of autonomous underwater vehicle based on physics-informed offline reinforcement learning
Cited by: 1
Authors: Li, Xinmao [1,2]; Geng, Lingbo [1]; Liu, Kaizhou [1]; Zhao, Yifeng [1,2]; Du, Weifeng [1]
Affiliations:
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
Online reinforcement learning (RL) methods for autonomous underwater vehicles (AUVs) are time-consuming and unsafe because they require real-world interaction. Offline RL methods can improve efficiency and safety by training with dynamic models, but an accurate AUV model is difficult to obtain because of the vehicle's highly nonlinear dynamics. These limitations restrict the application of RL methods to AUV control. To address this issue, we propose physics-informed model-based conservative offline policy optimization (PICOPO). By combining physics-informed dynamic modelling with offline RL, it offers the advantages of a small dataset, strong generalizability, and high safety. First, PICOPO constructs a physics-informed model from a small offline dataset to serve as the digital twin (DT) of the actual AUV. This DT can forecast the long-term motion states of the AUV with high precision. The RL-based controller is then trained offline within this DT, eliminating the need for real-world interaction and allowing direct deployment to the AUV without fine-tuning. Simulations and field tests are carried out to evaluate the proposed method. Our results demonstrate that PICOPO achieves accurate motion control with just 2000 samples and enables zero-shot sim-to-real transfer, showing strong generalizability across various motion control tasks.
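The workflow outlined in the abstract (fit a physics-structured model from a small offline log, then evaluate a controller inside that model instead of on the vehicle) can be illustrated with a toy sketch. This is not the PICOPO algorithm: the one-degree-of-freedom surge equation, the drag coefficients, the noise-free log, the proportional controller, and every name below are assumptions made for illustration only. The log size of 2000 merely mirrors the sample count quoted in the abstract.

```python
import random

DT, MASS = 0.1, 30.0            # step size [s] and mass [kg]; assumed known
TRUE_D1, TRUE_D2 = 8.0, 12.0    # hypothetical drag coefficients of the "real" vehicle


def surge_step(u, thrust, d1, d2):
    """One Euler step of the assumed model m*du/dt = thrust - d1*u - d2*u*|u|."""
    return u + DT * (thrust - d1 * u - d2 * u * abs(u)) / MASS


def collect_dataset(n=2000, seed=0):
    """Log (u, thrust, u_next) transitions under random thrust commands."""
    rng = random.Random(seed)
    data, u = [], 0.0
    for _ in range(n):
        thrust = rng.uniform(-40.0, 40.0)
        u_next = surge_step(u, thrust, TRUE_D1, TRUE_D2)
        data.append((u, thrust, u_next))
        u = u_next
    return data


def fit_drag(data):
    """Least-squares fit of (d1, d2); the model structure is fixed by physics."""
    # Rearranged model: thrust - m*(u_next - u)/dt = d1*u + d2*u*|u|
    s11 = s12 = s22 = b1 = b2 = 0.0
    for u, thrust, u_next in data:
        x1, x2 = u, u * abs(u)
        y = thrust - MASS * (u_next - u) / DT
        s11 += x1 * x1
        s12 += x1 * x2
        s22 += x2 * x2
        b1 += x1 * y
        b2 += x2 * y
    det = s11 * s22 - s12 * s12          # 2x2 normal equations, solved directly
    return (b1 * s22 - b2 * s12) / det, (s11 * b2 - s12 * b1) / det


def rollout(d1, d2, policy, u0=0.0, steps=200):
    """Simulate the closed loop for a given parameter set."""
    u, traj = u0, []
    for _ in range(steps):
        u = surge_step(u, policy(u), d1, d2)
        traj.append(u)
    return traj


def policy(u, target=1.0, gain=60.0, limit=40.0):
    """Hypothetical saturated proportional speed controller (stand-in for an RL policy)."""
    return max(-limit, min(limit, gain * (target - u)))


d1_hat, d2_hat = fit_drag(collect_dataset())
twin_traj = rollout(d1_hat, d2_hat, policy)      # evaluation inside the fitted model
real_traj = rollout(TRUE_D1, TRUE_D2, policy)    # reference: the "real" dynamics
max_gap = max(abs(a - b) for a, b in zip(twin_traj, real_traj))
print(f"fitted d1={d1_hat:.4f}, d2={d2_hat:.4f}, max rollout gap={max_gap:.2e}")
```

Because the log here is noise-free and the model structure matches the simulated vehicle exactly, the fit is essentially exact. With sensor noise or unmodelled effects the long-horizon rollout error would grow, which is where a conservative offline RL objective would matter.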