Direct Adaptive Pole-Placement Controller using Deep Reinforcement Learning: Application to AUV Control

Cited by: 5

Authors:

Chaffre, Thomas ^{[1,2,4]}

Le Chenadec, Gilles ^{[1]}

Sammut, Karl ^{[2,4]}

Chauveau, Estelle ^{[3]}

Clement, Benoit ^{[1,2,4]}

Affiliations:

[1] ENSTA Bretagne, CNRS, UMR 6285, Lab STICC, Brest, France

[2] Flinders Univ S Australia, Ctr Maritime Engn, Bedford Pk, SA, Australia

[3] Naval Grp Res, Ollioules, France

[4] CNRS, CROSSING IRL 2010, Adelaide, SA, Australia

Source:

IFAC PAPERSONLINE | 2021, Vol. 54, Issue 16

Keywords:

Adaptive control; Pole-placement; Deep reinforcement learning; Underwater vehicle;

DOI:

10.1016/j.ifacol.2021.10.113

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

In this paper, we investigate a direct adaptive, learning-based tuning strategy for the control of an underwater vehicle subject to unknown disturbances. The process can be modeled as a double integrator without delay and is usually regulated with a PD/PID-type controller. Tuning the controller parameters requires a trade-off between performance and robustness, because no single controller is optimal across multiple operating conditions. We therefore re-parametrize the PID controller gains in a space of poles in which controller stability is guaranteed. We propose to explore this space with Soft Actor-Critic (SAC), a maximum entropy deep reinforcement learning algorithm. The adaptation procedure captures a wide variety of desired pole locations, which allows it to adapt to process variations without measuring them. Simulation results show the advantages of this approach. Copyright (C) 2021 The Authors.
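To make the pole-space re-parametrization concrete, the following is a minimal sketch, not the authors' implementation. It assumes an ideal PID controller (no derivative filter) acting on a pure double integrator 1/(m s^2), so that the closed-loop characteristic polynomial is m s^3 + Kd s^2 + Kp s + Ki. The function name `pid_gains_from_poles` and the `mass` parameter are hypothetical names introduced for illustration. In this setting, any choice of three poles in the open left half-plane yields gains that stabilize the nominal model, which is the kind of search space a learning agent could explore.

```python
import numpy as np

def pid_gains_from_poles(poles, mass=1.0):
    """Map three desired closed-loop poles to (kp, ki, kd).

    Assumed model (illustrative only): plant 1/(mass*s^2) with an
    ideal PID controller kp + ki/s + kd*s in unity feedback.
    """
    poles = np.asarray(poles, dtype=complex)
    if poles.shape != (3,):
        raise ValueError("expected exactly three poles")
    if np.any(poles.real >= 0):
        raise ValueError("all poles must lie in the open left half-plane")
    # Monic polynomial with the requested roots: s^3 + a2*s^2 + a1*s + a0
    coeffs = np.poly(poles)
    if np.max(np.abs(coeffs.imag)) > 1e-9:
        raise ValueError("complex poles must come in conjugate pairs")
    _, a2, a1, a0 = coeffs.real
    # Match against s^3 + (kd/mass)*s^2 + (kp/mass)*s + ki/mass
    return mass * a1, mass * a0, mass * a2

# Example: one real pole and one complex-conjugate pair
kp, ki, kd = pid_gains_from_poles([-1 + 1j, -1 - 1j, -2.0])
# The roots of the resulting characteristic polynomial are the requested poles
closed_loop_poles = np.roots([1.0, kd, kp, ki])
```

Because the mapping goes from poles to gains, a tuning policy that outputs pole locations never has to check stability after the fact; under the stated assumptions, the left half-plane constraint is enforced on the inputs.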

Pages: 333-340

Page count: 8
