Online Approximate Optimal Station Keeping of a Marine Craft in the Presence of an Irrotational Current

被引:23
作者
Walters, Patrick [1 ]
Kamalapurkar, Rushikesh [2 ]
Voight, Forrest [3 ]
Schwartz, Eric M. [3 ]
Dixon, Warren E. [1 ]
机构
[1] Univ Florida, Dept Mech & Aerosp Engn, Gainesville, FL 32611 USA
[2] Oklahoma State Univ, Dept Mech & Aerosp Engn, Stillwater, OK 74074 USA
[3] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
基金
美国国家科学基金会;
关键词
Adaptive dynamic programming (ADP); marine craft; nonlinear control; station keeping; ADAPTIVE-CONTROL; FEEDBACK-CONTROL; SYSTEMS;
D O I
10.1109/TRO.2018.2791600
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Online approximation of the optimal station-keeping strategy for a marine craft subject to an irrotational current is considered. An approximate policy that minimizes a user-defined cost function over an infinite time horizon is obtained using an actor-critic-identifier-based adaptive dynamic programming technique. The hydrodynamic drift dynamics are assumed to be unknown; therefore, a concurrent learning-based system identifier is developed to identify the unknown model parameters. The identified model is used to implement an adaptive model-based reinforcement learning technique to estimate the unknown value function. The developed policy guarantees uniformly ultimately bounded convergence of the vehicle to the desired station and uniformly ultimately bounded convergence of the approximated policies to the optimal polices without the requirement of persistence of excitation. The developed strategy is validated using an autonomous underwater vehicle, where the three degrees-of-freedom in the horizontal plane are regulated. The experiments are conducted in a second-magnitude spring located in central Florida.
引用
收藏
页码:486 / 496
页数:11
相关论文
共 27 条
[1]  
[Anonymous], 2004, IEEE T AUTOMAT CONTR, DOI DOI 10.1109/TAC.1972.1100008
[2]   Successive Galerkin approximation algorithms for nonlinear optimal and robust control [J].
Beard, RW ;
McLain, TW .
INTERNATIONAL JOURNAL OF CONTROL, 1998, 71 (05) :717-743
[3]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[4]   Concurrent learning adaptive control of linear systems with exponentially convergent bounds [J].
Chowdhary, Girish ;
Yucelen, Tansel ;
Muehlegg, Maximillian ;
Johnson, Eric N. .
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2013, 27 (04) :280-301
[5]   Theory and Flight-Test Validation of a Concurrent-Learning Adaptive Controller [J].
Chowdhary, Girish V. ;
Johnson, Eric N. .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2011, 34 (02) :592-607
[6]  
Dixon WE, 2003, CONTROL ENGN SER BIR
[7]   Nonlinear RISE-Based Control of an Autonomous Underwater Vehicle [J].
Fischer, Nicholas ;
Hughes, Devin ;
Walters, Patrick ;
Schwartz, Eric M. ;
Dixon, Warren E. .
IEEE TRANSACTIONS ON ROBOTICS, 2014, 30 (04) :845-852
[8]  
Fossen T.I., 2011, HDB MARINE CRAFT HYD, DOI [10.1002/9781119994138, DOI 10.1002/9781119994138]
[9]   Nonlinear output feedback control of dynamically positioned ships using vectorial observer backstepping [J].
Fossen, TI ;
Grovlen, A .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 1998, 6 (01) :121-128
[10]  
Gelb A., 1989, Applied Optimal Estimation