Reactive control of a two-body point absorber using reinforcement learning

被引:53
作者
Anderlini, E. [1 ,2 ,3 ,4 ]
Forehand, D. I. M. [1 ]
Bannon, E. [2 ]
Xiao, Q. [3 ]
Abusara, M. [4 ]
机构
[1] Univ Edinburgh, Inst Energy Syst, Faraday Bldg,Colin Maclaurin Rd, Edinburgh EH9 3DW, Midlothian, Scotland
[2] Wave Energy Scotland, 10 Inverness Campus, Inverness IV2 5NA, Scotland
[3] Univ Strathclyde, Dept Naval Architecture Ocean & Marine Engn, 100 Montrose St, Glasgow G4 0LZ, Lanark, Scotland
[4] Univ Exeter, Coll Engn Math & Phys Sci, Penryn Campus, Penryn TR10 9FE, England
基金
英国工程与自然科学研究理事会;
关键词
Reinforcement learning (RL); Q-learning; Reactive control; Point absorber; Wave energy converter (WEC); WAVE-ENERGY CONVERTER; MODEL-PREDICTIVE CONTROL; POWER TAKE-OFF; CONTROL STRATEGIES; LATCHING CONTROL; SEA; DEVICE; OPERATION; ARRAYS; DESIGN;
D O I
10.1016/j.oceaneng.2017.08.017
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
In this article, reinforcement learning is used to obtain optimal reactive control of a two-body point absorber. In particular, the Q-learning algorithm is adopted for the maximization of the energy extraction in each sea state. The controller damping and stiffness coefficients are varied in steps, observing the associated reward, which corresponds to an increase in the absorbed power, or penalty, owing to large displacements. The generated power is averaged over a time horizon spanning several wave cycles due to the periodicity of ocean waves, discarding the transient effects at the start of each new episode. The model of a two-body point absorber is developed in order to validate the control strategy in both regular and irregular waves. In all analysed sea states, the controller learns the optimal damping and stiffness coefficients. Furthermore, the scheme is independent of internal models of the device response, which means that it can adapt to variations in the unit dynamics with time and does not suffer from modelling errors.
引用
收藏
页码:650 / 658
页数:9
相关论文
共 35 条
[1]   Model Predictive Control of a Nonlinear 2-Body Point Absorber Wave Energy Converter With Estimated State Feedback [J].
Amann, Kai Uwe ;
Magana, Mario Edgardo ;
Sawodny, Oliver .
IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2015, 6 (02) :336-345
[2]  
Anderlini E., 2017, IEEE T SUSTAIN ENERG
[3]   Control of a Point Absorber Using Reinforcement Learning [J].
Anderlini, Enrico ;
Forehand, David I. M. ;
Stansell, Paul ;
Xiao, Qing ;
Abusara, Mohammad .
IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2016, 7 (04) :1681-1690
[4]  
[Anonymous], 2015, TECH REP
[5]  
[Anonymous], 2015, Reinforcement Learning: An Introduction
[6]   Optimal latching control of a wave energy device in regular and irregular waves [J].
Babarit, A. ;
Clement, A. H. .
APPLIED OCEAN RESEARCH, 2006, 28 (02) :77-91
[7]   Comparison of latching control strategies for a heaving wave energy device in random sea [J].
Babarit, A ;
Duclos, G ;
Clément, AH .
APPLIED OCEAN RESEARCH, 2004, 26 (05) :227-238
[8]   Declutching control of a wave energy converter [J].
Babarit, Aurelien ;
Guglielmi, Michel ;
Clement, Alain H. .
OCEAN ENGINEERING, 2009, 36 (12-13) :1015-1024
[9]  
Brekken T. K., 2011, P IEEE TRONDH POWERT, P1
[10]  
BUDAL K, 1977, MAR SCI COMMUN, V3, P133