Control of a Point Absorber Using Reinforcement Learning

被引:70
作者
Anderlini, Enrico [1 ]
Forehand, David I. M. [2 ]
Stansell, Paul [3 ]
Xiao, Qing [4 ]
Abusara, Mohammad [5 ]
机构
[1] Ind Doctoral Ctr Offshore Renewable Energy, Edinburgh EH9 3JL, Midlothian, Scotland
[2] Univ Edinburgh, Inst Energy Syst, Edinburgh EH9 3DW, Midlothian, Scotland
[3] Dell SecureWorks, Edinburgh EH3 5DA, Midlothian, Scotland
[4] Univ Strathclyde, Dept Naval Architecture Ocean & Marine Engn, Glasgow G4 0LZ, Lanark, Scotland
[5] Univ Exeter, Coll Engn Math & Phys Sci, Penryn TR10 9FE, England
基金
英国工程与自然科学研究理事会;
关键词
Wave energy converter (WEC); power take-off (PTO) system; reinforcement learning (RL); Q-learning; WAVE-ENERGY CONVERTERS; POWER TAKE-OFF; CONTROL STRATEGIES; LATCHING CONTROL; SYSTEM; OPERATION; DEVICE; SEA;
D O I
10.1109/TSTE.2016.2568754
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This work presents the application of reinforcement learning for the optimal resistive control of a point absorber. The model-free Q-learning algorithm is selected in order to maximise energy absorption in each sea state. Step changes are made to the controller damping, observing the associated penalty, for excessive motions, or reward, i.e. gain in associated power. Due to the general periodicity of gravity waves, the absorbed power is averaged over a time horizon lasting several wave periods. The performance of the algorithm is assessed through the numerical simulation of a point absorber subject to motions in heave in both regular and irregular waves. The algorithm is found to converge towards the optimal controller damping in each sea state. Additionally, the model-free approach ensures the algorithm can adapt to changes to the device hydrodynamics over time and is unbiased by modelling errors.
引用
收藏
页码:1681 / 1690
页数:10
相关论文
共 31 条