Energy optimization of wind turbines via a neural control policy based on reinforcement learning Markov chain Monte Carlo algorithm

被引:15
作者
Aghaei, Vahid Tavakol [1 ,2 ]
Agababaoglu, Arda [3 ]
Bawo, Biram [4 ]
Naseradinmousavi, Peiman [5 ]
Yildirim, Sinan [6 ]
Yesilyurt, Serhat [4 ]
Onat, Ahmet [7 ]
机构
[1] Istinye Univ, Fac Engn & Nat Sci Elect & Elect Engn, Istanbul, Turkiye
[2] Sabanci Univ, Ctr Excellence Funct Surfaces & Interfaces Nanodi, TR-34956 Istanbul, Turkiye
[3] ITU Ari Technol Pk, Delivers AI, Istanbul, Turkiye
[4] Sabanci Univ, Fac Engn & Nat Sci Mechatron Engn, Istanbul, Turkiye
[5] San Diego State Univ, Dynam Syst & Control Lab, San Diego, CA USA
[6] Sabanci Univ, Fac Engn & Nat Sci Ind Engn, Istanbul, Turkiye
[7] Istanbul Tech Univ, Fac Elect & Elect Engn, Control & Automat Engn Dept, Istanbul, Turkiye
关键词
Reinforcement learning (RL); Wind turbine energy maximization; Neural control; Markov chain Monte Carlo (MCMC); Bayesian learning; Deep deterministic policy gradient (DDPG); MODEL-PREDICTIVE CONTROL; TERM-MEMORY NETWORK; BAYESIAN OPTIMIZATION; CONVERSION SYSTEMS; MPPT ALGORITHM; POWER; TRACKING; FARM; DESIGN; EXTRACTION;
D O I
10.1016/j.apenergy.2023.121108
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
This study focuses on the numerical analysis and optimal control of vertical-axis wind turbines (VAWT) using Bayesian reinforcement learning (RL). We specifically address small-scale wind turbines, which are well-suited to local and compact production of electrical energy on a small scale, such as urban and rural infrastructure installations. Existing literature concentrates on large scale wind turbines which run in unobstructed, mostly constant wind profiles. However urban installations generally must cope with rapidly changing wind patterns. To bridge this gap, we formulate and implement an RL strategy using the Markov chain Monte Carlo (MCMC) algorithm to optimize the long-term energy output of a wind turbine. Our MCMC-based RL algorithm is a model-free and gradient-free algorithm, in which the designer does not have to know the precise dynamics of the plant and its uncertainties. Our method addresses the uncertainties by using a multiplicative reward structure, in contrast with additive reward used in conventional RL approaches. We have shown numerically that the method specifically overcomes the shortcomings typically associated with conventional solutions, including, but not limited to, component aging, modeling errors, and inaccuracies in the estimation of wind speed patterns. Our results show that the proposed method is especially successful in capturing power from wind transients; by modulating the generator load and hence the rotor torque load, so that the rotor tip speed quickly reaches the optimum value for the anticipated wind speed. This ratio of rotor tip speed to wind speed is known to be critical in wind power applications. The wind to load energy efficiency of the proposed method was shown to be superior to two other methods; the classical maximum power point tracking method and a generator controlled by deep deterministic policy gradient (DDPG) method.
引用
收藏
页数:19
相关论文
共 86 条
[31]   Adaptive neural dynamic surface control for uniform energy exploitation of floating wind turbine [J].
Keighobadi, Jafar ;
KhalafAnsar, Hadi Mohammadian ;
Naseradinmousavi, Peiman .
APPLIED ENERGY, 2022, 316
[32]   Robust nonlinear control of atomic force microscope via immersion and invariance [J].
Keighobadi, Jafar ;
Hosseini-Pishrobat, Mehran ;
Faraji, Javad ;
Oveisi, Atta ;
Nestorovic, Tamara .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2019, 29 (04) :1031-1050
[33]   Wind energy in the city: An interurban comparison of social acceptance of wind energy projects [J].
Khorsand, Iman ;
Kormos, Christine ;
MacDonald, Erin G. ;
Crawford, Curran .
ENERGY RESEARCH & SOCIAL SCIENCE, 2015, 8 :66-77
[34]   A reinforcement learning approach for MPPT control method of photovoltaic sources [J].
Kofinas, P. ;
Doltsinis, S. ;
Dounis, A. I. ;
Vouros, G. A. .
RENEWABLE ENERGY, 2017, 108 :461-473
[35]   Long short term memory-convolutional neural network based deep hybrid approach for solar irradiance forecasting [J].
Kumari, Pratima ;
Toshniwal, Durga .
APPLIED ENERGY, 2021, 295
[36]  
Lasheen M, 2015, J ELECT ENG, V15, P21
[37]   A grey wolf optimized fuzzy logic based MPPT for shaded solar photovoltaic systems in microgrids [J].
Laxman, Bhukya ;
Annamraju, Anil ;
Srikanth, Nandiraju Venkata .
INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2021, 46 (18) :10653-10665
[38]  
Levine S, 2016, J MACH LEARN RES, V17
[39]   Drift-free current sensorless MPPT algorithm in photovoltaic systems [J].
Li, Xingshuo ;
Wen, Huiqing ;
Hu, Yihua ;
Jiang, Lin .
SOLAR ENERGY, 2019, 177 :118-126
[40]  
Lillicrap T. P., 2015, ARXIV