Data-driven optimal control of wind turbines using reinforcement learning with function approximation

被引:5
作者
Peng, Shenglin [1 ]
Feng, Qianmei [1 ]
机构
[1] Univ Houston, Dept Ind Engn, Houston, TX 77204 USA
基金
美国国家科学基金会;
关键词
Markov decision process; Reinforcement learning; Function approximation; Optimal control; Wind turbines; KERNEL; ENERGY;
D O I
10.1016/j.cie.2022.108934
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose a reinforcement learning approach with function approximation for maximizing the power output of wind turbines (WTs). The optimal control of wind turbines majorly uses the maximum power point tracking (MPPT) strategy for sequential decision-making that can be modeled as a Markov decision process (MDP). In the literature, the continuous control variables are typically discretized to cope with the curse of dimensionality in traditional dynamic programming methods. To provide a more accurate prediction, we formulate the problem into an MDP with continuous state and action spaces by utilizing the function approximation in reinforcement learning. The commonly used pitch angle is selected as a control variable we are concerned with, which is regarded as the system state along with some other controllable and uncontrollable variables proven to affect the power output. Computational studies of real data are conducted to demonstrate that the proposed method outperforms the existing methods in the literature in obtaining the optimal power output.
引用
收藏
页数:8
相关论文
共 39 条
  • [1] A review of maximum power point tracking algorithms for wind energy systems
    Abdullah, M. A.
    Yatim, A. H. M.
    Tan, C. W. A.
    Saidur, R.
    [J]. RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2012, 16 (05) : 3220 - 3227
  • [2] AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION
    ALTMAN, NS
    [J]. AMERICAN STATISTICIAN, 1992, 46 (03) : 175 - 185
  • [3] STABILITY SIMULATION OF WIND TURBINE SYSTEMS
    ANDERSON, PM
    BOSE, A
    [J]. IEEE TRANSACTIONS ON POWER APPARATUS AND SYSTEMS, 1983, 102 (12): : 3791 - 3795
  • [4] [Anonymous], 2002, A Distribution-Free Theory of Nonparametric Regression
  • [5] Nonlinear control of variable-speed wind turbines for generator torque limiting and power optimization
    Boukhezzar, B.
    Siguerdidjane, H.
    Hand, M. Maureen
    [J]. JOURNAL OF SOLAR ENERGY ENGINEERING-TRANSACTIONS OF THE ASME, 2006, 128 (04): : 516 - 530
  • [6] LOCALLY WEIGHTED REGRESSION - AN APPROACH TO REGRESSION-ANALYSIS BY LOCAL FITTING
    CLEVELAND, WS
    DEVLIN, SJ
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (403) : 596 - 610
  • [7] Debruyne M, 2008, J MACH LEARN RES, V9, P2377
  • [8] ENGIE, 2019, ENGIE OP
  • [9] NON-PARAMETRIC ESTIMATION OF A MULTIVARIATE PROBABILITY DENSITY
    EPANECHN.VA
    [J]. THEORY OF PROBILITY AND ITS APPLICATIONS,USSR, 1969, 14 (01): : 153 - &
  • [10] Ernst D, 2005, J MACH LEARN RES, V6, P503