Predicting Energy Generation in Large Wind Farms: A Data-Driven Study with Open Data and Machine Learning

被引:1
|
作者
Paula, Matheus [1 ]
Casaca, Wallace [2 ]
Colnago, Marilaine [3 ]
da Silva, Jose R. [1 ]
Oliveira, Kleber [1 ]
Dias, Mauricio A. [4 ]
Negri, Rogerio [5 ]
机构
[1] Sao Paulo State Univ UNESP, Fac Engn & Sci, BR-19274000 Rosana, Brazil
[2] Sao Paulo State Univ UNESP, Inst Biosci Letters & Exact Sci, BR-15054000 Sao Jose Do Rio Preto, Brazil
[3] Sao Paulo State Univ UNESP, Inst Chem, BR-4800060 Araraquara, Brazil
[4] Sao Paulo State Univ UNESP, Fac Sci & Technol, BR-19060080 Presidente Prudente, Brazil
[5] Sao Paulo State Univ UNESP, Sci & Technol Inst, BR-12245000 Sao Jose Dos Campos, Brazil
基金
巴西圣保罗研究基金会;
关键词
wind energy; forecasting; wind farms; machine learning; data science; LONG-TERM WIND; FORECAST; SYSTEM; BRAZIL;
D O I
10.3390/inventions8050126
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Wind energy has become a trend in Brazil, particularly in the northeastern region of the country. Despite its advantages, wind power generation has been hindered by the high volatility of exogenous factors, such as weather, temperature, and air humidity, making long-term forecasting a highly challenging task. Another issue is the need for reliable solutions, especially for large-scale wind farms, as this involves integrating specific optimization tools and restricted-access datasets collected locally at the power plants. Therefore, in this paper, the problem of forecasting the energy generated at the Praia Formosa wind farm, an eco-friendly park located in the state of Ceara, Brazil, which produces around 7% of the state's electricity, was addressed. To proceed with our data-driven analysis, publicly available data were collected from multiple Brazilian official sources, combining them into a unified database to perform exploratory data analysis and predictive modeling. Specifically, three machine-learning-based approaches were applied: Extreme Gradient Boosting, Random Forest, and Long Short-Term Memory Network, as well as feature-engineering strategies to enhance the precision of the machine intelligence models, including creating artificial features and tuning the hyperparameters. Our findings revealed that all implemented models successfully captured the energy-generation trends, patterns, and seasonality from the complex wind data. However, it was found that the LSTM-based model consistently outperformed the others, achieving a promising global MAPE of 4.55%, highlighting its accuracy in long-term wind energy forecasting. Temperature, relative humidity, and wind speed were identified as the key factors influencing electricity production, with peak generation typically occurring from August to November.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Application of Data Science and Machine Learning in the Prediction of College Dropout: A Data-Driven Predictive Approach
    Felix Jimenez, Axel Frederick
    Sanchez Lee, Vania Stephany
    Ibarra Belmonte, Isaul
    Parra Gonzalez, Ezra Federico
    2023 12TH INTERNATIONAL CONFERENCE ON SOFTWARE PROCESS IMPROVEMENT, CIMPS 2023, 2023, : 234 - 243
  • [32] Data-Driven Methods for Predicting ADHD Diagnosis and Related Impairment: The Potential of a Machine Learning Approach
    Goh, Patrick K.
    Elkins, Anjeli R.
    Bansal, Pevitr S.
    Eng, Ashley G.
    Martel, Michelle M.
    RESEARCH ON CHILD AND ADOLESCENT PSYCHOPATHOLOGY, 2023, 51 (05): : 679 - 691
  • [33] Predicting failure pressure of corroded gas pipelines: A data-driven approach using machine learning
    Xiao, Rui
    Zayed, Tarek
    Meguid, Mohamed A.
    Sushama, Laxmi
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2024, 184 : 1424 - 1441
  • [34] Data-Driven Machine Learning Approach for Predicting the Higher Heating Value of Different Biomass Classes
    Afolabi, Inioluwa Christianah
    Epelle, Emmanuel, I
    Gunes, Burcu
    Gulec, Fatih
    Okolie, Jude A.
    CLEAN TECHNOLOGIES, 2022, 4 (04): : 1227 - 1241
  • [35] Data-Driven Methods for Predicting ADHD Diagnosis and Related Impairment: The Potential of a Machine Learning Approach
    Patrick K. Goh
    Anjeli R. Elkins
    Pevitr S. Bansal
    Ashley G. Eng
    Michelle M. Martel
    Research on Child and Adolescent Psychopathology, 2023, 51 : 679 - 691
  • [36] A hybrid data-driven machine learning framework for predicting the performance of coal and biomass gasification processes
    Yang, Qingchun
    Zhang, Jinliang
    Zhou, Jianlong
    Zhao, Lei
    Zhang, Dawei
    FUEL, 2023, 346
  • [37] Data-driven baseline generation for post-retrofit energy saving assessment, a comparison of statistical and machine learning methods
    Kuivjogi, Helena
    Vasman, Sofia
    Petlenkov, Eduard
    Thalfeldt, Martin
    Kurnitski, Jarek
    JOURNAL OF BUILDING ENGINEERING, 2024, 98
  • [38] Data-Driven Wind Farm Control via Multiplayer Deep Reinforcement Learning
    Dong, Hongyang
    Zhao, Xiaowei
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2023, 31 (03) : 1468 - 1475
  • [39] A DATA-DRIVEN WORKFLOW FOR PREDICTION OF FRACTURING PARAMETERS WITH MACHINE LEARNING
    Zhu, Zhihua
    Hsu, Maoya
    Kun, Ding
    Wang, Tianyu
    He, Xiaodong
    Tian, Shouceng
    THERMAL SCIENCE, 2024, 28 (2A): : 1085 - 1090
  • [40] Prediction of casing damage: A data-driven, machine learning approach
    Zhao Y.
    Jiang H.
    Li H.
    International Journal of Circuits, Systems and Signal Processing, 2020, 14 : 1047 - 1053