Wind power data cleaning using RANSAC-based polynomial and linear regression with adaptive threshold

被引:0
作者
Yang, Haineng [1 ,2 ]
Tang, Jie [1 ]
Shao, Wu [1 ]
Yin, Jintian [1 ]
Liu, Baiyang [1 ]
机构
[1] Shaoyang Univ, Sch Elect Engn, Shaoyang 422000, Peoples R China
[2] Shaoyang Polytech, Sch Automot & Intelligent Mfg, Shaoyang 422000, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
关键词
Wind power; Data cleaning; Random Sample Consensus algorithm; Anomalous data; Polynomial regression; Adaptive threshold robust regression; CURVE; TURBINE;
D O I
10.1038/s41598-025-89177-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
As the global demand for clean energy continues to rise, wind power has become one of the most important renewable energy sources. However, wind power data often contains a high proportion of dense anomalies, which not only significantly affect the accuracy of wind power forecasting models but may also mislead grid scheduling decisions, thereby jeopardizing grid security. To address this issue, this paper proposes an adaptive threshold robust regression model (RPR model) based on the combination of the Random Sample Consensus (RANSAC) algorithm and polynomial linear regression for wind power data cleaning. The model successfully captures the nonlinear relationship between wind speed and power by extending the polynomial features of wind speed and power, enabling the linear regression model to handle the nonlinearity. By combining the RANSAC algorithm and polynomial linear regression, a robust polynomial regression model is constructed to tackle anomalous data and enhance the accuracy of data cleaning. During the cleaning process, the model first fits the raw data by randomly selecting a minimal sample set, then dynamically adjusts the decision thresholds based on the median of residuals and median absolute deviation (MAD), ensuring effective identification and cleaning of anomalous data. The model's robustness allows it to maintain efficient cleaning performance even with a high proportion of anomalous data, addressing the limitations of existing methods when handling densely distributed anomalies. The effectiveness and innovation of the proposed method were validated by applying it to real data from a wind farm operated by Longyuan Power. Compared to other commonly used cleaning methods, such as the Bidirectional Change Point Grouping Quartile Statistical Model, Principal Contour Image Processing Model, DBSCAN Clustering Model, and Support Vector Machine (SVM) Model, experimental results showed that the proposed method delivered the best performance in improving data quality. Specifically, the method significantly reduced the average absolute error (MAE) of the wind power forecasting model by 72.1%, which is higher than the reductions observed in other methods (ranging from 37.3 to 52.7%). Moreover, it effectively reduced the prediction error of the Convolutional Neural Network (CNN) + Gated Recurrent Unit (GRU) forecasting model, ensuring high prediction accuracy. The adaptive threshold robust regression model proposed in this study is innovative and has significant application potential. It provides an effective new approach for wind power data cleaning, applicable not only to conventional scenarios with low proportions of anomalous data but also to complex datasets with a high proportion of dense anomalies.
引用
收藏
页数:23
相关论文
共 49 条
  • [1] Abed RA., 2024, Measurement: Sens, V35, DOI DOI 10.1016/J.MEASEN.2024.101299
  • [2] A new IGDT-based robust model for day-ahead scheduling of smart power system integrated with compressed air energy storage and dynamic rating of transformers and lines
    Aghdam, Elyar Asadzadeh
    Moslemi, Sahar
    Nakisaee, Mohammad Sadegh
    Fakhrooeian, Mahan
    Al-Hassanawy, Ali Jawad Kadhim
    Masali, Milad Hadizadeh
    Seyyedi, Abbas Zare Ghaleh
    [J]. JOURNAL OF ENERGY STORAGE, 2025, 105
  • [3] Baidu A. I., About us
  • [4] Wind and solar power forecasting based on hybrid CNN-ABiLSTM, CNN-transformer-MLP models
    Bashir, Tasarruf
    Wang, Huifang
    Tahir, Mustafa
    Zhang, Yixiang
    [J]. RENEWABLE ENERGY, 2025, 239
  • [5] Bharti S, 2022, 2022 IEEE INTERNATIONAL CONFERENCE ON POWER ELECTRONICS, DRIVES AND ENERGY SYSTEMS, PEDES
  • [6] Multi-objective design of the energy storage-based combined heat and power off-grid system to supply of thermal and electricity consumption energies
    Ghobadi, Kasra
    Rashid, Sara Mahmoudi
    Zare-Ghaleh-Seyyedi, Abbas
    Moosanezhad, Jaber
    Khan, Ashraf Ali
    [J]. JOURNAL OF ENERGY STORAGE, 2023, 73
  • [7] Haotian Z., 2022, Acta Energiae Solaris Sinica, V43, P167
  • [8] Spatiotemporal forecasting using multi-graph neural network assisted dual domain transformer for wind power
    Hou, Guolian
    Li, Qingwei
    Huang, Congzhi
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2025, 325
  • [9] Wind power forecasting method of large-scale wind turbine clusters based on DBSCAN clustering and an enhanced hunter-prey optimization algorithm
    Hou, Guolian
    Wang, Junjie
    Fan, Yuzhen
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2024, 307
  • [10] Daily condition monitoring of grid-connected wind turbine via high-fidelity power curve and its comprehensive rating
    Hu, Yang
    Xi, Yunhua
    Pan, Chenyang
    Li, Gengda
    Chen, Baowei
    [J]. RENEWABLE ENERGY, 2020, 146 : 2095 - 2111