CARE to Compare: A Real-World Benchmark Dataset for Early Fault Detection in Wind Turbine Data

Authors
Gück, Christian [1]
Roelofs, Cyriana M. A. [1]
Faulstich, Stefan [1]
Affiliations
[1] Fraunhofer IEE, Joseph-Beuys-Straße 8, Kassel
Keywords
anomaly detection; condition monitoring; dataset; early fault detection; predictive maintenance; wind turbines
DOI
10.3390/data9120138
Abstract
Early fault detection is crucial to predictive maintenance of wind turbines, yet comparing algorithms is difficult because domain-specific public datasets are scarce. Many comparisons rely on benchmarks composed of data from many different domains, on inaccessible data, or on one of the few publicly available datasets that lack detailed information about the faults. Moreover, many publications highlight only a couple of case studies in which fault detection was successful. With this paper, we publish a high-quality dataset that contains data from 36 wind turbines across 3 wind farms and, as far as we know, the most detailed fault information of any public wind turbine dataset. The dataset contains 89 years' worth of real-world wind turbine operating data, distributed across 44 labeled time frames for anomalies that led up to faults and 51 time series representing normal behavior. Additionally, the quality of the training data is ensured by turbine-status-based labels for each data point. Furthermore, we propose a new scoring method, CARE (Coverage, Accuracy, Reliability and Earliness), which takes advantage of the depth of information in the dataset to identify good early fault detection models for wind turbines. The score considers anomaly detection performance, the ability to recognize normal behavior properly, and the capability to raise as few false alarms as possible while detecting anomalies early. Dataset: https://doi.org/10.5281/zenodo.14006163 (accessed on 29 October 2024). Dataset License: CC BY-SA 4.0 International. © 2024 by the authors.
Related papers
42 in total
  • [21] Construction of Data-Driven Performance Digital Twin for a Real-World Gas Turbine Anomaly Detection Considering Uncertainty
    Ma, Yangfeifei
    Zhu, Xinyun
    Lu, Jilong
    Yang, Pan
    Sun, Jianzhong
    SENSORS, 2023, 23 (15)
  • [22] A Data-Mining Approach for Wind Turbine Fault Detection Based on SCADA Data Analysis Using Artificial Neural Networks
    Santolamazza, Annalisa
    Dadi, Daniele
    Introna, Vito
    ENERGIES, 2021, 14 (07)
  • [23] Parallel Multiple CNNs With Temporal Predictions for Wind Turbine Blade Cracking Early Fault Detection
    Lu, Quan
    Ye, Wanxing
    Yin, Linfei
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 (1-11) : 1 - 11
  • [24] Wind Turbine Gearbox Fault Detection Using Multiple Sensors With Features Level Data Fusion
    Lu, Y.
    Tang, J.
    Luo, H.
    JOURNAL OF ENGINEERING FOR GAS TURBINES AND POWER-TRANSACTIONS OF THE ASME, 2012, 134 (04)
  • [25] Operational Variables for Improving Industrial Wind Turbine Yaw Misalignment Early Fault Detection Capabilities Using Data-Driven Techniques
    Pandit, Ravi
    Infield, David
    Dodwell, Tim
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [26] Early Fault Warning Method of Wind Turbine Main Transmission System Based on SCADA and CMS Data
    Chen, Huanguo
    Chen, Jie
    Dai, Juchuan
    Tao, Hanyu
    Wang, Xutao
    MACHINES, 2022, 10 (11)
  • [27] Data Fusion Based on an Iterative Learning Algorithm for Fault Detection in Wind Turbine Pitch Control Systems
    Acho, Leonardo
    Pujol-Vazquez, Gisela
    SENSORS, 2021, 21 (24)
  • [28] Airborne Sound Analysis for the Detection of Bearing Faults in Railway Vehicles with Real-World Data
    Kreuzer, Matthias
    Schmidt, David
    Wokusch, Simon
    Kellermann, Walter
    2023 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT, ICPHM, 2023, : 232 - 238
  • [29] Data-driven Semi-supervised Anomaly Detection using Real-World Call Data Record
    Jaffry, Shan
    Shah, Syed Tariq
    Hasan, Syed Faraz
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE WORKSHOPS (WCNCW), 2020
  • [30] WebHound: a data-driven intrusion detection from real-world web access logs
    Wei, Te-En
    Lee, Hahn-Ming
    Jeng, Albert B.
    Lamba, Hemank
    Faloutsos, Christos
    SOFT COMPUTING, 2019, 23 (22) : 11947 - 11965