CARE to Compare: A Real-World Benchmark Dataset for Early Fault Detection in Wind Turbine Data

Authors
Gück, Christian [1]
Roelofs, Cyriana M. A. [1]
Faulstich, Stefan [1]
Affiliations
[1] Fraunhofer IEE, Joseph-Beuys-Straße 8, Kassel
Keywords
anomaly detection; condition monitoring; dataset; early fault detection; predictive maintenance; wind turbines
DOI
10.3390/data9120138
Abstract
Early fault detection is crucial to predictive maintenance for wind turbines, yet comparing algorithms is difficult because domain-specific public datasets are scarce. Many comparisons of different approaches use either benchmarks composed of data from many different domains, inaccessible data, or one of the few publicly available datasets, which lack detailed information about the faults. Moreover, many publications highlight a couple of case studies in which fault detection was successful. With this paper, we publish a high-quality dataset that contains data from 36 wind turbines across 3 wind farms and, as far as we know, the most detailed fault information of any public wind turbine dataset. The dataset contains 89 years' worth of real-world wind turbine operating data, distributed across 44 labeled time frames for anomalies that led up to faults and 51 time series representing normal behavior. Additionally, turbine-status-based labels for each data point ensure the quality of the training data. Furthermore, we propose a new scoring method, called CARE (Coverage, Accuracy, Reliability and Earliness), which takes advantage of the depth of information in the dataset to identify good early fault detection models for wind turbines. The score considers anomaly detection performance, the ability to recognize normal behavior properly, and the capability to raise as few false alarms as possible while detecting anomalies early.
Dataset: https://doi.org/10.5281/zenodo.14006163 (accessed on 29 October 2024).
Dataset License: CC BY-SA 4.0 International.
© 2024 by the authors.
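The abstract names the ingredients of the CARE score but does not give its formulas. As a rough illustration only, the sketch below shows how three such ingredients could be computed from binary alarm labels: a point-wise F-beta score on an anomaly time frame, the share of unflagged points on a normal-behavior series, and a linear earliness measure. All function names, the choice of beta = 0.5, the linear earliness ramp, and the unweighted mean are assumptions made here for illustration and are not taken from the paper; the Reliability component is left out entirely.

```python
from typing import Sequence


def f_beta(y_true: Sequence[int], y_pred: Sequence[int], beta: float = 0.5) -> float:
    """Point-wise F-beta score; 1 marks an anomalous data point.

    beta < 1 weights precision more than recall (an assumption, not the paper's setting).
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = (1 + beta**2) * tp + beta**2 * fn + fp
    return (1 + beta**2) * tp / denom if denom else 0.0


def normal_accuracy(y_pred: Sequence[int]) -> float:
    """Share of points left unflagged in a series that contains only normal behavior."""
    return sum(1 for p in y_pred if p == 0) / len(y_pred)


def earliness(y_pred_in_window: Sequence[int]) -> float:
    """Hypothetical earliness: 1.0 for an alarm at the start of the labeled
    anomaly time frame, decreasing linearly; 0.0 if no alarm is raised."""
    n = len(y_pred_in_window)
    for i, p in enumerate(y_pred_in_window):
        if p == 1:
            return 1.0 - i / n
    return 0.0


def toy_score(coverage: float, accuracy: float, early: float) -> float:
    """Unweighted mean of the three parts; a placeholder, not the CARE formula."""
    return (coverage + accuracy + early) / 3


if __name__ == "__main__":
    cov = f_beta([0, 0, 1, 1, 1, 1], [0, 1, 0, 1, 1, 1])  # anomaly time frame
    acc = normal_accuracy([0, 0, 1, 0])                   # normal-behavior series
    early = earliness([0, 1, 1, 1])                       # alarms inside the anomaly window
    print(cov, acc, early, toy_score(cov, acc, early))
```

For the published definition and weighting of the four components, the paper itself is the reference.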