A robust fleet-based anomaly detection framework applied to wind turbine vibration data

被引:8
作者
Leite, Gustavo de Novaes Pires [1 ,2 ,8 ]
Farias, Felipe Costa [7 ]
de Sa, Tiago Gomes [2 ]
da Costa, Alexandre Carlos Araujo [2 ]
Brennand, Leonardo Jose Petribu [2 ]
de Souza, Marrison Gabriel Guedes [3 ]
Villa, Alvaro Antonio Ochoa [1 ,4 ]
Droguett, Enrique Lopez [5 ,6 ]
机构
[1] IFPE Fed Inst Educ Sci & Technol Pernambuco, DACI CACTR, Campus Recife, Recife, Brazil
[2] Univ Fed Pernambuco, CER UFPE, Ctr Renewable Energy, Recife, Brazil
[3] NEOG New Energy Opt Geracao Energia, Recife, Brazil
[4] Univ Fed Pernambuco, Mech Engn Dept PPGEM, UFPE, Recife, Brazil
[5] Univ Calif Los Angeles, Garrick Inst Risk Sci, Los Angeles, CA 90095 USA
[6] Univ Calif Los Angeles, Dept Civil & Environm Engn, Los Angeles, CA 90095 USA
[7] IFPE Fed Inst Educ Sci & Technol Pernambuco, Campus Paulista, Recife, Brazil
[8] IFPE Fed Inst Educ Sci & Technol Pernambuco, DACI CACTR, Av Prof Luiz Freire 500,Campus Recife, BR-50740545 Recife, PE, Brazil
关键词
Machine learning methods; Fault Detection models; Unsupervised learning; Condition monitoring system; Cross-validation; Receiver operating characteristic curve; FAULT-DIAGNOSIS;
D O I
10.1016/j.engappai.2023.106859
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large amounts of unlabeled data are produced from wind turbine condition monitoring systems to catch their operational status. With this unmanageable amount of data, developing robust systems with good performance on unseen test data to detect incipient wind turbine faults is crucial to maximizing wind farm performance. This paper presents an implementation of a robust unsupervised machine-learning approach capable of executing fleet-based anomaly detection in wind turbines' critical components. The proposed methodology is applied to noisy, unlabeled, and unstructured vibration data, which must go through the databank decoding, data engineering, preprocessing, and feature extraction. Twelve operational wind turbines with varying health conditions are used to train, validate, and test the models. Features from different domains (time, frequency, and mechanical domain) are extracted and represented in the model's input. A labeling procedure from expert analysis regarding the condition of each wind turbine component through the evaluation of CMS output was carried out. Combining distinctive approaches to optimize eleven unsupervised machine learning algorithms through an unusual 5x2 cross-validation approach applied to real, noisy, and unstructured wind turbine data represents the paper's novelty. The methodology selected the six best models (k-nearest neighbors, clustering-based local outlier, histogram-based outlier, isolation forest, principal component analysis, and minimum covariance determinant) based on robust performance metrics such as accuracy, F1-score, precision, recall, and area under the ROC (Receiver Operating Characteristic Curve). These models generalized the problem well and returned reasonable classification metrics for such a complex problem, with values above 90% for the area under the ROC.
引用
收藏
页数:19
相关论文
共 62 条
  • [1] Al-taei R., 2019, HIGH PERFORM COMPUT, V891, P98
  • [2] A Review of Local Outlier Factor Algorithms for Outlier Detection in Big Data Streams
    Alghushairy, Omar
    Alsini, Raed
    Soule, Terence
    Ma, Xiaogang
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (01) : 1 - 24
  • [3] Ali M., 2020, PyCaret: An open source, low-code machine learning library in Python
  • [4] [Anonymous], 2001, 6 PAC AS C KNOWL DIS
  • [5] Wind turbine reliability: A comprehensive review towards effective condition monitoring development
    Artigao, Estefania
    Martin-Martinez, Sergio
    Honrubia-Escribano, Andres
    Gomez-Lazaro, Emilio
    [J]. APPLIED ENERGY, 2018, 228 : 1569 - 1583
  • [6] Convolutional neural network-based deep transfer learning for fault detection of gas turbine combustion chambers
    Bai, Mingliang
    Yang, Xusheng
    Liu, Jinfu
    Liu, Jiao
    Yu, Daren
    [J]. APPLIED ENERGY, 2021, 302
  • [7] Wind Fleet Generator Fault Detection via SCADA Alarms and Autoencoders
    Beretta, Mattia
    Jose Cardenas, Juan
    Koch, Cosmin
    Cusido, Jordi
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (23): : 1 - 15
  • [8] Review, analysis and synthesis of prognostic-based decision support methods for condition based maintenance
    Bousdekis, Alexandros
    Magoutas, Babis
    Apostolou, Dimitris
    Mentzas, Gregoris
    [J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2018, 29 (06) : 1303 - 1316
  • [9] An explainable artificial intelligence approach for unsupervised fault detection and diagnosis in rotating machinery
    Brito, Lucas C.
    Susto, Gian Antonio
    Brito, Jorge N.
    Duarte, Marcus A., V
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 163
  • [10] Principal component analysis
    Bro, Rasmus
    Smilde, Age K.
    [J]. ANALYTICAL METHODS, 2014, 6 (09) : 2812 - 2831