Highly imbalanced fault classification of wind turbines using data resampling and hybrid ensemble method approach

被引:16
|
作者
Chatterjee, Subhajit [1 ]
Byun, Yung-Cheol [2 ]
机构
[1] Jeju Natl Univ, Dept Comp Engn, Jeju 63243, South Korea
[2] Jeju Natl Univ, Inst Informat Sci & Technol, Dept Comp Engn, Major Elect Engn, Jeju 63243, South Korea
关键词
Wind turbines; Deep learning; Machine learning; Fault classification; Imbalanced data; Ensemble learning; NEURAL-NETWORK; DIAGNOSIS; RELIABILITY; IDENTIFICATION; COMPONENTS;
D O I
10.1016/j.engappai.2023.107104
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning-based incipient fault diagnostic techniques have achieved surprisingly well in wind turbines. Due to component failures, wind turbines must undergo active maintenance, substantially influencing revenue and power generation. Unfortunately, there are consistently uneven data distributions between samples with faults and those without faults, resulting in incorrect fault classification. Wind turbine fault classification has a significant data imbalance problem, compromising learning attention for majority and minority classes. Machine learning methodologies based on Generative Adversarial Networks (GAN), over-sampling, and under sampling techniques for generating synthetic data have been widely employed to address the imbalance data problem. However, the traditional synthetic minority oversampling technique (SMOTE) accomplishes oversampling using linear interpolation between close minority class samples, which could be confusing, subpar, and indistinguishable from the majority class. This study suggests combining over and under-sampling using adaptive SMOTE and edited nearest neighbors (ASMOTE-ENN) that incorporate over-sampling with adaptive SMOTE and under-sampling with ENN to improve the quality of the generated samples. With this resampling technique, noise in an imbalanced dataset is reduced on three levels by using an adaptive nearest neighbor selection algorithm to find the nearest neighbors that are visible. Then use SMOTE to create samples that precisely fall into the minority class, and later use the ENN technique to eliminate instances that contribute to noise afterwards. Resampling data created by combining over-and under-sampling approaches to match the data distribution over all classes is the foundation of the suggested method's efficacy. A hybrid ensemble method is used for effective classification, including boosting, bagging, and stacking techniques. The original unbalanced and balanced data using the ASMOTE-ENN algorithm were classified using the proposed hybrid ensemble method. The classification results show that the proposed strategy is more accurate than a few imbalanced fault diagnosis techniques.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Highly Imbalanced Classification of Gout Using Data Resampling and Ensemble Method
    Si, Xiaonan
    Wang, Lei
    Xu, Wenchang
    Wang, Biao
    Cheng, Wenbo
    ALGORITHMS, 2024, 17 (03)
  • [2] A Combination of Resampling and Ensemble Method for Text Classification on Imbalanced Data
    Feng, Haijun
    Qin, Wen
    Wang, Huijing
    Li, Yi
    Hu, Guangwu
    BIG DATA, BIGDATA 2021, 2022, 12988 : 3 - 16
  • [3] Imbalanced Data Classification Based on a Hybrid Resampling SVM Method
    Cao, Lu
    Zhai, Yikui
    IEEE 12TH INT CONF UBIQUITOUS INTELLIGENCE & COMP/IEEE 12TH INT CONF ADV & TRUSTED COMP/IEEE 15TH INT CONF SCALABLE COMP & COMMUN/IEEE INT CONF CLOUD & BIG DATA COMP/IEEE INT CONF INTERNET PEOPLE AND ASSOCIATED SYMPOSIA/WORKSHOPS, 2015, : 1533 - 1536
  • [4] Classifier Ensemble Design for Imbalanced Data Classification: A Hybrid Approach
    Salunkhe, Uma R.
    Mali, Suresh N.
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELLING AND SECURITY (CMS 2016), 2016, 85 : 725 - 732
  • [5] Ensemble Approach for the Classification of Imbalanced Data
    Nikulin, Vladimir
    McLachlan, Geoffrey J.
    Ng, Shu Kay
    AI 2009: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5866 : 291 - +
  • [6] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    Shi, Peibei
    Wang, Zhong
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2021, 34 (06) : 2250 - 2266
  • [7] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    SHI Peibei
    WANG Zhong
    Journal of Systems Science & Complexity, 2021, 34 (06) : 2250 - 2266
  • [8] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    Peibei Shi
    Zhong Wang
    Journal of Systems Science and Complexity, 2021, 34 : 2250 - 2266
  • [9] Wind Turbines Fault Classification Treatment Method
    Ren, Liying
    Yong, Bin
    SYMMETRY-BASEL, 2022, 14 (04):
  • [10] A Hybrid Approach for Fault Detection in Wind Turbines
    Francisco Manrique, Ruben
    Joaquin Parrado, Jose
    2014 9TH COMPUTING COLOMBIAN CONFERENCE (9CCC), 2014, : 222 - 227