Impact of Data Leakage in Vibration Signals Used for Bearing Fault Diagnosis

被引:0
作者
Wheat, Lesley [1 ,2 ]
Mohrenschildt, Martin V. [1 ]
Habibi, Saeid [2 ]
Al-Ani, Dhafar [3 ]
机构
[1] McMaster Univ, Dept Comp & Software, Hamilton, ON L8S 4L7, Canada
[2] McMaster Univ, CMHT, Dept Mech Engn, Hamilton, ON L8S 4L8, Canada
[3] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4L7, Canada
来源
IEEE ACCESS | 2024年 / 12卷
基金
加拿大自然科学与工程研究理事会;
关键词
Vibrations; Fault diagnosis; Training; Soft sensors; Rotating machines; Vibration measurement; Data models; Robustness; Time factors; Principal component analysis; Bearing; fault diagnosis; vibration analysis; domain shift; data leakage; machine learning; train-test split; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.1109/ACCESS.2024.3497716
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bearing fault diagnosis is a well-developed field and an active area of research in which the combination of model-free machine learning techniques with vibration data has become a popular approach. However, vibration data from rotating machines has the potential to contain domain shifts beyond the accepted causes in this research area (different part models, operating conditions and sensor locations) which can enable data leakage between training and test datasets. To demonstrate the impact of data leakage, six common bearing diagnosis methods are applied to two datasets using three data splitting methods to compare classification performance. Diagnosis is preformed using Principal Component Analysis (PCA), Supervised Principal Component Analysis (SPCA) and Linear Discriminant Analysis (LDA) in combination with frequency analysis and envelope analysis feature extraction methods. Datasets from McMaster University and Paderborn University are used as experimental data sources, and produce vastly differing results (over a 40% drop in accuracy) depending on the selected dataset splitting method, revealing a previously unknown domain shift. Despite great results for diagnosis methods using frequency response analysis on the data from McMaster, these results are not expected to generalize due to possible data leakage. Out of fifty-five previous works using the Paderborn dataset, ten are identified as likely to be affected and only six properly address the problem. Recommendations are given for future experiment design, model creation and model evaluation.
引用
收藏
页码:169879 / 169895
页数:17
相关论文
共 50 条
  • [41] Convolutional Neural Network-Based Transformer Fault Diagnosis Using Vibration Signals
    Li, Chao
    Chen, Jie
    Yang, Cheng
    Yang, Jingjian
    Liu, Zhigang
    Davari, Pooya
    SENSORS, 2023, 23 (10)
  • [42] A new hybrid deep signal processing approach for bearing fault diagnosis using vibration signals
    He, Miao
    He, David
    NEUROCOMPUTING, 2020, 396 : 542 - 555
  • [43] OSESgram: Data-Aided Method for Selection of Informative Frequency Bands for Bearing Fault Diagnosis
    Hou, Bingchang
    Chen, Yikai
    Wang, Hong
    Peng, Zhike
    Tsui, Kwok-Leung
    Wang, Dong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [44] Contextual Knowledge-Informed Deep Domain Generalization for Bearing Fault Diagnosis
    Lundstrom, Adam
    O'Nils, Mattias
    Qureshi, Faisal Z.
    IEEE ACCESS, 2024, 12 : 196842 - 196854
  • [45] Bearing Fault Diagnosis Based on Adaptive Convolutional Neural Network With Nesterov Momentum
    Gao, Shuzhi
    Pei, Zhiming
    Zhang, Yimin
    Li, Tianchi
    IEEE SENSORS JOURNAL, 2021, 21 (07) : 9268 - 9276
  • [46] A New Approach of Preprocessing with SVM Optimization Based on PSO for Bearing Fault Diagnosis
    Thelaidjia, T.
    Chenikher, S.
    2013 13TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2013, : 319 - 324
  • [47] A model of binaural auditory nerve oscillator network for bearing fault diagnosis by integrating two-channel vibration signals
    Xiaoxin Liu
    Yungong Li
    Minghao Sun
    Sun Zhongqiu
    Jingye Zang
    Nonlinear Dynamics, 2023, 111 : 4779 - 4805
  • [48] Early-Stage Fault Diagnosis of Motor Bearing Based on Kurtosis Weighting and Fusion of Current-Vibration Signals
    Zhang, Bingye
    Li, Haibo
    Kong, Weiyi
    Fu, Minjie
    Ma, Jien
    SENSORS, 2024, 24 (11)
  • [49] Invariant Feature Purification Method for Domain Generalization of Rolling Bearing Fault Diagnosis
    Xie, Yining
    Yang, Guojun
    Chen, Hongzhan
    Zhao, Zhichao
    Leng, Xin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [50] A model of binaural auditory nerve oscillator network for bearing fault diagnosis by integrating two-channel vibration signals
    Liu, Xiaoxin
    Li, Yungong
    Sun, Minghao
    Zhongqiu, Sun
    Zang, Jingye
    NONLINEAR DYNAMICS, 2023, 111 (05) : 4779 - 4805