Impact of Data Leakage in Vibration Signals Used for Bearing Fault Diagnosis

被引:0
|
作者
Wheat, Lesley [1 ,2 ]
Mohrenschildt, Martin V. [1 ]
Habibi, Saeid [2 ]
Al-Ani, Dhafar [3 ]
机构
[1] McMaster Univ, Dept Comp & Software, Hamilton, ON L8S 4L7, Canada
[2] McMaster Univ, CMHT, Dept Mech Engn, Hamilton, ON L8S 4L8, Canada
[3] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4L7, Canada
来源
IEEE ACCESS | 2024年 / 12卷
基金
加拿大自然科学与工程研究理事会;
关键词
Vibrations; Fault diagnosis; Training; Soft sensors; Rotating machines; Vibration measurement; Data models; Robustness; Time factors; Principal component analysis; Bearing; fault diagnosis; vibration analysis; domain shift; data leakage; machine learning; train-test split; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.1109/ACCESS.2024.3497716
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bearing fault diagnosis is a well-developed field and an active area of research in which the combination of model-free machine learning techniques with vibration data has become a popular approach. However, vibration data from rotating machines has the potential to contain domain shifts beyond the accepted causes in this research area (different part models, operating conditions and sensor locations) which can enable data leakage between training and test datasets. To demonstrate the impact of data leakage, six common bearing diagnosis methods are applied to two datasets using three data splitting methods to compare classification performance. Diagnosis is preformed using Principal Component Analysis (PCA), Supervised Principal Component Analysis (SPCA) and Linear Discriminant Analysis (LDA) in combination with frequency analysis and envelope analysis feature extraction methods. Datasets from McMaster University and Paderborn University are used as experimental data sources, and produce vastly differing results (over a 40% drop in accuracy) depending on the selected dataset splitting method, revealing a previously unknown domain shift. Despite great results for diagnosis methods using frequency response analysis on the data from McMaster, these results are not expected to generalize due to possible data leakage. Out of fifty-five previous works using the Paderborn dataset, ten are identified as likely to be affected and only six properly address the problem. Recommendations are given for future experiment design, model creation and model evaluation.
引用
收藏
页码:169879 / 169895
页数:17
相关论文
共 50 条
  • [11] Information Fusion of the Vibration and Acoustic Signals Based Rolling Bearing Incipient Fault Diagnosis Method
    Ming, Tingfeng
    Zhang, Yongxiang
    Li, Jing
    ADVANCED TECHNOLOGIES IN MANUFACTURING, ENGINEERING AND MATERIALS, PTS 1-3, 2013, 774-776 : 1499 - 1502
  • [12] Fault Feature Extractor Based on Bootstrap Your Own Latent and Data Augmentation Algorithm for Unlabeled Vibration Signals
    Peng, Tengyi
    Shen, Changqing
    Sun, Shilong
    Wang, Dong
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (09) : 9547 - 9555
  • [13] Numerical and experimental analysis of vibratory signals for rolling bearing fault diagnosis
    Bensana, T.
    Mekhilef, S.
    MECHANIKA, 2016, (03): : 217 - 224
  • [14] A Novel Approach for Intelligent Fault Diagnosis in Bearing With Imbalanced Data Based on Cycle-Consistent GAN
    Liao, Wenjie
    Wu, Like
    Xu, Shihui
    Fujimura, Shigeru
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [15] Theoretical and experimental analysis of bispectrum of vibration signals for fault diagnosis of gears
    Shen Guoji
    McLaughlin, Stephen
    Xu Yongcheng
    White, Paul
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2014, 43 (1-2) : 76 - 89
  • [16] A Novel Dual-Domain Adversarial Method for Vibration Signal Denoising in Bearing Fault Diagnosis
    Han, Guangjie
    Shen, Junhao
    Wang, Zhen
    Zhu, Yuanyang
    Xie, Yuhang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [17] Bearing Fault Diagnosis Using Multidomain Fusion-Based Vibration Imaging and Multitask Learning
    Hasan, Md Junayed
    Islam, M. M. Manjurul
    Kim, Jong-Myon
    SENSORS, 2022, 22 (01)
  • [18] DESIGN OF A DATA ACQUISITION SYSTEM TO BE USED IN FAULT DIAGNOSIS
    Bacha, Abdelkabir
    Sabry, Ahmed Haroun
    Benhra, Jamal
    PROCEEDINGS OF 2015 THIRD IEEE WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS), 2015,
  • [19] A Novel Cross-Domain Data Augmentation and Bearing Fault Diagnosis Method Based on an Enhanced Generative Model
    Sun, Shilong
    Ding, Hao
    Huang, Haodong
    Zhao, Zida
    Wang, Dong
    Xu, Wenfu
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 9
  • [20] A Comprehensive Investigation of Fault Signatures and Spectrum Analysis of Vibration Signals in Distributed Bearing Faults
    Afshar, Mojtaba
    Heydarzadeh, Mehrdad
    Akin, Bilal
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2025, 61 (01) : 515 - 526