Impact of Data Leakage in Vibration Signals Used for Bearing Fault Diagnosis

被引:0
|
作者
Wheat, Lesley [1 ,2 ]
Mohrenschildt, Martin V. [1 ]
Habibi, Saeid [2 ]
Al-Ani, Dhafar [3 ]
机构
[1] McMaster Univ, Dept Comp & Software, Hamilton, ON L8S 4L7, Canada
[2] McMaster Univ, CMHT, Dept Mech Engn, Hamilton, ON L8S 4L8, Canada
[3] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4L7, Canada
来源
IEEE ACCESS | 2024年 / 12卷
基金
加拿大自然科学与工程研究理事会;
关键词
Vibrations; Fault diagnosis; Training; Soft sensors; Rotating machines; Vibration measurement; Data models; Robustness; Time factors; Principal component analysis; Bearing; fault diagnosis; vibration analysis; domain shift; data leakage; machine learning; train-test split; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.1109/ACCESS.2024.3497716
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bearing fault diagnosis is a well-developed field and an active area of research in which the combination of model-free machine learning techniques with vibration data has become a popular approach. However, vibration data from rotating machines has the potential to contain domain shifts beyond the accepted causes in this research area (different part models, operating conditions and sensor locations) which can enable data leakage between training and test datasets. To demonstrate the impact of data leakage, six common bearing diagnosis methods are applied to two datasets using three data splitting methods to compare classification performance. Diagnosis is preformed using Principal Component Analysis (PCA), Supervised Principal Component Analysis (SPCA) and Linear Discriminant Analysis (LDA) in combination with frequency analysis and envelope analysis feature extraction methods. Datasets from McMaster University and Paderborn University are used as experimental data sources, and produce vastly differing results (over a 40% drop in accuracy) depending on the selected dataset splitting method, revealing a previously unknown domain shift. Despite great results for diagnosis methods using frequency response analysis on the data from McMaster, these results are not expected to generalize due to possible data leakage. Out of fifty-five previous works using the Paderborn dataset, ten are identified as likely to be affected and only six properly address the problem. Recommendations are given for future experiment design, model creation and model evaluation.
引用
收藏
页码:169879 / 169895
页数:17
相关论文
共 50 条
  • [21] Multilevel feature fusion of multi-domain vibration signals for bearing fault diagnosis
    Hui Li
    Daichao Wang
    Signal, Image and Video Processing, 2024, 18 : 99 - 108
  • [22] Multilevel feature fusion of multi-domain vibration signals for bearing fault diagnosis
    Li, Hui
    Wang, Daichao
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 99 - 108
  • [23] MEMS Approach for Rolling Bearing Fault Diagnosis Using Vibration Signal Analysis
    Sharma, Gagandeep
    Kaur, Tejbir
    Mangal, Sanjay Kumar
    Dhiman, Nishant Kumar
    Jat, Gopal Lal
    JOURNAL OF VIBRATION ENGINEERING & TECHNOLOGIES, 2025, 13 (01)
  • [24] Factory-Based Vibration Data for Bearing-Fault Detection
    Lundstrom, Adam
    O'Nils, Mattias
    DATA, 2023, 8 (07)
  • [25] Bearing Vibration Signal Fault Diagnosis Based on LSTM-Cascade CatBoost
    Yang, Miaomiao
    Liu, Weizhi
    Zhang, Wenxuan
    Wang, Mei
    Fang, Xia
    JOURNAL OF INTERNET TECHNOLOGY, 2022, 23 (05): : 1155 - 1161
  • [26] A New Statistical Features Based Approach for Bearing Fault Diagnosis Using Vibration Signals
    Altaf, Muhammad
    Akram, Tallha
    Khan, Muhammad Attique
    Iqbal, Muhammad
    Ch, M. Munawwar Iqbal
    Hsu, Ching-Hsien
    SENSORS, 2022, 22 (05)
  • [27] Bearing fault diagnosis method under unbalanced data distribution
    Cao J.
    He Z.-D.
    Yu P.
    Wang J.-H.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (11): : 2523 - 2531
  • [28] Fault diagnosis of the rolling bearing with optical Fiber Bragg Grating vibration sensor
    Wei Peng
    Dai Zejing
    Zheng Leilei
    Li Ming
    OPTICAL MEASUREMENT TECHNOLOGY AND INSTRUMENTATION, 2016, 10155
  • [29] Bearing fault diagnosis from raw vibration signals using multi-layer extreme learning machine
    Zhao Guangquan
    Wu Kankan
    Gao Yongcheng
    Liu Yongmei
    Hu Cong
    PROCEEDINGS OF 2019 14TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS (ICEMI), 2019, : 1287 - 1293
  • [30] A Novel Deep Learning System with Data Augmentation for Machine Fault Diagnosis from Vibration Signals
    Fu, Qiang
    Wang, Huawei
    APPLIED SCIENCES-BASEL, 2020, 10 (17):